whest

NumPy-compatible math primitives with analytical FLOP counting

Python 3.10+ · MIT License

Built for the ARC Whitebox Estimation Challenge by AIcrowd


whest is a drop-in replacement for a subset of NumPy that counts floating-point operations as you compute. Algorithms submitted to the ARC Whitebox Estimation Challenge are scored by their analytical FLOP cost, not wall-clock time, so researchers can focus on algorithmic innovation rather than hardware tuning. Every arithmetic call deducts from a fixed budget; exceed it and execution stops immediately.

Why whest?

NumPy:
import numpy as np

depth, width = 5, 256

# Weight init
scale = np.sqrt(2 / width)
weights = [
    np.random.randn(width, width) * scale
    for _ in range(depth)
]

# Forward pass
x = np.random.randn(width)
h = x
for i, W in enumerate(weights):
    h = np.einsum('ij,j->i', W, h)
    if i < depth - 1:
        h = np.maximum(h, 0)
# Total FLOPs? No idea.
whest:

import whest as we

depth, width = 5, 256

# Weight init
scale = we.sqrt(2 / width)
weights = [
    we.random.randn(width, width) * scale
    for _ in range(depth)
]

# Forward pass
x = we.random.randn(width)
h = x
for i, W in enumerate(weights):
    h = we.einsum('ij,j->i', W, h)
    if i < depth - 1:
        h = we.maximum(h, 0)
we.budget_summary()  # 984,321 FLOPs

Key Features

  • NumPy-compatible API -- import whest as we and write familiar NumPy code
  • Analytical FLOP counting -- deterministic, hardware-independent cost tracking
  • Budget enforcement -- operations are checked before execution; exceeding the budget raises a clear error
  • Symmetry-aware einsum -- automatic FLOP savings for repeated operands and declared symmetry groups
  • Transparent diagnostics -- inspect per-operation costs, cumulative budget usage, and detailed summaries at any time
  • Truncated SVD -- top-k singular value decomposition with O(m * n * k) cost

What's Supported

Module Operations Cost Model Status
Core (we.*) 339 Varies by category (unary, binary, reduction, free) Supported
we.linalg 31 Per-operation formulas Supported
we.fft 18 5n * ceil(log2(n)) for transforms Supported
we.random 51 numel(output) per sample; shuffle: n*ceil(log2(n)) Supported
we.stats 24 Per-distribution CDF/PDF/PPF formulas Supported
we.polynomial 10 Per-operation formulas Supported
Total: 473 supported, 35 blocked
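The per-module cost formulas in the table above can be reproduced with plain shape arithmetic. The sketch below is illustrative only: the function names are not part of the whest API, they just restate the documented formulas in plain Python.

```python
import math

def fft_cost(n: int) -> int:
    # we.fft transforms: 5n * ceil(log2(n)), per the table above
    return 5 * n * math.ceil(math.log2(n))

def random_cost(shape: tuple) -> int:
    # we.random sampling: numel(output), i.e. one FLOP per element drawn
    return math.prod(shape)

def shuffle_cost(n: int) -> int:
    # we.random.shuffle: n * ceil(log2(n))
    return n * math.ceil(math.log2(n))

print(fft_cost(1024))           # 51200  (5 * 1024 * 10)
print(random_cost((256, 256)))  # 65536
print(shuffle_cost(1000))       # 10000  (1000 * ceil(9.97))
```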

Blocked operations (I/O, config, and system calls) raise a helpful AttributeError with a link to the docs.

Quick Start

Installation

pip install git+https://github.com/AIcrowd/whest.git
# or
uv add git+https://github.com/AIcrowd/whest.git

Basic Usage

import whest as we

depth, width = 5, 256

with we.BudgetContext(flop_budget=10**8) as budget:
    # Weight init
    scale = we.sqrt(2 / width)
    weights = [we.random.randn(width, width) * scale
               for _ in range(depth)]

    # Forward pass
    x = we.random.randn(width)
    h = x
    for i, W in enumerate(weights):
        h = we.einsum('ij,j->i', W, h)
        if i < depth - 1:
            h = we.maximum(h, 0)

    print(budget.summary())
whest FLOP Budget Summary
=========================
  Total budget:     100,000,000
  Used:                 984,321  (1.0%)
  Remaining:         99,015,679  (99.0%)

  By operation:
    random.randn          327,936  ( 33.3%)  [6 calls]
    multiply              327,680  ( 33.3%)  [5 calls]
    einsum                327,680  ( 33.3%)  [5 calls]
    maximum                 1,024  (  0.1%)  [4 calls]
    sqrt                        1  (  0.0%)  [1 call]
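The line items in the summary above follow directly from the tensor shapes in the example. This sketch reconstructs them with plain arithmetic, assuming one FLOP per output element for sampling and elementwise ops and m * k for an (m, k) @ (k,) matvec, which matches the reported totals:

```python
depth, width = 5, 256

randn = depth * width * width + width  # 5 weight matrices + 1 input vector
multiply = depth * width * width       # scaling each weight matrix by `scale`
einsum = depth * width * width         # 5 matvecs of (256, 256) @ (256,)
maximum = (depth - 1) * width          # 4 ReLU layers
sqrt = 1                               # scalar sqrt for `scale`

total = randn + multiply + einsum + maximum + sqrt
print(f"{total:,}")  # 984,321 -- matches budget.summary()
```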

Plan Your Budget Before Executing

# Query FLOP costs without running anything (no BudgetContext needed)
cost = we.flops.einsum_cost('ij,jk->ik', shapes=[(256, 256), (256, 256)])
print(f"Matmul cost: {cost:,}")  # 16,777,216

cost = we.flops.svd_cost(m=256, n=256, k=10)
print(f"SVD cost: {cost:,}")     # 655,360

Symmetry Savings

When you pass the same array object multiple times, whest automatically detects the symmetry and reduces the FLOP count:

with we.BudgetContext(flop_budget=10**8) as budget:
    X = we.ones((100, 100))

    # Gram matrix: both operands are the same X.
    # whest auto-detects this and induces S2{j,k} on the output,
    # giving ~1/2 the dense cost (since R[j,k] = R[k,j]).
    R = we.einsum("ij,ik->jk", X, X)
    print(f"Cost with equal-operand detection: {budget.flops_used:,}")

This works for any einsum where the same Python object appears at multiple operand positions. See the exploit-symmetry guide for more examples including triple products and block symmetries.
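The "~1/2 the dense cost" claim can be checked by counting output entries. The arithmetic below is illustrative, not whest's internal cost model: it assumes the symmetric path computes only the upper triangle (including the diagonal) of R, at n FLOPs per entry.

```python
# R = einsum("ij,ik->jk", X, X) with X of shape (n, m)
n, m = 100, 100

dense_cost = n * m * m                  # every (j, k) output pair
symmetric_cost = n * m * (m + 1) // 2   # upper triangle only, since R[j,k] == R[k,j]

print(dense_cost)                    # 1000000
print(symmetric_cost)                # 505000
print(symmetric_cost / dense_cost)   # 0.505 -- roughly half
```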

How It Works

  1. FLOPs are tracked automatically. A global default budget activates on first use, or you can wrap code in an explicit BudgetContext for a custom limit. Free ops (tensor creation, reshaping) cost 0 FLOPs.
  2. FLOP costs are analytical. Costs are computed from tensor shapes, not measured from execution. A matmul of (m, k) @ (k, n) always costs m * k * n FLOPs regardless of hardware.
  3. Budget is checked before execution. If an operation would exceed the remaining budget, BudgetExhaustedError is raised and the operation does not run.
  4. All tensors are plain numpy.ndarray. Standard whest arrays are regular NumPy arrays with no hidden state. SymmetricTensor is a lightweight ndarray subclass that carries symmetry metadata for einsum savings — it works everywhere a normal array does.
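Steps 2-4 can be modeled in a few lines of plain Python. This is a toy sketch of the check-before-execute discipline, not whest's implementation; the class and exception names here are stand-ins.

```python
import numpy as np

class BudgetExhausted(RuntimeError):
    """Stand-in for whest's BudgetExhaustedError (name illustrative)."""

class FlopBudget:
    def __init__(self, limit: int):
        self.limit, self.used = limit, 0

    def charge(self, cost: int) -> None:
        # Step 3: check BEFORE executing; on failure the op never runs.
        if self.used + cost > self.limit:
            raise BudgetExhausted(f"{cost} FLOPs requested, "
                                  f"{self.limit - self.used} remaining")
        self.used += cost

def matmul(a, b, budget):
    (m, k), (_, n) = a.shape, b.shape
    budget.charge(m * k * n)  # Step 2: analytical cost from shapes only
    return a @ b              # Step 4: result is a plain numpy.ndarray

budget = FlopBudget(limit=10**6)
matmul(np.ones((64, 64)), np.ones((64, 64)), budget)
print(budget.used)  # 262144 == 64 * 64 * 64
```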

Sharp Edges

Budget is always active. A global default budget (1e15 FLOPs, configurable via WHEST_DEFAULT_BUDGET env var) activates automatically. Use an explicit BudgetContext to set a custom limit.

35 operations are blocked. I/O, config, and system-level functions (save, load, set_printoptions, etc.) raise AttributeError by design. These have no meaningful FLOP cost and are not part of the competition API.

sort, argsort, trace, and random sampling all have analytical FLOP costs based on their algorithmic complexity.

Nested explicit BudgetContexts are not allowed. Opening an explicit BudgetContext while another explicit BudgetContext is already active raises RuntimeError. However, opening an explicit context while only the global default is active is fine — the explicit context temporarily replaces the default.

Cost is analytical, not wall-clock. Two operations with the same shapes always report the same FLOP cost, regardless of data values, cache effects, or hardware. This is intentional -- it makes scores reproducible across machines.

SymmetricTensor propagation rules. Symmetry metadata (used by einsum for FLOP savings) propagates through reshaping, slicing, and unary pointwise operations (e.g., exp, log). Binary pointwise operations (e.g., add, multiply) intersect the symmetry groups of both operands. Reductions may drop symmetry on the reduced axis. If the result doesn't carry the symmetry you expect, declare symmetry groups explicitly with we.as_symmetric().

Documentation

  • Getting Started
  • How-To Guides
  • Concepts
  • Development
  • API Reference
  • For AI Agents:

Development

git clone https://github.com/AIcrowd/whest.git
cd whest
make install
make test                      # core test suite
make docs-serve                # local docs at http://127.0.0.1:8000

For the monorepo layout, client/server workflows, and generated-doc rules, see Contributor Guide.

Citation

@misc{whest2026,
  title  = {whest: NumPy-compatible math primitives with FLOP counting},
  author = {AIcrowd},
  year   = {2026},
  url    = {https://github.com/AIcrowd/whest},
  note   = {Built for the ARC Whitebox Estimation Challenge}
}

License

MIT
