The Frontier Index
The state of the AI frontier, in one place
16 frontier and open-weight models across 10 providers — the cheapest to run, the biggest context windows, the newest arrivals, and how far open weights now undercut proprietary APIs. Every figure derives from one hand-verified, versioned ledger, last verified 21 Jun 2026.
Cheapest input
GPT-5.4 nano
$0.20 per 1M in · OpenAI
Cheapest to run
Llama 4 Maverick
$0.38 / 1M · 3:1 blend
Longest context
GPT-5.5
1.05M tokens · OpenAI
Newest arrival
Claude Fable 5
released 9 Jun 2026
Open weights
5 of 16
self-hostable models tracked
Providers
10
16 models, one verified ledger
The number that matters
At the ~1M-token frontier, the cheapest open-weights model — Llama 4 Maverick — runs about 4.1× cheaper than the cheapest proprietary one, Grok 4.3.
$0.38 vs $1.56 per 1M on a blended $/1M at a 3:1 input:output mix. Across all 16 models, blended list price spans $0.38 to $20 per 1M — a 53× range. The trade-off: open weights mean you run the infrastructure, and the top proprietary models still lead on some frontier reasoning.
Figures are base list rates for prompts up to ~200K tokens; some models add a long-context surcharge above that, and open-weights API rates are representative third-party-hosted prices — self-hosting is free.
Pick by requirement
Set your real constraints — context window, open weights, multimodal input — and see the cheapest model in the ledger that clears them.
Cheapest that qualifies(16 models match)
- Llama 4 MaverickcheapestMeta1M$0.38/1M($0.22 in · $0.85 out)
- GPT-5.4 nanoOpenAI400K$0.46/1M($0.20 in · $1.25 out)
- GLM-5Zhipu / Z.ai200K$0.93/1M($0.60 in · $1.92 out)
Ranked by blended $/1M at a 3:1 input:output mix. Prices are base list rates (up to ~200K tokens); open-weights API figures are representative hosted rates (self-hosting is free).
Cheapest to run
All 16 models, filterable →Ranked by blended $/1M at a 3:1 input:output mix. The blend weights input over output because assistant and agent workloads re-send a lot of context.
| # | Model | Blended $/1M | In · Out | Context |
|---|---|---|---|---|
| 1 | Llama 4 MaverickMeta · open | $0.38* | $0.22 · $0.85 | 1M |
| 2 | GPT-5.4 nanoOpenAI | $0.46 | $0.20 · $1.25 | 400K |
| 3 | GLM-5Zhipu / Z.ai · open | $0.93 | $0.60 · $1.92 | 200K |
| 4 | Kimi K2 ThinkingMoonshot AI · open | $1.07 | $0.60 · $2.50 | 256K |
| 5 | Grok 4.3xAI | $1.56* | $1.25 · $2.50 | 1M |
| 6 | GPT-5.4 miniOpenAI | $1.69 | $0.75 · $4.50 | 400K |
Blended figures are the base list rate for prompts up to ~200K tokens. * = a long-context surcharge applies above that (hover for the tier). Open-weights API rates are representative third-party-hosted prices; self-hosting is free.
Frequently asked
What's the cheapest AI model right now?
By raw input price, GPT-5.4 nano (OpenAI) is lowest at $0.20 per 1M input tokens. But real workloads pay for output too — the cheapest to actually run, on a blended $/1M at a 3:1 input:output mix, is Llama 4 Maverick at $0.38 per 1M. Open-weights models can also be self-hosted for the cost of your own hardware.
Which model has the largest context window?
GPT-5.5 leads at 1.05M tokens. It's not alone at the top: 10 of the 16 models tracked have a 1M-class (~1,000K-token) window, so very-long-context work is no longer a single-vendor feature.
Are open-weights models really cheaper than proprietary ones?
At the ~1M-token frontier, yes: the cheapest open-weights model, Llama 4 Maverick, runs about 4.1× cheaper than the cheapest proprietary one, Grok 4.3, on a blended $/1M at a 3:1 input:output mix ($0.38 vs $1.56 per 1M). Open weights also self-host for hardware cost alone — the trade-off is that you run the infrastructure and the top proprietary models still lead on some frontier reasoning tasks.
How current is this, and how do you verify it?
Every figure is hand-verified against each provider's own pricing and cross-checked for contradictions — last fully verified 21 Jun 2026. The catalog is a single, versioned source of truth (version 2026-07-04) that every BitByteCore tool reads, so a price can't drift between pages. Model pricing moves fast, so confirm with the provider before you rely on a figure.
Can I use or cite this data?
Yes. The full catalog is published as machine-readable JSON at https://bitbytecore.com/data/frontier.json and as schema.org Dataset markup on this page. Quote or cite it with clear attribution and a link back.
How this stays honest
One versioned catalog (v2026-07-04, verified 21 Jun 2026) is the single source every BitByteCore model tool reads — so a price can't drift between pages. It's published openly for anyone to cite.
Put the numbers to work
- CalculatorAI API Cost CalculatorPlug in your own usage → monthly spend across every model, ranked cheapest-first.Open
- CounterToken CounterPaste text → exact token count, context-fit, and the cost to send it. Runs in your browser.Open
- CalculatorContext Window CalculatorWill your document fit? See which models can hold it in one window, and the cost.Open
- PickerWhich Model Should I Use?Answer a few questions → a top pick plus two alternatives, with the reasoning.Open