Question 1

Which 2026 model has the cheapest API pricing?

Accepted Answer

Among frontier-class options, third-party-hosted open-weights models are cheapest: Llama 4 Maverick runs about $0.22 input / $0.85 output per 1M tokens. Among major proprietary APIs, OpenAI's GPT-5.4 nano ($0.20 / $1.25) and Google's smaller Flash tiers are the lowest; GPT-5.4 nano slightly undercuts Maverick on input but costs more on output. DeepSeek V4 is unusually cheap for its capability and is often discounted well below its $1.74 / $3.48 list rate.
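Because input and output are priced separately, which model is "cheapest" depends on the token mix of the workload. The following is a minimal sketch that compares list-price cost per request. The rates are copied from the comparison table; the function names and the example token counts are hypothetical, and footnoted long-context rates and provider discounts are ignored.

```python
# List prices in USD per 1M tokens (input, output), copied from the table.
# Footnoted long-context rates and provider discounts are NOT modeled here.
PRICES = {
    "Llama 4 Maverick": (0.22, 0.85),
    "GPT-5.4 nano": (0.20, 1.25),
    "DeepSeek V4 (Pro)": (1.74, 3.48),
}


def request_cost(model, input_tokens, output_tokens):
    """Return the list-price cost in USD of a single request."""
    in_rate, out_rate = PRICES[model]
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


def cheapest(input_tokens, output_tokens):
    """Return the name of the model with the lowest cost for this token mix."""
    return min(PRICES, key=lambda m: request_cost(m, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workloads: a 10:1 and a 100:1 input-to-output ratio.
    for tokens in [(10_000, 1_000), (100_000, 1_000)]:
        print(tokens, cheapest(*tokens))
```

At these list rates, Maverick and GPT-5.4 nano break even at an input-to-output ratio of 20:1 (0.02 × input = 0.40 × output), so nano only comes out ahead on heavily input-dominated requests.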

Question 2

Which model has the largest context window?

Accepted Answer

Google's Gemini 3.1 Pro leads the major APIs with a 1M+ token window (up to ~2M reported), and Llama 4 Scout, Meta's sibling to Llama 4 Maverick, advertises up to 10M. Among the flagships listed here, Gemini 3.1 Pro, GPT-5.5 (~1.05M), Claude Opus 4.8 / Sonnet 4.6 / Fable 5 (1M), Grok 4.3 (1M), and DeepSeek V4 (1M) all reach roughly 1M tokens.

Question 3

What are the best open-weights models in mid-2026?

Accepted Answer

The strongest open-weights options are DeepSeek V4 (Apache-2.0), Meta's Llama 4 Maverick, Moonshot's Kimi K2 Thinking, Zhipu's MIT-licensed GLM-5, and Mistral Large 3 (Apache-2.0). Several of them rival proprietary frontier models on coding and agentic tasks, and all can be self-hosted or run through third-party providers.

Question 4

Why are some prices shown with an asterisk and a footnote?

Accepted Answer

Several flagships use context-tiered pricing: the listed figure is the base rate for shorter prompts (typically up to ~200K tokens), and the footnote shows the higher long-context rate. For example, Gemini 3.1 Pro jumps from $2 / $12 to $4 / $18 above 200K tokens, and GPT-5.5 charges 2× input / 1.5× output beyond 272K tokens. Claude Opus 4.8, by contrast, has no long-context premium.
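The effect of a long-context tier on a single request can be sketched as follows. The rates and thresholds come from the answer above; the `TieredPrice` class and `tiered_cost` function are hypothetical names. One assumption matters a great deal: this sketch charges the higher rate on the whole request once the prompt exceeds the threshold, rather than only on the tokens above it. Check each provider's pricing page before relying on that.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class TieredPrice:
    in_rate: float                        # USD per 1M input tokens, base tier
    out_rate: float                       # USD per 1M output tokens, base tier
    threshold: Optional[int] = None       # prompt size above which the long tier applies
    long_in_rate: Optional[float] = None  # USD per 1M input tokens, long tier
    long_out_rate: Optional[float] = None


# Figures taken from the answer above.
GEMINI_3_1_PRO = TieredPrice(2.0, 12.0, 200_000, 4.0, 18.0)
GPT_5_5 = TieredPrice(5.0, 30.0, 272_000, 5.0 * 2, 30.0 * 1.5)
CLAUDE_OPUS_4_8 = TieredPrice(5.0, 25.0)  # no long-context premium


def tiered_cost(price, input_tokens, output_tokens):
    """Return the cost in USD of one request.

    ASSUMPTION: when the prompt exceeds the threshold, the long-context
    rate applies to every token in the request, not just the excess.
    """
    is_long = price.threshold is not None and input_tokens > price.threshold
    in_rate = price.long_in_rate if is_long else price.in_rate
    out_rate = price.long_out_rate if is_long else price.out_rate
    return (input_tokens * in_rate + output_tokens * out_rate) / 1_000_000


if __name__ == "__main__":
    for name, price in [("Gemini 3.1 Pro", GEMINI_3_1_PRO),
                        ("GPT-5.5", GPT_5_5),
                        ("Claude Opus 4.8", CLAUDE_OPUS_4_8)]:
        print(name, round(tiered_cost(price, 300_000, 10_000), 2))
```

Under that assumption, a 300K-token prompt with 10K output tokens costs about $1.38 on Gemini 3.1 Pro, $3.45 on GPT-5.5, and $1.75 on Claude Opus 4.8, so the ranking by headline rate can shift once a prompt crosses a tier boundary.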

Question 5

Is Grok 5 available yet?

Accepted Answer

Not as a generally available API as of June 2026. Grok 5 has been widely discussed (rumored ~6T params, 1.5M context) but xAI had not published an official release or pricing at the time of writing, so this table uses Grok 4.3, the current GA flagship.

Model	Vendor	Released	Description	Context	In $/1M	Out $/1M	Weights	Modality
Qwen 3.7 Max	Alibaba	2026-05	Agent-first flagship; strong tool-use + multilingual (esp. Chinese) reasoning	1M	$2.5*	$7.5	Proprietary	Multimodal
Claude Fable 5	Anthropic	2026-06-09	Anthropic's most capable widely-released model; top-end reasoning + agentic work	1M	$10*	$50	Proprietary	Text + image in
Claude Haiku 4.5	Anthropic	2025-10	Fastest Claude with near-frontier intelligence at low cost	200K	$1	$5	Proprietary	Text + image in
Claude Opus 4.8	Anthropic	2026	Most capable Opus-tier model for complex reasoning + long-horizon agentic coding	1M	$5	$25	Proprietary	Text + image in
Claude Sonnet 4.6	Anthropic	2026	Best balance of speed and intelligence; strong agentic + coding workhorse	1M	$3*	$15	Proprietary	Text + image in
DeepSeek V4 (Pro)	DeepSeek	2026-04-24	Open-weights MoE (~32–37B active) frontier value; sparse attention	1M	$1.74*	$3.48	Open	Multimodal
Gemini 3.1 Pro	Google	2026-02-19	Frontier long-context multimodal reasoning; largest practical context in class	1M	$2*	$12	Proprietary	Multimodal (text/image/audio/video)
Gemini 3.5 Flash	Google	2026-05-20	Flagship-class fast model; 280+ tok/s, default in Gemini app + Search AI Mode	1M	$1.5	$9	Proprietary	Multimodal (text/image/audio/video)
Llama 4 Maverick	Meta	2025-04-05	Open-weights MoE; cheapest frontier-ish option, huge ecosystem + fine-tunability	1M	$0.22*	$0.85	Open	Multimodal (text + image)
Mistral Large 3	Mistral AI	2025-12-01	Open Apache-2.0 European flagship; self-hostable, strong multilingual + coding	256K	$2	$6	Open	Text + image
Kimi K2 Thinking	Moonshot AI	2025-11	Open 1T-param MoE (32B active) built for long-horizon agentic reasoning	256K	$0.6	$2.5	Open	Text (agentic)
GPT-5.4 mini	OpenAI	2026	Cost-efficient mid-tier for high-volume tasks with solid reasoning	400K	$0.75	$4.5	Proprietary	Text + image in
GPT-5.4 nano	OpenAI	2026	Cheapest OpenAI tier for simple, latency-sensitive, high-volume calls	400K	$0.2	$1.25	Proprietary	Text + image in
GPT-5.5	OpenAI	2026-04-23	OpenAI flagship; first OpenAI model with a 1M-token API context window	1.05M	$5*	$30	Proprietary	Text + image in
Grok 4.3	xAI	2026	xAI's fastest + most intelligent GA model; native real-time X / web search	1M	$1.25*	$2.5	Proprietary	Text + image in
GLM-5	Zhipu / Z.ai	2026-02-11	Open MIT-licensed frontier model; very strong coding + agentic value	200K	$0.6	$1.92	Open	Multimodal

AI Model Comparison (2026)
