Question 1

How much does it cost to run an AI agent?

Accepted Answer

It depends on three things: how many times the agent runs, how many LLM calls (steps) each run takes, and how big the context is at each step. Set those above and you'll get a monthly estimate for each major model. The key driver people miss is that an agent re-sends its growing history every step, so a 10-step agent costs far more than 10 single calls — often 3-6× more once context compounds.

Question 2

Why does context accumulation make agents expensive?

Accepted Answer

An agent is a loop: at each step it sends the system prompt, the task, everything it has said so far, and every tool result back to the model, then gets a new response. So step 8 might carry the tokens from steps 1-7. Input tokens grow roughly quadratically with the step count, and you pay for those input tokens every step. Toggle 'Context accumulates' to see the difference — it's usually the largest line in an agent's bill.

Question 3

How do I cut my agent's cost?

Accepted Answer

Four levers: (1) use a cheaper or smaller model for routine steps and reserve the frontier model for the hard ones; (2) trim or summarize the running context instead of re-sending everything; (3) cache the stable system prompt (most providers bill cached tokens far less); and (4) reduce step count with better planning. Running a capable model locally removes per-token cost entirely, which is why agentic workloads are a strong case for local inference.

Question 4

What counts as a 'step' and 'tool-result tokens'?

Accepted Answer

A step is one LLM call in the agent's loop (reason → act → observe). Tool-result tokens are whatever a tool returns — a web page, a database row, a file — that gets fed back into the model's context for the next step. These are easy to underestimate: a single scraped page can be thousands of tokens, and in an accumulating agent they're re-sent on every subsequent step.

Question 5

Are these prices exact?

Accepted Answer

They're approximate public list prices ($/1M tokens) and are directional, not a quote — confirm current pricing with each provider. A real agent's bill also shifts with prompt caching, retries, and parallel branches. Use this to compare models and sanity-check a budget, not to bill a client.

AI Agent Cost Calculator

Frequently asked

Liked the tool? Get the signal.