GPT-4o token & cost calculator
OpenAI GPT-4o is the model a large fraction of production LLM code is still pinned to — the literal default in countless codebases written between its launch and the GPT-4.1/GPT-5 era. It is now a legacy model: no longer headlined on OpenAI's pricing page, superseded on both price and capability, but still callable at its grandfathered rate of $2.50 / $10 per million input / output tokens.
This page exists because "what does GPT-4o actually cost on my prompt" is still a real question for teams running it in anger. The honest framing: GPT-4o isn't the model you'd choose today, it's the model you're already running — and the useful decision is whether the cost of staying is worth the cost of moving.
Saved scenariosnone yet
Saved on this browser only — never uploaded. Up to 10 scenarios.
Tip: save a scenario when you have a prompt + model + response length you might revisit. Useful for sizing features before committing to a vendor.
Verify privacysince this page loaded — updates live
Open DevTools → Network. Type into the calculator. No request bodies should contain your prompt text.
Pricing
Flat $2.50 / $10 per million tokens, no context tier. The Batch API still applies its 50% discount to GPT-4o, which is the single biggest lever if you're committed to staying on it for asynchronous workloads.
| Tier | Input $/M | Output $/M |
|---|---|---|
| All input | $2.5 | $10 |
| Context window | 128,000 tokens | |
Verified against openai.com on 2026-05-18.
Worked examples
Note how unremarkable these numbers are at 2026 rates — that's the point. GPT-4o was priced as a frontier model; today the same scenarios on GPT-5 Mini or GPT-4.1 Mini cost a fraction of this, which is why the migration math usually favors moving.
| Scenario | Input | Output | Cost |
|---|---|---|---|
Short chat turn A typical Q&A turn with a small system prompt. | 800 | 400 | <$0.01 |
System prompt + tool spec A larger context window with a tool schema, single response. | 5,000 | 500 | $0.018 |
Long document Q&A A long-form input (e.g. transcript) with a structured response. | 50,000 | 1,500 | $0.140 |
The decision rule that actually matters: the token bill is rarely why you'd stay on GPT-4o — output stability is. If you have a validated pipeline whose prompts were tuned against this exact model and whose outputs feed downstream logic, the re-evaluation cost of switching can dwarf months of the price difference. Use the comparison view to quantify the delta, then put a real number on your eval effort and compare the two.
How is this counted?
GPT-4o uses OpenAI's canonical o200k_base tokenizer — identical vocab to GPT-4.1 and GPT-5. We count via gpt-tokenizer (MIT), calibration factor 1.0, so the count is exact. Inputs over 50,000 characters tokenize in a Web Worker so the page stays responsive.
FAQ
Is GPT-4o still available in 2026?
Should I migrate off GPT-4o?
Is the token count exact?
What's the context window?
Does my prompt leave the browser?
Compare against every other model
To see this exact prompt scored against every supported model, sorted by total cost, paste it into the home calculator and toggle Compare across all models. For a legacy model like GPT-4o this is the fastest way to see, on your real input, exactly how much you're paying for staying put.
Related models
The migrations worth pricing first: GPT-5 Mini and GPT-4.1 Mini (both cheaper and stronger, the usual destinations), and GPT-4o mini (the in-family step down if you only need to cut cost without re-validating across generations).