OpenAI offers a broad GPT-5 lineup spanning frontier, mid and budget tiers, all with prompt caching and batch discounts, plus a high-end Pro variant for the most demanding reasoning tasks.
| Model | $ input /1M | $ output /1M | $ cached /1M | Batch | ≈ $/mo * |
|---|---|---|---|---|---|
| GPT-5.5 ProFRONTIER | $30 | $180 | — | −50% | $11,400 |
| GPT-5.5FRONTIER | $5 | $30 | $0.50 | −50% | $1,270 |
| GPT-5.4FRONTIER | $2.50 | $15 | $0.25 | −50% | $635 |
| GPT-5.4 miniMID | $0.75 | $4.50 | $0.075 | −50% | $190.5 |
| GPT-5.4 nanoBUDGET | $0.20 | $1.25 | $0.02 | −50% | $52.3 |
* Example workload — chatbot, 100k requests/mo, 2,000 input / 300 output tokens per request, 70% of input cached. Computed by the same engine as the calculator. Batch: the −50% is OpenAI's verified Batch API discount; the ≈ $/mo column is computed without it.
Cached input is billed at 10% of the input rate on every GPT model with a published cache price; GPT-5.5 Pro doesn't list one, so the calculator charges it the full input rate. A major lever for chatbots and agents where most of the prompt repeats — the calculator models this with your cache share.
The Batch API runs asynchronous jobs at a verified −50% on both input and output across all models we track — flip the Batch toggle in the calculator to model it.
GPT-5.5, GPT-5.5 Pro and GPT-5.4 run a verified 1M-token context window; GPT-5.4 mini and GPT-5.4 nano are 400k tokens. GPT-5.5, GPT-5.5 Pro and GPT-5.4 bill higher rates on long-context prompts — this table and the calculator use standard rates.
| Use case | Verdict |
|---|---|
| You want one vendor covering frontier down to budget models with a consistent API | OpenAI's lineup fits |
| Workload is asynchronous and tolerant of delay | Batch tier lowers the per-token cost |
| Hard requirement for the lowest possible token price | Compare against budget-tier and open-weight providers in the table |