xAI's Grok models offer a flagship tier with a large context window and a low output-to-input price ratio, plus a flat cached-input rate, which can suit output-heavy workloads.
| Model | $ input /1M | $ output /1M | $ cached /1M | Batch | ≈ $/mo * |
|---|---|---|---|---|---|
| Grok 4.3FRONTIER | $1.25 | $2.50 | $0.20 | — | $178 |
| Grok Build 0.1MID | $1 | $2 | $0.20 | — | $148 |
* Example workload — chatbot, 100k requests/mo, 2,000 input / 300 output tokens per request, 70% of input cached. Computed by the same engine as the calculator. Batch: no verified batch discount published for Grok at our last revision — we only list discounts we've confirmed.
Cache pricing differs per model: Grok 4.3 at $0.20/1M (16% of input); Grok Build 0.1 at $0.20/1M (20% of input). The calculator models this with your cache share.
No verified batch discount published at our last revision — we only list discounts we've confirmed, so the Batch toggle in the calculator leaves Grok prices unchanged.
Grok 4.3 runs a verified 1M-token context window; Grok Build 0.1 is 256k tokens.
| Use case | Verdict |
|---|---|
| Output-heavy workloads | Grok's output rate is relatively low versus input |
| Large context within a single request | The flagship tier carries a large window |
| You need a confirmed batch discount up front | The exact batch multiplier is unpublished |