Mistral is the European option in this comparison; its publicly priced flagship, Mistral Large 3, offers caching and a batch discount at a low per-token rate with a sizeable context window.
| Model | $ input /1M | $ output /1M | $ cached /1M | Batch | ≈ $/mo * |
|---|---|---|---|---|---|
| Magistral MediumFRONTIER | $2 | $5 | — | −50% | $550 |
| Mistral Medium 3.5FRONTIER | $1.50 | $7.50 | — | −50% | $525 |
| Devstral 2FRONTIER | $0.40 | $2 | — | −50% | $140 |
| Mistral Large 3FRONTIER | $0.50 | $1.50 | $0.05 | −50% | $82 |
| Magistral SmallMID | $0.50 | $1.50 | — | −50% | $145 |
| CodestralMID | $0.30 | $0.90 | — | −50% | $87 |
| Mistral Small 4MID | $0.10 | $0.30 | — | −50% | $29 |
| Ministral 3 (14B)BUDGET | $0.20 | $0.20 | — | −50% | $46 |
| Ministral 3 (8B)BUDGET | $0.15 | $0.15 | — | −50% | $34.5 |
| Devstral Small 2BUDGET | $0.10 | $0.30 | — | −50% | $29 |
| Ministral 3 (3B)BUDGET | $0.10 | $0.10 | — | −50% | $23 |
* Example workload — chatbot, 100k requests/mo, 2,000 input / 300 output tokens per request, 70% of input cached. Computed by the same engine as the calculator. Batch: the −50% is Mistral's verified Batch API discount; the ≈ $/mo column is computed without it.
Cached input is billed at 10% of the input rate on every Mistral model with a published cache price; Mistral Medium 3.5, Magistral Medium, Devstral 2, Mistral Small 4, Magistral Small, Codestral, Devstral Small 2, Ministral 3 (3B), Ministral 3 (8B) and Ministral 3 (14B) don't list one, so the calculator charges them the full input rate. A major lever for chatbots and agents where most of the prompt repeats — the calculator models this with your cache share.
The Batch API runs asynchronous jobs at a verified −50% on both input and output across all models we track — flip the Batch toggle in the calculator to model it.
Mistral Large 3, Mistral Medium 3.5, Devstral 2, Mistral Small 4, Devstral Small 2, Ministral 3 (3B), Ministral 3 (8B) and Ministral 3 (14B) run a verified 256k-token context window; Magistral Medium, Magistral Small and Codestral are 128k tokens.
| Use case | Verdict |
|---|---|
| You want an EU-based vendor for data-residency reasons | Mistral is the European option here |
| A single flagship model covers your needs | Mistral Large 3 with caching and batch fits |
| You need a budget or specialised tier with a verified public price | Only the flagship is currently verifiable |