LLM / providers / mistral

Mistral AI API Pricing

Mistral is the European option in this comparison; its publicly priced flagship, Mistral Large 3, offers caching and a batch discount at a low per-token rate with a sizeable context window.

Prices verified June 2026 · changes logged in the changelog

Heads up: The Mistral Large 3 rate reflects a promotional price rather than a settled long-term list price, so treat it as subject to change when forecasting. Mistral's wider lineup (mid, small and specialised models) does not yet have verified public per-token pricing in our data, so only the flagship is shown.

Model	$ input /1M	$ output /1M	$ cached /1M	Batch	≈ $/mo *
Magistral MediumFRONTIER	$2	$5	—	−50%	$550
Mistral Medium 3.5FRONTIER	$1.50	$7.50	—	−50%	$525
Devstral 2FRONTIER	$0.40	$2	—	−50%	$140
Mistral Large 3FRONTIER	$0.50	$1.50	$0.05	−50%	$82
Magistral SmallMID	$0.50	$1.50	—	−50%	$145
CodestralMID	$0.30	$0.90	—	−50%	$87
Mistral Small 4MID	$0.10	$0.30	—	−50%	$29
Ministral 3 (14B)BUDGET	$0.20	$0.20	—	−50%	$46
Ministral 3 (8B)BUDGET	$0.15	$0.15	—	−50%	$34.5
Devstral Small 2BUDGET	$0.10	$0.30	—	−50%	$29
Ministral 3 (3B)BUDGET	$0.10	$0.10	—	−50%	$23

* Example workload — chatbot, 100k requests/mo, 2,000 input / 300 output tokens per request, 70% of input cached. Computed by the same engine as the calculator. Batch: the −50% is Mistral's verified Batch API discount; the ≈ $/mo column is computed without it.

Prompt caching

Cached input is billed at 10% of the input rate on every Mistral model with a published cache price; Mistral Medium 3.5, Magistral Medium, Devstral 2, Mistral Small 4, Magistral Small, Codestral, Devstral Small 2, Ministral 3 (3B), Ministral 3 (8B) and Ministral 3 (14B) don't list one, so the calculator charges them the full input rate. A major lever for chatbots and agents where most of the prompt repeats — the calculator models this with your cache share.

Batch / async

The Batch API runs asynchronous jobs at a verified −50% on both input and output across all models we track — flip the Batch toggle in the calculator to model it.

Context window

Mistral Large 3, Mistral Medium 3.5, Devstral 2, Mistral Small 4, Devstral Small 2, Ministral 3 (3B), Ministral 3 (8B) and Ministral 3 (14B) run a verified 256k-token context window; Magistral Medium, Magistral Small and Codestral are 128k tokens.

When Mistral is worth it

Use case	Verdict
You want an EU-based vendor for data-residency reasons	Mistral is the European option here
A single flagship model covers your needs	Mistral Large 3 with caching and batch fits
You need a budget or specialised tier with a verified public price	Only the flagship is currently verifiable

Is Mistral the right price for your workload?

The calculator puts these eleven models next to the other 17 we track — at your volume, token mix and cache share.

Open calculator

Frequently asked questions

Why does Mistral show only one model?

Only the flagship tier has a verified public per-token price in our data. Mistral's wider lineup (mid, small and specialised models) is expected to be added once its public per-token rates and context windows are confirmed.

Is the Mistral Large 3 price stable?

It reflects a promotional rate rather than a settled long-term list price, so treat it as subject to change for long-running budgets. The current figure and whether caching and batch apply are shown in the table above.

All 28 models → OpenAI pricing → Anthropic pricing → Gemini pricing → DeepSeek pricing → Grok pricing → Mistral alternatives → Cheapest LLM API → Price changelog →