API Provider · 3 active models · 1 organization · mistral.ai

Mistral AI: API pricing, performance & models

Mistral AI hosts 3 active AI models, with input pricing starting at $0.15 per 1M tokens, average output throughput of 46 tok/s, and context windows of up to 262K tokens. Compare Mistral AI's API pricing, latency, and feature support against other LLM providers.

First-party only
Active models: 3

Pricing
From (input): $0.150 / 1M tokens
Average (input): $0.325 / 1M tokens

Performance
Throughput: 46 tok/s
Latency: 1.70 s
Max context: 262K tokens

Catalog

5 models:
Voxtral Mini
Mistral Small 4
Mistral Small 4
Mistral Large 3 (675B Instruct 2512)
Mistral Large 3 (675B Instruct 2512)

FAQ

Common questions about Mistral AI.

What is Mistral AI?

Mistral AI is an API provider that hosts large language models. It currently serves 3 active models, with input pricing from $0.15 per 1M tokens, average throughput of 46 tok/s, average latency of 1.70 s, and a maximum context window of 262K tokens.

How many models does Mistral AI offer?

Mistral AI currently serves 3 active models out of 17 historical offerings on LLM Stats.

What is Mistral AI's API pricing?

Mistral AI input pricing starts from $0.15 per 1M tokens, with the most expensive offering at $0.50 per 1M tokens. See the Pricing tab above for the full per-model breakdown.
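
As a rough sanity check on per-1M-token pricing, the input-token cost of a request is simply the token count divided by 1,000,000 times the quoted rate. Below is a minimal Python sketch using the two input prices quoted on this page; output tokens are billed separately at their own per-model rate.

# Estimate input-token cost from per-1M-token pricing.
# Prices are the range quoted on this page, not a live price feed.
def input_cost_usd(input_tokens: int, price_per_million_usd: float) -> float:
    return input_tokens / 1_000_000 * price_per_million_usd

if __name__ == "__main__":
    for price in (0.15, 0.50):  # cheapest and most expensive quoted input price
        print(f"100K input tokens at ${price:.2f}/1M: ${input_cost_usd(100_000, price):.4f}")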

How fast is Mistral AI?

Mistral AI averages 46 output tokens per second across its catalog, with average latency of 1.70s. Per-model performance is shown in the Performance tab.
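
For planning purposes, the averages above combine into a back-of-the-envelope response-time estimate: roughly the latency (time to first token) plus the number of output tokens divided by throughput. A minimal sketch using the catalog-wide averages; real numbers vary by model and load.

# Rough end-to-end time estimate: first-token latency + generation time.
AVG_LATENCY_S = 1.70        # average latency quoted above
AVG_THROUGHPUT_TPS = 46.0   # average output tokens per second quoted above

def estimated_response_time_s(output_tokens: int) -> float:
    return AVG_LATENCY_S + output_tokens / AVG_THROUGHPUT_TPS

if __name__ == "__main__":
    for n in (128, 512, 2048):
        print(f"{n} output tokens: ~{estimated_response_time_s(n):.1f}s")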

Does Mistral AI support multimodal models?

Yes. Mistral AI's catalog includes 2 vision-capable models. See the Models and Capabilities tabs for the full per-model breakdown.

Whose models does Mistral AI host?

Mistral AI hosts only first-party models, i.e., models created by Mistral AI itself. See the Models tab for the full catalog grouped by creator.

How do I start using Mistral AI?

Sign up at https://mistral.ai to get an API key, then call Mistral AI's API directly from your application. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.
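
A minimal Python sketch of a chat-completion request against Mistral AI's REST API. It assumes the requests library is installed, MISTRAL_API_KEY is set in your environment, and that "mistral-small-latest" is a valid model alias for your account; substitute any model from the catalog above.

# Minimal chat-completion request to Mistral AI's chat completions endpoint.
import os
import requests

resp = requests.post(
    "https://api.mistral.ai/v1/chat/completions",
    headers={
        "Authorization": f"Bearer {os.environ['MISTRAL_API_KEY']}",
        "Content-Type": "application/json",
    },
    json={
        "model": "mistral-small-latest",  # assumed alias; pick a model from the catalog
        "messages": [
            {"role": "user", "content": "Summarize Mistral AI's API in one sentence."}
        ],
    },
    timeout=60,
)
resp.raise_for_status()
print(resp.json()["choices"][0]["message"]["content"])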