At a glance

Mistral AIpricing, performance & catalog

The citable facts about Mistral AI's 4 models — sourced from provider APIs and refreshed continuously.

Lowest price
Mistral Small 4 at $0.150 per 1M input tokens
Highest throughput
Mistral Large 3 (675B Instruct 2512) at 110 chars/s
Largest context
Mistral Large 3 (675B Instruct 2512) at 262K tokens
Catalog
4 active models from 1 organization

FAQ

Common questions about Mistral AI.

What is Mistral AI?

Mistral AI is an API provider that hosts large language models. Active models: 4; From (input): $0.15 / 1M tok; Median throughput: 86 c/s; P95 latency: 2.34s; Success rate (7d): 100%.

How many models does Mistral AI offer?

Mistral AI currently serves 4 active models out of 18 historical offerings on LLM Stats.

What is Mistral AI's API pricing?

Mistral AI input pricing starts from $0.15 per 1M tokens, with the most expensive offering at $1.5 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

How fast is Mistral AI?

Mistral AI delivers a median throughput of 86 characters per second and P95 latency of 2.34s across its model catalog. See the Models tab for per-model throughput and latency breakdowns.

Is Mistral AI reliable?

Mistral AI has a 100% success rate across 105 API calls in the last 7 days, with a 0% error rate.

Is Mistral AI OpenAI compatible?

Most providers expose an OpenAI-compatible /v1/chat/completions endpoint so you can switch from OpenAI to Mistral AI by changing only the base URL and API key. Check https://mistral.ai for the exact endpoint format and any provider-specific parameters.

Does Mistral AI support multimodal models?

Yes. Mistral AI's catalog includes 3 vision-capable models. See the Models and Capabilities tabs for the full per-model breakdown.

Whose models does Mistral AI host?

Mistral AI hosts models from Mistral AI. See the Models tab for the full catalog grouped by creator.

How do I start using Mistral AI?

Sign up at https://mistral.ai to get an API key, then call Mistral AI's API directly from your application. Most clients work out of the box by pointing the OpenAI SDK at Mistral AI's base URL with your key. Use the Models and Pricing tabs above to pick the right model for your latency, cost, and context-window requirements.