API Provider8 active models1 organizationplatform.minimax.io

MiniMax: API pricing, performance & models

MiniMax hosts 8 active AI models, with input pricing from $0.30 per 1M tokens, averaging 93 tok/s output throughput, with up to 1.0M context window. Compare MiniMax's API pricing, latency, and feature support against other LLM providers.

AudioFirst-party only
8Active
Pricing
$0.300/MFrom
$0.300/MAvg
Performance
93tok/sThroughput
3.25sLatency
1.0MMax

Catalog

Type
Price
9 models
Model
Speech 2.5 Turbo Preview
Speech 2.5 HD Preview
Speech 02 Turbo
Speech 02 HD
MiniMax M2.7
MiniMax M2.5
MiniMax M2.5
MiniMax M2.1
MiniMax M2

FAQ

Common questions about MiniMax.

What is MiniMax?

MiniMax is an API provider that hosts large language models. Active models: 8; From (input): $0.30 / 1M tok; Avg throughput: 93 tok/s; Avg latency: 3.25 s; Max context: 1.0M.

How many models does MiniMax offer?

MiniMax currently serves 8 active models out of 8 historical offerings on LLM Stats.

What is MiniMax's API pricing?

MiniMax input pricing starts from $0.30 per 1M tokens, with the most expensive offering at $0.3 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

How fast is MiniMax?

MiniMax averages 93 output tokens per second across its catalog, with average latency of 3.25s. Per-model performance is shown in the Performance tab.

Does MiniMax support multimodal models?

Yes. MiniMax's catalog includes 1 vision-capable and 4 audio models. See the Models and Capabilities tabs for the full per-model breakdown.

Whose models does MiniMax host?

MiniMax hosts models from MiniMax. See the Models tab for the full catalog grouped by creator.

How do I start using MiniMax?

Sign up at https://platform.minimax.io to get an API key, then call MiniMax's API directly from your application. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.