MiMo-V2-Flash: Benchmarks, Pricing & Context Window
MiMo-V2-Flash is a language model from Xiaomi, released in December 2025.
MiMo-V2-Flash is a powerful, efficient, and ultra-fast foundation language model that excels in reasoning, coding, and agentic scenarios. It is a Mixture-of-Experts model with 309B total parameters and 15B active parameters, featuring a
MiMo-V2-Flash pricing
Providers
MiMo-V2-Flash starts at $0.100 per million input tokens and $0.300 per million output tokens via Xiaomi.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency (s) | Throughput | Quant |
|---|---|---|---|---|---|---|---|
| Xiaomi | $0.100 | $0.300 | 256.0K | 16.4K | — | — | — |
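As a rough illustration of how the per-million-token prices translate into a per-request cost, here is a minimal Python sketch. It assumes the list prices quoted above ($0.100 input, $0.300 output per million tokens) and simple linear billing with no caching discounts or minimum charges. The helper name `estimate_cost_usd` is hypothetical and not part of any Xiaomi or gateway API.

```python
from decimal import Decimal

# List prices quoted on this page, in USD per million tokens.
INPUT_USD_PER_M = Decimal("0.100")
OUTPUT_USD_PER_M = Decimal("0.300")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimate the cost of one request in USD.

    Assumes purely linear billing on token counts (an assumption,
    not a documented billing rule).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = Decimal(input_tokens) * INPUT_USD_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_USD_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 200,000-token prompt with a 10,000-token completion.
    print(estimate_cost_usd(200_000, 10_000))
```

`Decimal` is used instead of floats so that small per-token amounts add up without binary rounding noise. Actual invoices may differ if the provider applies rounding or tiered pricing.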
MiMo-V2-Flash API
API access coming soon
MiMo-V2-Flash will be available through our gateway shortly.
MiMo-V2-Flash examples
Recent arena outputs from MiMo-V2-Flash, picked from the highest-ranked matchups.
MiMo-V2-Flash license
MiMo-V2-Flash is released under the MIT license, which permits commercial use. The model has 309.0B total parameters.
- License: MIT (commercial use allowed)
- Parameters: 309.0B
FAQ
Common questions about MiMo-V2-Flash.