- Organizations
- Xiaomi
- MiMo-V2-Omni
MiMo-V2-Omni: Benchmarks, Pricing & Context Window
MiMo-V2-Omni is a language model from Xiaomi, released in March 2026, with multimodal input.
MiMo-V2-Omni is Xiaomi's omni foundation model uniting frontier multimodal understanding with strong agentic capability. It fuses dedicated image, video, and audio encoders into a single shared backbone, processing all modalities
MiMo-V2-Omni pricing
Providers
MiMo-V2-Omni starts at $0.400 per million input tokens and $2.00 per million output tokens via Xiaomi.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency p95 s | Throughput P95 | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.400 | $2.00 | 262.0K | 16.4K | — | — | — |
MiMo-V2-Omni API
API access coming soon
MiMo-V2-Omni will be available through our gateway shortly.
MiMo-V2-Omni examples
Recent arena outputs from MiMo-V2-Omni, picked from the highest-ranked matchups.
MiMo-V2-Omni license
MiMo-V2-Omni is released under the Proprietary license, which restricts commercial use.
- License
- Proprietary
- Non-commercial
Proprietary license - usage restrictions apply
FAQ
Common questions about MiMo-V2-Omni.