- Organizations
- DeepSeek
- DeepSeek-V4-Pro-Max
DeepSeek-V4-Pro-Max: Benchmarks, Pricing & Context Window
DeepSeek-V4-Pro-Max is a language model from DeepSeek, released in April 2026.
DeepSeek-V4-Pro-Max is the maximum reasoning effort mode of DeepSeek-V4-Pro, a 1.6T-parameter MoE model with 49B activated parameters and a 1M-token context window. It introduces a hybrid attention architecture combining Compressed Sparse
DeepSeek-V4-Pro-Max pricing
Providers
DeepSeek-V4-Pro-Max starts at $1.74 per million input tokens and $3.48 per million output tokens via DeepInfra. See all 2 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $1.74 | $3.48 | 1.0M | 65.5K | 24.25 | 13 c/s | fp4 | |||
| $1.74 | $3.48 | 1.0M | 393.2K | — | — | — |
DeepSeek-V4-Pro-Max API
API access coming soon
DeepSeek-V4-Pro-Max will be available through our gateway shortly.
DeepSeek-V4-Pro-Max examples
Recent arena outputs from DeepSeek-V4-Pro-Max, picked from the highest-ranked matchups.
DeepSeek-V4-Pro-Max license
DeepSeek-V4-Pro-Max is released under the MIT license, which permits commercial use, has 1.6T parameters.
- License
- MIT
- Commercial use allowed
- Parameters
- 1.6T
MIT License - allows commercial use
FAQ
Common questions about DeepSeek-V4-Pro-Max.