DeepSeek-V3: Benchmarks, Pricing & Context Window
DeepSeek-V3 is a language model from DeepSeek, released in December 2024.
A powerful Mixture-of-Experts (MoE) language model with 671B total parameters (37B activated per token). Features Multi-head Latent Attention (MLA), auxiliary-loss-free load balancing, and multi-token prediction training. Pre-trained on
Input: Text
Output: Text
DeepSeek-V3 pricing
Providers
DeepSeek-V3 starts at $0.270 per million input tokens and $1.10 per million output tokens via DeepSeek.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency (s) | Throughput | Quant | Input Modality | Output Modality |
|---|---|---|---|---|---|---|---|---|---|
| DeepSeek | $0.270 | $1.10 | 131.1K | 131.1K | 0.50 | 100 c/s | — | Text | Text |
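To make the per-million-token rates concrete, here is a minimal sketch of a request cost estimate. It assumes the DeepSeek rates listed above ($0.270 input, $1.10 output per million tokens) apply uniformly, with no caching discounts or tiering. The function name `estimate_cost_usd` and the sample workload are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

# Rates from the pricing listing above (USD per million tokens, via DeepSeek).
INPUT_PRICE_PER_M = Decimal("0.270")
OUTPUT_PRICE_PER_M = Decimal("1.10")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of a workload at the listed flat rates.

    Hypothetical helper: assumes flat per-token pricing with no discounts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Illustrative workload: 2,000,000 input tokens and 500,000 output tokens.
    print(estimate_cost_usd(2_000_000, 500_000))
```

For that sample workload the estimate comes to about $1.09: $0.54 for input plus $0.55 for output. `Decimal` is used rather than floats so that small per-token amounts do not pick up binary rounding error when summed over many requests.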
DeepSeek-V3 API
API access coming soon
DeepSeek-V3 will be available through our gateway shortly.
DeepSeek-V3 examples
Recent arena outputs from DeepSeek-V3, picked from the highest-ranked matchups.
DeepSeek-V3 license
DeepSeek-V3 is released under the MIT + Model License, which allows commercial use. The model has 671.0B parameters.
- License: MIT + Model License (Commercial use allowed)
- Non-commercial: No
- Parameters: 671.0B
FAQ
Common questions about DeepSeek-V3.