DeepSeekReleased on Dec 25, 2024

DeepSeek-V3: Benchmarks, Pricing & Context Window

DeepSeek-V3 is a language model from DeepSeek, released in December 2024.

A powerful Mixture-of-Experts (MoE) language model with 671B total parameters (37B activated per token). Features Multi-head Latent Attention (MLA), auxiliary-loss-free load balancing, and multi-token prediction training. Pre-trained on

Input
Text
Output
Text

DeepSeek-V3 pricing

Providers

DeepSeek-V3 starts at $0.270 per million input tokens and $1.10 per million output tokens via DeepSeek.

ProviderInput $/MOutput $/MMax InputMax OutputLatency sThroughputQuantInputOutput
DeepSeek logoDeepSeek
$0.270$1.10131.1K131.1K
0.50
100 c/s

DeepSeek-V3 API

API access coming soon

DeepSeek-V3 will be available through our gateway shortly.

DeepSeek-V3 examples

Recent arena outputs from DeepSeek-V3, picked from the highest-ranked matchups.

DeepSeek-V3 license

DeepSeek-V3 is released under the MIT + Model License (Commercial use allowed) license, which restricts commercial use, has 671.0B parameters.

License
MIT + Model License (Commercial use allowed)
Non-commercial
Parameters
671.0B

FAQ

Common questions about DeepSeek-V3.

What is the DeepSeek-V3 release date?

DeepSeek-V3 was released on December 25, 2024 by DeepSeek.

Who created DeepSeek-V3?

DeepSeek-V3 was created by DeepSeek.

How many parameters does DeepSeek-V3 have?

DeepSeek-V3 has 671.0 billion parameters.

What is the license for DeepSeek-V3?

DeepSeek-V3 is released under the MIT + Model License (Commercial use allowed) license. This is an open-source/open-weight license.