API Provider4 active models1 organizationdeepseek.com

DeepSeek: API pricing, performance & models

DeepSeek hosts 4 active AI models, with input pricing from $0.14 per 1M tokens, averaging 55 tok/s output throughput, with up to 1.0M context window. Compare DeepSeek's API pricing, latency, and feature support against other LLM providers.

First-party only
4Active
Pricing
$0.140/MFrom
$0.677/MAvg
Performance
55tok/sThroughput
0.30sLatency
1.0MMax

Catalog

Type
Price
4 models
Model
DeepSeek-V4-Flash-Max
DeepSeek-V4-Pro-Max
DeepSeek-V3.2 (Non-thinking)
DeepSeek-R1-0528

FAQ

Common questions about DeepSeek.

What is DeepSeek?

DeepSeek is an API provider that hosts large language models. Active models: 4; From (input): $0.14 / 1M tok; Avg throughput: 55 tok/s; Avg latency: 0.30 s; Max context: 1.0M.

How many models does DeepSeek offer?

DeepSeek currently serves 4 active models out of 9 historical offerings on LLM Stats.

What is DeepSeek's API pricing?

DeepSeek input pricing starts from $0.14 per 1M tokens, with the most expensive offering at $1.74 per 1M tokens. See the Pricing tab above for the full per-model breakdown.

How fast is DeepSeek?

DeepSeek averages 55 output tokens per second across its catalog, with average latency of 0.30s. Per-model performance is shown in the Performance tab.

Whose models does DeepSeek host?

DeepSeek hosts models from DeepSeek. See the Models tab for the full catalog grouped by creator.

How do I start using DeepSeek?

Sign up at https://deepseek.com/ to get an API key, then call DeepSeek's API directly from your application. Use the Pricing and Performance tabs above to pick the right model for your latency, cost, and context-window requirements.