When was DeepSeek-V4-Pro-Max released?

DeepSeek-V4-Pro-Max was released on April 23, 2026 by DeepSeek. This is the official DeepSeek-V4-Pro-Max release date tracked on LLM Stats.

How much does DeepSeek-V4-Pro-Max cost?

DeepSeek-V4-Pro-Max costs $1.74 per million input tokens and $3.48 per million output tokens through the LLM Stats API, which works with any OpenAI-compatible SDK. Across tracked providers, the lowest price is $1.60 per million input tokens via Novita.

Is DeepSeek-V4-Pro-Max available via API?

Yes. DeepSeek-V4-Pro-Max is available through the LLM Stats API and works with any OpenAI-compatible SDK — point your client at the gateway base URL and pass the model name. It is served by 5 providers tracked on LLM Stats.

How big is DeepSeek-V4-Pro-Max?

DeepSeek-V4-Pro-Max has 1600 billion parameters. It was trained on 32.0 trillion tokens. It ships as an open-weight model, so you can download and run it on your own hardware.

Who created DeepSeek-V4-Pro-Max?

DeepSeek-V4-Pro-Max was created by DeepSeek.

What is the license for DeepSeek-V4-Pro-Max?

DeepSeek-V4-Pro-Max is released under the MIT license. This is an open-source / open-weight license that permits self-hosting.

What is DeepSeek-V4-Pro-Max latency?

DeepSeek-V4-Pro-Max p95 time to first token is 3.20 seconds via Together over the trailing 7 days. Lower time to first token means the model begins responding sooner for chat, agents and API workloads.

Where can I use DeepSeek-V4-Pro-Max?

DeepSeek-V4-Pro-Max is available through 5 providers including Novita, DeepInfra, DeepSeek, and 2 more.

What models should I compare DeepSeek-V4-Pro-Max against?

Common DeepSeek-V4-Pro-Max comparisons include DeepSeek-V4-Pro-Max vs Claude Opus 4.6, DeepSeek-V4-Pro-Max vs Gemini 3 Flash, DeepSeek-V4-Pro-Max vs Claude Opus 4.5. Compare them side by side for benchmark scores, pricing, context window, latency and API availability.

DeepSeek-V4-Pro-Max API Pricing, Context Window & Benchmarks

Name: DeepSeek-V4-Pro-Max
Author: DeepSeek

DeepSeek-V4-Pro-Max benchmarks

Rankings

Quality Tracker

DeepSeek-V4-Pro-Max Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

llm-stats.com - Thu Jul 23 2026

Notice missing or incorrect data?

DeepSeek-V4-Pro-Max pricing

Providers

DeepSeek-V4-Pro-Max starts at $1.60 per million input tokens and $3.20 per million output tokens via Novita. See all 5 providers below with their per-token pricing, latency, throughput, and modality support.

Provider	Input $/M	Output $/M	Context in / out	TTFT p50 / p95 s	Output avg / p5 c/s	Success 7d	Modalities in / out
Novita	$1.60	$3.20	1.0M/131.1K	2.20/6.36	113/3	83.33%(6)	/
DeepInfra	$1.74	$3.48	1.0M/65.5K	1.39/5.87	210/124	100.00%(20)	/
DeepSeek	$1.74	$3.48	1.0M/393.2K	1.13/3.94	159/38	75.00%(4)	/
Fireworks	$1.74	$3.48	1.0M/65.5K	2.32/12.87	172/9	66.67%(12)	/
Together	$1.74	$3.48	524.3K/65.5K	0.48/3.20	233/72	71.43%(7)	/

Cached input is the discounted price for prompt tokens served from a provider cache. TTFT is time to first token. Output is characters per second; p5 is the sustained floor exceeded by 95% of observed requests. Success is calculated from completed versus failed requests over the trailing seven days.

Loading chart...

DeepSeek-V4-Pro-Max model size

DeepSeek-V4-Pro-Max has 1.6 trillion parameters and was trained on 32 trillion tokens. See how it compares to other models in the same parameter range.

ParametersTraining tokens

1.6TMoE

32Ttokens

20× tokens-to-params ratio

Frontier (200B+)

1.6T

1B7B70B405B

DeepSeek-V4-Pro-Max context window

Input and output token limits for DeepSeek-V4-Pro-Max, plus how it ranks on long-context understanding.

InputOutput

1.0Mtokens

393Ktokens

≈ 1.6k pages of text

1.0M

8K128K1M

DeepSeek-V4-Pro-Max API

POST/v1/chat/completions

Modeldeepseek-v4-pro-max

API key●

Prompt●

Stream

Run a request to see the response

Use it in your code

Billed at $1.74 input / $3.48 output per 1M tokens through the LLM Stats gateway.

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://gateway.llm-stats.com/v1"
)

response = client.chat.completions.create(
    model="deepseek-v4-pro-max",
    messages=[
        {"role": "user", "content": "What is machine learning?"}
    ]
)

print(response.choices[0].message.content)

Need an API key? Create one above in the playground, or read the API documentation.

DeepSeek-V4-Pro-Max latency

DeepSeek-V4-Pro-Max time to first token, sustained output throughput, and failed-request rate from live API traffic over the trailing 7 days.

Provider operational metrics

Time to first token, output throughput, and failed-request rate from live API traffic

Loading chart...

DeepSeek-V4-Pro-Max examples

Recent arena outputs from DeepSeek-V4-Pro-Max, picked from the highest-ranked matchups.

DeepSeek-V4-Pro-Max license

DeepSeek-V4-Pro-Max is released under the MIT license, which permits commercial use, has 1.6T parameters.

License: MIT; Commercial use allowed
Parameters: 1.6T

MIT License - allows commercial use

DeepSeek-V4-Pro-Max resources

Official sources for DeepSeek-V4-Pro-Max: api documentation, official playground, source repository, model weights.

DeepSeek-V4-Pro-Max vs other models

The most-compared alternatives to DeepSeek-V4-Pro-Max are Claude Opus 4.6, Gemini 3 Flash, Claude Opus 4.5. Open any pair side-by-side for benchmarks, pricing, context, and latency.