- Organizations
- DeepSeek
- DeepSeek-V3.2 (Non-thinking)
DeepSeek-V3.2 (Non-thinking): Benchmarks, Pricing & Context Window
DeepSeek-V3.2 (Non-thinking) is a language model from DeepSeek, released in December 2025.
DeepSeek-V3.2 in non-thinking mode. A powerful language model with 685B parameters using DeepSeek Sparse Attention (DSA) for efficient long-context processing. Supports JSON output, tool calls, and chat prefix completion. This is the
DeepSeek-V3.2 (Non-thinking) pricing
Providers
DeepSeek-V3.2 (Non-thinking) starts at $0.280 per million input tokens and $0.420 per million output tokens via DeepSeek.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.280 | $0.420 | 131.1K | 8.2K | 1.36 | 286 c/s | — |
DeepSeek-V3.2 (Non-thinking) API
API access coming soon
DeepSeek-V3.2 (Non-thinking) will be available through our gateway shortly.
DeepSeek-V3.2 (Non-thinking) examples
Recent arena outputs from DeepSeek-V3.2 (Non-thinking), picked from the highest-ranked matchups.
DeepSeek-V3.2 (Non-thinking) license
DeepSeek-V3.2 (Non-thinking) is released under the MIT license, which permits commercial use, has 685.0B parameters.
- License
- MIT
- Commercial use allowed
- Parameters
- 685.0B
MIT License - allows commercial use
FAQ
Common questions about DeepSeek-V3.2 (Non-thinking).