Kimi K2-Thinking-0905

Overview

Kimi K2 Thinking is the latest and most capable version of MoonshotAI's open-source thinking model. Building on Kimi K2, it is designed as a thinking agent that reasons step by step while dynamically invoking tools. It sets a new state of the art on Humanity's Last Exam (HLE), BrowseComp, and other benchmarks by substantially scaling multi-step reasoning depth and keeping tool use stable across 200–300 sequential calls. K2 Thinking is also a native INT4-quantized model with a 256k context window, achieving lossless reductions in inference latency and GPU memory usage.

Key features include deep thinking and tool orchestration, with end-to-end training to interleave chain-of-thought reasoning with function calls; native INT4 quantization via Quantization-Aware Training (QAT), achieving a lossless 2x speed-up; and stable long-horizon agency, maintaining coherent goal-directed behavior across up to 200–300 consecutive tool invocations.
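To give a sense of why INT4 weights matter for GPU memory, the sketch below estimates weight storage from the parameter count listed on this page. It is a back-of-the-envelope calculation, not a measurement: it assumes every weight is stored at the stated precision and ignores quantization scales, the KV cache, and activations. The function name `weight_memory_gb` is illustrative only.

```python
# Rough weight-memory estimate: parameters x bits per weight.
# Assumptions (not from the source): every weight is stored at the stated
# precision; quantization scales, KV cache, and activations are ignored.

PARAMS = 1000.0e9  # parameter count listed on this page (1000.0B)

BITS_PER_WEIGHT = {"bf16": 16, "int4": 4}


def weight_memory_gb(params: float, bits: int) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return params * bits / 8 / 1e9


if __name__ == "__main__":
    for name, bits in BITS_PER_WEIGHT.items():
        print(f"{name}: ~{weight_memory_gb(PARAMS, bits):,.0f} GB")
```

Under these assumptions, 4-bit weights take a quarter of the space of 16-bit weights. Real deployments will differ, because some tensors may be kept at higher precision.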

Kimi K2-Thinking-0905 was released on September 5, 2025. API access is available through DeepInfra, Novita, and Fireworks.

Timeline

Released: Unknown
Knowledge Cutoff: Unknown

Specifications

Parameters: 1000.0B
License: MIT
Training Data: Unknown
Tags: tuning:thinking

Benchmarks

Kimi K2-Thinking-0905 Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

llm-stats.com - Sat Feb 21 2026

Pricing

Pricing, performance, and capabilities for Kimi K2-Thinking-0905 across different providers:

Provider    Input ($/M)  Output ($/M)  Max Input  Max Output  Latency (s)  Throughput  Quantization
DeepInfra   $0.47        $2.00         262.1K     262.1K      -            -           fp4
Novita      $0.48        $2.00         262.1K     262.1K      -            -           bf16
Fireworks   $0.60        $2.50         262.1K     262.1K      -            -           -
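The per-million-token prices listed above can be turned into a per-request cost estimate. The sketch below copies the three providers' prices from this page. The token limit of 256 * 1024 is an assumption about what "262.1K" means, and the function name `request_cost` and the example workload are hypothetical.

```python
# Prices copied from the provider listing on this page (USD per 1M tokens),
# stored as (input_price, output_price).
PRICES = {
    "DeepInfra": (0.47, 2.00),
    "Novita": (0.48, 2.00),
    "Fireworks": (0.60, 2.50),
}

# Assumption: the listed "262.1K" limit is 256 * 1024 = 262,144 tokens.
MAX_TOKENS = 256 * 1024


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD of one request at list prices."""
    if input_tokens > MAX_TOKENS or output_tokens > MAX_TOKENS:
        raise ValueError("token count exceeds the listed 262.1K limit")
    in_price, out_price = PRICES[provider]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 100k input tokens and 20k output tokens.
    for provider in PRICES:
        print(f"{provider}: ${request_cost(provider, 100_000, 20_000):.4f}")
```

Thinking models tend to produce long outputs, so the output price often dominates the total. It is worth estimating with realistic output lengths rather than input size alone.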

API Access

API Access Coming Soon

API access for Kimi K2-Thinking-0905 will be available soon through our gateway.

FAQ

Common questions about Kimi K2-Thinking-0905

Kimi K2-Thinking-0905 was released on September 5, 2025 by MoonshotAI.
Kimi K2-Thinking-0905 was created by MoonshotAI.
Kimi K2-Thinking-0905 has 1,000 billion (1 trillion) parameters.
Kimi K2-Thinking-0905 is released under the MIT license, an open-source, open-weight license.