LongCat-Flash-Thinking-2601
Overview
LongCat-Flash-Thinking-2601 is an upgraded version of LongCat-Flash-Thinking, a Mixture-of-Experts (MoE) model with 560B total parameters, of which roughly 27B are activated. It achieves open-source state-of-the-art performance on core evaluation benchmarks, including Agentic Search, Agentic Tool Use, and Tool-Integrated Reasoning (TIR). It also features a Heavy Thinking mode that contributes +4 to +6 points on demanding agentic reasoning benchmarks. Mid-training with structured agentic trajectories improves pass@k by up to +12 points, and context management yields a +17.5 improvement.
LongCat-Flash-Thinking-2601 was released on January 14, 2026. API access is available through Meituan.
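The parameter counts quoted in the overview imply that only a small share of the model is active for any given token. A quick back-of-the-envelope check, using the approximate 27B figure from the overview (the variable names here are illustrative, not taken from any official tooling):

```python
# Figures quoted in the overview; the activated count is approximate (~27B).
TOTAL_PARAMS_B = 560
ACTIVATED_PARAMS_B = 27

# Share of the total parameters that is activated per token.
activated_share = ACTIVATED_PARAMS_B / TOTAL_PARAMS_B
print(f"Activated share per token: {activated_share:.1%}")  # prints "Activated share per token: 4.8%"
```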
Benchmarks
LongCat-Flash-Thinking-2601 Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for LongCat-Flash-Thinking-2601 across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input Modalities | Output Modalities |
|---|---|---|---|---|---|---|---|---|---|
| Meituan | $0.30 | $1.20 | 128.0K | 128.0K | 3.0 | 100.0 c/s | N/A | Text, Image, Audio, Video | Text, Image, Audio, Video |
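To make the listed rates concrete, here is a minimal sketch of estimating the cost of a single request. It assumes that "$/M" means US dollars per one million tokens, which the table does not state explicitly. The `Pricing` class, the `estimate_cost` function, and the token counts are hypothetical and do not belong to any provider SDK.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Listed rates, assumed to be USD per one million tokens."""
    input_per_million: float
    output_per_million: float


# Rates for the Meituan row of the pricing table.
MEITUAN = Pricing(input_per_million=0.30, output_per_million=1.20)


def estimate_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request."""
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# Hypothetical request: 20,000 input tokens and 4,000 output tokens.
cost = estimate_cost(MEITUAN, input_tokens=20_000, output_tokens=4_000)
print(f"${cost:.4f}")  # prints "$0.0108"
```

Because output tokens are listed at four times the input rate, long reasoning traces are likely to dominate the bill for thinking-heavy workloads.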
API Access
API access for LongCat-Flash-Thinking-2601 will be available soon through our gateway.