- Organizations
- Meituan
- LongCat-Flash-Thinking
LongCat-Flash-Thinking: Benchmarks, Pricing & Size
LongCat-Flash-Thinking is a language model from Meituan, released in September 2025.
LongCat-Flash-Thinking is Meituan's reasoning model built on the LongCat-Flash foundation with 560B total parameters (MoE, ~27B activated). It introduces a training pipeline specifically tuned for advanced reasoning, featuring Re-thinking
LongCat-Flash-Thinking pricing
Providers
LongCat-Flash-Thinking starts at $0.300 per million input tokens and $1.20 per million output tokens via Meituan.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency p95 s | Throughput P95 | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.300 | $1.20 | 128.0K | 128.0K | 3.00 | 100 c/s | — |
LongCat-Flash-Thinking model size
LongCat-Flash-Thinking has 560 billion parameters. See how it compares to other models in the same parameter range.
LongCat-Flash-Thinking API
API access coming soon
LongCat-Flash-Thinking will be available through our gateway shortly.
LongCat-Flash-Thinking examples
Recent arena outputs from LongCat-Flash-Thinking, picked from the highest-ranked matchups.
LongCat-Flash-Thinking license
LongCat-Flash-Thinking is released under the MIT license, which permits commercial use, has 560.0B parameters.
- License
- MIT
- Commercial use allowed
- Parameters
- 560.0B
MIT License - allows commercial use
FAQ
Common questions about LongCat-Flash-Thinking.