Name: Kimi K2 Instruct
Author: MoonshotAI

Question 1

When was Kimi K2 Instruct released?

Accepted Answer

Kimi K2 Instruct was released on July 11, 2025 by MoonshotAI. This is the official Kimi K2 Instruct release date tracked on LLM Stats.

Question 2

How much does Kimi K2 Instruct cost?

Accepted Answer

Kimi K2 Instruct pricing starts at $0.50 per million input tokens and $0.50 per million output tokens via Fireworks, the lowest price among tracked providers.

Question 3

Is Kimi K2 Instruct available via API?

Accepted Answer

Yes, Kimi K2 Instruct is available via API. See the official documentation for authentication and endpoint details. It is served by 2 providers tracked on LLM Stats.

Question 4

How big is Kimi K2 Instruct?

Accepted Answer

Kimi K2 Instruct has 1000 billion parameters. It was trained on 15.5 trillion tokens. It ships as an open-weight model, so you can download and run it on your own hardware.

Question 5

Who created Kimi K2 Instruct?

Accepted Answer

Kimi K2 Instruct was created by MoonshotAI.

Question 6

What is the license for Kimi K2 Instruct?

Accepted Answer

Kimi K2 Instruct is released under the MIT license. This is an open-source / open-weight license that permits self-hosting.

Question 7

What is Kimi K2 Instruct latency?

Accepted Answer

Kimi K2 Instruct p95 time to first token is 0.95 seconds via Novita over the trailing 7 days. Lower time to first token means the model begins responding sooner for chat, agents and API workloads.

Question 8

Where can I use Kimi K2 Instruct?

Accepted Answer

Kimi K2 Instruct is available through 2 providers including Fireworks, Novita.

Question 9

Where is the Kimi K2 Instruct paper or technical report?

Accepted Answer

Kimi K2 Instruct has a paper or technical report available at https://moonshotai.github.io/Kimi-K2/. Use that source for architecture, training, release and evaluation details.

Question 10

What models should I compare Kimi K2 Instruct against?

Accepted Answer

Common Kimi K2 Instruct comparisons include Kimi K2 Instruct vs MiMo-V2.5-Pro, Kimi K2 Instruct vs LongCat-Flash-Chat, Kimi K2 Instruct vs Qwen3 VL 235B A22B Instruct. Compare them side by side for benchmark scores, pricing, context window, latency and API availability.

Provider	Input $/M	Output $/M	Workload 1M + 100K	Context in / out	TTFT p50 / p95 s	Output avg / p5 c/s	Success 7d	Modalities in / out
Fireworks	$0.500	$0.500	$0.550	200.0K/200.0K	—/—	—/—	—	/
Novita	$0.570	$2.30	$0.800	131.1K/131.1K	—/0.95	45/—	—	/

Kimi K2 Instruct: API Pricing, Context Window & Benchmarks

Kimi K2 Instruct benchmarks

Rankings

Quality Tracker

Kimi K2 Instruct Performance Across Datasets

Kimi K2 Instruct pricing

Providers

Kimi K2 Instruct model size

Kimi K2 Instruct context window