- Organizations
- Meta
- Llama 4 Scout
Llama 4 Scout: Benchmarks, Pricing & Context Window
Llama 4 Scout is a language model from Meta, released in April 2025, with multimodal input.
Llama 4 Scout is a natively multimodal model capable of processing both text and images. It features a 17 billion activated parameter (109B total) mixture-of-experts (MoE) architecture with 16 experts, supporting a wide range of multimodal
Llama 4 Scout pricing
Providers
Llama 4 Scout starts at $0.0800 per million input tokens and $0.300 per million output tokens via DeepInfra. See all 6 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.0800 | $0.300 | 10.0M | 10.0M | 0.31 | 76 c/s | — | |||
| $0.0800 | $0.300 | 10.0M | 10.0M | 0.43 | 140 c/s | — | |||
| $0.100 | $0.500 | 10.0M | 10.0M | 0.85 | 70 c/s | — | |||
| $0.110 | $0.340 | 10.0M | 10.0M | 1.08 | 776 c/s | — | |||
| $0.150 | $0.600 | 10.0M | 10.0M | 0.53 | 116 c/s | — | |||
| $0.180 | $0.590 | 10.0M | 10.0M | 0.54 | 107 c/s | — |
Llama 4 Scout API
API access coming soon
Llama 4 Scout will be available through our gateway shortly.
Llama 4 Scout examples
Recent arena outputs from Llama 4 Scout, picked from the highest-ranked matchups.
Llama 4 Scout license
Llama 4 Scout is released under the Llama 4 Community License Agreement license, which restricts commercial use, has 109.0B parameters.
- License
- Llama 4 Community License Agreement
- Non-commercial
- Parameters
- 109.0B
FAQ
Common questions about Llama 4 Scout.