Llama 4 Maverick
MetaOverview
Llama 4 Maverick is a natively multimodal model capable of processing both text and images. It features a 17 billion active parameter mixture-of-experts (MoE) architecture with 128 experts, supporting a wide range of multimodal tasks such as conversational interaction, image analysis, and code generation. The model includes a 1 million token context window.
Llama 4 Maverick was released on April 5, 2025. API access is available through 7 providers, including DeepInfra, Novita and others.
Performance
Timeline
Other Details
Related Models
Compare Llama 4 Maverick to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Performance visualization loading...
Gathering benchmark data from similar models
Benchmarks
Llama 4 Maverick Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Llama 4 Maverick across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
DeepInfra | $0.17 | $0.60 | 1.0M | 1.0M | 0.38 | 83.59 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Novita | $0.17 | $0.85 | 1.0M | 1.0M | 0.62 | 69.42 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Lambda | $0.18 | $0.60 | 1.0M | 1.0M | 0.65 | 93.69 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Groq | $0.20 | $0.60 | 1.0M | 1.0M | 0.27 | 307.3 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Fireworks | $0.22 | $0.88 | 1.0M | 1.0M | 0.62 | 63.03 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Together | $0.27 | $0.85 | 1.0M | 1.0M | 0.2 | 97.93 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Sambanova | $0.63 | $1.79 | 1.0M | 1.0M | 2.04 | 638.7 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Price Comparison for Llama 4 Maverick
Price per 1M input tokens (USD), lower is better
Throughput Comparison for Llama 4 Maverick
Tokens per second, higher is better
Latency Comparison for Llama 4 Maverick
Time to first token (s), lower is better
Llama 4 Maverick API Providers: Price vs Throughput
Example Outputs
Recent Posts
Recent Reviews
API Access
API Access Coming Soon
API access for Llama 4 Maverick will be available soon through our gateway.
FAQ
Common questions about Llama 4 Maverick
