Pixtral Large
Overview
A 124B parameter multimodal model built on top of Mistral Large 2, featuring frontier-level image understanding capabilities. Excels at understanding documents, charts, and natural images while maintaining strong text-only performance. Features a 123B multimodal decoder and 1B parameter vision encoder with a 128K context window supporting up to 30 high-resolution images.
Pixtral Large was released on November 18, 2024. API access is available through Mistral AI.
Performance
Timeline
Specifications
Benchmarks
Pixtral Large Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Pixtral Large across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
Mistral AI | $2.00 | $6.00 | 128.0K | 128.0K | 0.5 | 0.1 c/s | — | Text Image Audio Video | Text Image Audio Video |
API Access
API Access Coming Soon
API access for Pixtral Large will be available soon through our gateway.
Recent Posts
Recent Reviews
FAQ
Common questions about Pixtral Large
