Pixtral-12B
MistralOverview
A 12B parameter multimodal model with a 400M parameter vision encoder, capable of understanding both natural images and documents. Excels at multimodal tasks while maintaining strong text-only performance. Supports variable image sizes and multiple images in context.
Pixtral-12B was released on September 17, 2024. API access is available through Mistral AI.
Performance
Timeline
Other Details
Related Models
Compare Pixtral-12B to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Performance visualization loading...
Gathering benchmark data from similar models
Benchmarks
Pixtral-12B Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Pixtral-12B across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
Mistral AI | $0.15 | $0.15 | 128.0K | 8.2K | 0.5 | 0.1 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Example Outputs
Recent Posts
Recent Reviews
API Access
API Access Coming Soon
API access for Pixtral-12B will be available soon through our gateway.
FAQ
Common questions about Pixtral-12B
