- Organizations
- xAI
- Grok-1.5V
Grok-1.5V
A multimodal model capable of processing text and visual information, including documents, diagrams, charts, screenshots, and photographs. Notable for strong real-world spatial understanding capabilities.
Benchmarks
Arena Performance
Grok-1.5V Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Notice missing or incorrect data?Start an Issue discussion→
FAQ
Common questions about Grok-1.5V
Grok-1.5V was released on April 12, 2024 by xAI.
Grok-1.5V was created by xAI.
Grok-1.5V is released under the Proprietary license.
Yes, Grok-1.5V is a multimodal model that can process both text and images as input.