
Qwen2.5 VL 72B Instruct
Qwen
Overview
Qwen2.5-VL is Qwen's flagship vision-language model and a significant improvement over Qwen2-VL. It excels at recognizing objects; analyzing text, charts, and layouts in images; acting as a visual agent; understanding long videos (over 1 hour) and pinpointing events within them; performing visual localization with bounding boxes or points; and generating structured outputs from documents.
Qwen2.5 VL 72B Instruct was released on January 26, 2025.
Performance
Compare Qwen2.5 VL 72B Instruct to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Benchmarks
Qwen2.5 VL 72B Instruct Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Qwen2.5 VL 72B Instruct across different providers:
API Access
API Access Coming Soon
API access for Qwen2.5 VL 72B Instruct will be available soon through our gateway.
FAQ
Common questions about Qwen2.5 VL 72B Instruct
