Qwen logo

Qwen2.5 VL 7B Instruct

Overview

Qwen2.5-VL is a vision-language model from the Qwen family. Key enhancements include visual understanding (objects, text, charts, layouts), visual agent capabilities (tool use, computer/phone control), long video comprehension with event pinpointing, visual localization (bounding boxes/points), and structured output generation.

Qwen2.5 VL 7B Instruct was released on January 26, 2025.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
8.3B
License
Apache 2.0
Training Data
Unknown
Tags
tuning:instruct

Benchmarks

Qwen2.5 VL 7B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Wed Dec 24 2025
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for Qwen2.5 VL 7B Instruct across different providers:

No pricing information available for this model.

API Access

API Access Coming Soon

API access for Qwen2.5 VL 7B Instruct will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Qwen2.5 VL 7B Instruct

Qwen2.5 VL 7B Instruct was released on January 26, 2025.
Qwen2.5 VL 7B Instruct has 8.3 billion parameters.