GPT-4o
OpenAIOverview
GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.
GPT-4o was released on August 6, 2024. API access is available through Azure, OpenAI.
Performance
Timeline
Other Details
Related Models
Compare GPT-4o to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Performance visualization loading...
Gathering benchmark data from similar models
Benchmarks
GPT-4o Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for GPT-4o across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
Azure | $2.50 | $10.00 | 128.0K | 16.4K | 0.53 | 99.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
OpenAI | $2.50 | $10.00 | 128.0K | 16.4K | 0.5 | 132.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
Price Comparison for GPT-4o
Price per 1M input tokens (USD), lower is better
Throughput Comparison for GPT-4o
Tokens per second, higher is better
Latency Comparison for GPT-4o
Time to first token (s), lower is better
GPT-4o API Providers: Price vs Throughput
Example Outputs
Recent Posts
Recent Reviews
API Access
API Access Coming Soon
API access for GPT-4o will be available soon through our gateway.
FAQ
Common questions about GPT-4o
