GPT-4o
Overview
GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.
GPT-4o was released on May 13, 2024. API access is available through Azure and OpenAI.
Performance
Timeline
Released: May 13, 2024
Knowledge Cutoff: Unknown
Specifications
Parameters: Unknown
License: Proprietary
Training Data: Unknown
Benchmarks
GPT-4o Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for GPT-4o across different providers:
| Provider | Input ($/M tokens) | Output ($/M tokens) | Max Input (tokens) | Max Output (tokens) | Latency (s) | Throughput (tok/s) | Quantization | Input Modalities | Output Modalities |
|---|---|---|---|---|---|---|---|---|---|
| Azure | $2.50 | $10.00 | 128.0K | 4.1K | 0.54 | 92.0 | — | Text, Image, Audio, Video | Text, Image, Audio, Video |
| OpenAI | $2.50 | $10.00 | 128.0K | 4.1K | 0.5 | 100.0 | — | Text, Image, Audio, Video | Text, Image, Audio, Video |
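
To make the per-token prices in the table concrete, here is a minimal Python sketch of the arithmetic for a single request. The helper name `estimate_cost_usd` and the example token counts are hypothetical, and the sketch assumes billing is strictly linear in token count at the listed rates, which a provider's actual invoice may not match (for example, because of discounts or different rates per modality).

```python
# Listed prices in USD per 1M tokens, copied from the table above
# (identical for Azure and OpenAI in this listing).
INPUT_USD_PER_M = 2.50
OUTPUT_USD_PER_M = 10.00


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: linear cost estimate from the listed rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example request: 10,000 input tokens and 1,000 output tokens.
    print(f"${estimate_cost_usd(10_000, 1_000):.4f}")
```

Because output tokens are listed at four times the input rate, a long response tends to dominate the cost even when the prompt is much larger than the reply.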
[Chart: Price Comparison for GPT-4o. Price per 1M input tokens (USD), lower is better.]
[Chart: Throughput Comparison for GPT-4o. Tokens per second, higher is better.]
[Chart: Latency Comparison for GPT-4o. Time to first token (s), lower is better.]
[Chart: GPT-4o API Providers: Price vs Throughput.]
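
The latency and throughput figures can be combined into a rough end-to-end response time. The sketch below assumes that latency means time to first token (as the chart caption states) and that tokens then stream at the listed throughput without variation; real response times depend on load, prompt size, and network conditions. The function name `estimate_response_seconds` is hypothetical.

```python
# Listed figures from the provider table: time to first token (seconds)
# and steady-state throughput (tokens per second).
PROVIDERS = {
    "Azure": {"ttft_s": 0.54, "tok_per_s": 92.0},
    "OpenAI": {"ttft_s": 0.5, "tok_per_s": 100.0},
}


def estimate_response_seconds(provider: str, output_tokens: int) -> float:
    """Hypothetical helper: time to first token plus streaming time."""
    figures = PROVIDERS[provider]
    return figures["ttft_s"] + output_tokens / figures["tok_per_s"]


if __name__ == "__main__":
    for name in PROVIDERS:
        seconds = estimate_response_seconds(name, 500)
        print(f"{name}: about {seconds:.2f} s for 500 output tokens")
```

Under these assumptions, throughput matters far more than time to first token for long responses, since the streaming term grows with output length while the first-token delay is fixed.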
API Access
API Access Coming Soon
API access for GPT-4o will be available soon through our gateway.
FAQ
Common questions about GPT-4o
GPT-4o was released on May 13, 2024.
