GPT-4o
GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English
Benchmarks
Arena Performance
GPT-4o Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Notice missing or incorrect data?Start an Issue discussion→
FAQ
Common questions about GPT-4o
GPT-4o was released on August 6, 2024 by OpenAI.
GPT-4o was created by OpenAI.
GPT-4o is released under the Proprietary license.
Yes, GPT-4o is a multimodal model that can process both text and images as input.