GPT-4o mini
Overview
GPT-4o mini is OpenAI's latest cost-efficient small model, designed to make AI intelligence more accessible and affordable. It excels in textual intelligence and multimodal reasoning, outperforming previous models like GPT-3.5 Turbo. With a context window of 128K tokens and support for text and vision, it offers low-cost, real-time applications such as customer support chatbots. Priced at 15 cents per million input tokens and 60 cents per million output tokens, it is significantly cheaper than its predecessors. Safety is prioritized with built-in measures and improved resistance to security threats.
GPT-4o mini was released on July 18, 2024. API access is available through Azure.
Performance
Timeline
Specifications
Benchmarks
GPT-4o mini Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for GPT-4o mini across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
Azure | $0.15 | $0.60 | 128.0K | 16.4K | 0.52 | 92.0 c/s | — | Text Image Audio Video | Text Image Audio Video |
API Access
API Access Coming Soon
API access for GPT-4o mini will be available soon through our gateway.
Recent Posts
Recent Reviews
FAQ
Common questions about GPT-4o mini
