Llama 3.3 70B Instruct
Meta
Overview
Llama 3.3 is a multilingual large language model optimized for dialogue. It is a pretrained and instruction-tuned generative model with 70 billion parameters that outperforms many open-source and closed chat models on common industry benchmarks. It supports a context length of 128,000 tokens and is intended for commercial and research use.
Llama 3.3 70B Instruct was released on December 6, 2024. API access is available through 9 providers, including Lambda, DeepInfra and others.
Compare Llama 3.3 70B Instruct to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.
Benchmarks
Llama 3.3 70B Instruct Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Pricing
Pricing, performance, and capabilities for Llama 3.3 70B Instruct across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input Modalities | Output Modalities |
|---|---|---|---|---|---|---|---|---|---|
| Lambda | $0.20 | $0.20 | 128.0K | 128.0K | 0.65 | 42.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| DeepInfra | $0.23 | $0.40 | 128.0K | 128.0K | 0.65 | 37.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| Hyperbolic | $0.40 | $0.40 | 128.0K | 128.0K | 0.65 | 42.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| Groq | $0.59 | $7.90 | 128.0K | 128.0K | 0.65 | 268.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| Sambanova | $0.60 | $1.20 | 128.0K | 128.0K | 0.65 | 1096.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| Cerebras | $0.70 | $0.80 | 128.0K | 128.0K | 0.65 | 2220.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| Bedrock | $0.72 | $0.72 | 128.0K | 128.0K | 0.5 | 100.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| Together | $0.88 | $0.88 | 128.0K | 128.0K | 0.65 | 65.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
| Fireworks | $0.89 | $0.89 | 128.0K | 128.0K | 0.65 | 197.0 tok/s | — | Text Image Audio Video | Text Image Audio Video |
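Because input and output tokens are priced separately, the cost of a single request depends on its token mix. The following is a minimal sketch of that arithmetic, using the per-million-token prices listed in the table; the `PRICES` dictionary and the `request_cost` helper are illustrative names, not part of any provider's API, and the prices may be out of date.

```python
# USD per 1M tokens as (input, output), copied from the provider table.
# Illustrative only: provider prices change over time.
PRICES = {
    "Lambda": (0.20, 0.20),
    "DeepInfra": (0.23, 0.40),
    "Hyperbolic": (0.40, 0.40),
    "Groq": (0.59, 7.90),
    "Sambanova": (0.60, 1.20),
    "Cerebras": (0.70, 0.80),
    "Bedrock": (0.72, 0.72),
    "Together": (0.88, 0.88),
    "Fireworks": (0.89, 0.89),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD of one request (hypothetical helper)."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload: 10,000 prompt tokens and 2,000 completion tokens.
    for name in PRICES:
        print(f"{name:<11} ${request_cost(name, 10_000, 2_000):.4f}")
```

For output-heavy workloads the output price dominates, so ranking providers by the input price alone can be misleading.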
Price Comparison for Llama 3.3 70B Instruct
Price per 1M input tokens (USD), lower is better
Throughput Comparison for Llama 3.3 70B Instruct
Tokens per second, higher is better
Latency Comparison for Llama 3.3 70B Instruct
Time to first token (s), lower is better
Llama 3.3 70B Instruct API Providers: Price vs Throughput
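One way to read a price-versus-throughput comparison is to keep only the providers that no other provider beats on both axes at once. The sketch below assumes "price" means the listed input price per 1M tokens (the chart does not say which price it plots) and uses the throughput figures from the provider table; `PROVIDERS` and `pareto_frontier` are illustrative names.

```python
# (provider, input price in USD per 1M tokens, throughput in tok/s),
# copied from the provider table. Using the input price is an assumption.
PROVIDERS = [
    ("Lambda", 0.20, 42.0),
    ("DeepInfra", 0.23, 37.0),
    ("Hyperbolic", 0.40, 42.0),
    ("Groq", 0.59, 268.0),
    ("Sambanova", 0.60, 1096.0),
    ("Cerebras", 0.70, 2220.0),
    ("Bedrock", 0.72, 100.0),
    ("Together", 0.88, 65.0),
    ("Fireworks", 0.89, 197.0),
]


def pareto_frontier(rows):
    """Return names of providers not dominated on (lower price, higher throughput)."""
    frontier = []
    best_throughput = float("-inf")
    # Walk from cheapest to most expensive; at equal price, fastest first.
    for name, price, throughput in sorted(rows, key=lambda r: (r[1], -r[2])):
        # A provider stays only if it is faster than every cheaper-or-equal one.
        if throughput > best_throughput:
            frontier.append(name)
            best_throughput = throughput
    return frontier


if __name__ == "__main__":
    print(pareto_frontier(PROVIDERS))
```

Providers off this frontier may still be preferable for other reasons, such as output price, latency, or quantization, which this sketch ignores.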
API Access
API access for Llama 3.3 70B Instruct will be available soon through our gateway.
