Meta logo

Llama 3.3 70B Instruct

Meta
llama-3.3-70b-instructVariant

Overview

Llama 3.3 is a multilingual large language model optimized for dialogue use cases across multiple languages. It is a pretrained and instruction-tuned generative model with 70 billion parameters, outperforming many open-source and closed chat models on common industry benchmarks. Llama 3.3 supports a context length of 128,000 tokens and is designed for commercial and research use in multiple languages.

Llama 3.3 70B Instruct was released on December 6, 2024. API access is available through 9 providers, including Lambda, DeepInfra and others.

Performance

Timeline

Release DateUnknown
Knowledge CutoffUnknown

Other Details

Parameters
70.0B
License
Llama 3.3 Community License Agreement
Training Data
Unknown
Tags
tuning:instruct

Related Models

Compare Llama 3.3 70B Instruct to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.

Performance visualization loading...

Gathering benchmark data from similar models

Benchmarks

Llama 3.3 70B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sun Dec 14 2025
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for Llama 3.3 70B Instruct across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Lambda logo
Lambda
$0.20$0.20128.0K128.0K0.6542.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
DeepInfra logo
DeepInfra
$0.23$0.40128.0K128.0K0.6537.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Hyperbolic logo
Hyperbolic
$0.40$0.40128.0K128.0K0.6542.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Groq logo
Groq
$0.59$7.90128.0K128.0K0.65268.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Sambanova logo
Sambanova
$0.60$1.20128.0K128.0K0.651096.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Cerebras logo
Cerebras
$0.70$0.80128.0K128.0K0.652220.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Bedrock logo
Bedrock
$0.72$0.72128.0K128.0K0.5100.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Together logo
Together
$0.88$0.88128.0K128.0K0.6565.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Fireworks logo
Fireworks
$0.89$0.89128.0K128.0K0.65197.0 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Price Comparison for Llama 3.3 70B Instruct

Price per 1M input tokens (USD), lower is better

LLM Stats Logollm-stats.com - Sun Dec 14 2025

Throughput Comparison for Llama 3.3 70B Instruct

Tokens per second, higher is better

LLM Stats Logollm-stats.com - Sun Dec 14 2025

Latency Comparison for Llama 3.3 70B Instruct

Time to first token (s), lower is better

LLM Stats Logollm-stats.com - Sun Dec 14 2025

Llama 3.3 70B Instruct API Providers: Price vs Throughput

Example Outputs

Recent Posts

Recent Reviews

API Access

API Access Coming Soon

API access for Llama 3.3 70B Instruct will be available soon through our gateway.

FAQ

Common questions about Llama 3.3 70B Instruct

Llama 3.3 70B Instruct was released on December 6, 2024.
Llama 3.3 70B Instruct has 70.0 billion parameters.