Meta logo

Llama 3.3 70B Instruct

Overview

Overview

Llama 3.3 is a multilingual large language model optimized for dialogue use cases across multiple languages. It is a pretrained and instruction-tuned generative model with 70 billion parameters, outperforming many open-source and closed chat models on common industry benchmarks. Llama 3.3 supports a context length of 128,000 tokens and is designed for commercial and research use in multiple languages.

Llama 3.3 70B Instruct was released on December 6, 2024. API access is available through 9 providers, including Lambda, DeepInfra and others.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
70.0B
License
Llama 3.3 Community License Agreement
Training Data
Unknown
Tags
tuning:instruct

Benchmarks

Benchmarks

Llama 3.3 70B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sun Feb 08 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing

Pricing, performance, and capabilities for Llama 3.3 70B Instruct across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Lambda logo
Lambda
$0.20$0.20128.0K128.0K
0.65
42.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
DeepInfra logo
DeepInfra
$0.23$0.40128.0K128.0K
0.65
37.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Hyperbolic logo
Hyperbolic
$0.40$0.40128.0K128.0K
0.65
42.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Groq logo
Groq
$0.59$7.90128.0K128.0K
0.65
268.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Sambanova logo
Sambanova
$0.60$1.20128.0K128.0K
0.65
1096.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Cerebras logo
Cerebras
$0.70$0.80128.0K128.0K
0.65
2220.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Bedrock logo
Bedrock
$0.72$0.72128.0K128.0K
0.5
100.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Together logo
Together
$0.88$0.88128.0K128.0K
0.65
65.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Fireworks logo
Fireworks
$0.89$0.89128.0K128.0K
0.65
197.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Price Comparison for Llama 3.3 70B Instruct

Price per 1M input tokens (USD), lower is better

LLM Stats Logollm-stats.com - Sun Feb 08 2026

Throughput Comparison for Llama 3.3 70B Instruct

Tokens per second, higher is better

LLM Stats Logollm-stats.com - Sun Feb 08 2026

Latency Comparison for Llama 3.3 70B Instruct

Time to first token (s), lower is better

LLM Stats Logollm-stats.com - Sun Feb 08 2026

API Access

API Access Coming Soon

API access for Llama 3.3 70B Instruct will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Llama 3.3 70B Instruct

Llama 3.3 70B Instruct was released on December 6, 2024 by Meta.
Llama 3.3 70B Instruct was created by Meta.
Llama 3.3 70B Instruct has 70.0 billion parameters.
Llama 3.3 70B Instruct is released under the Llama 3.3 Community License Agreement license. This is an open-source/open-weight license.