Meta logo

Llama 3.1 405B Instruct

Overview

Llama 3.1 405B Instruct is a large language model optimized for multilingual dialogue use cases. It outperforms many available open source and closed chat models on common industry benchmarks. The model supports 8 languages and has a 128K token context length.

Llama 3.1 405B Instruct was released on July 23, 2024. API access is available through 8 providers, including Lambda, DeepInfra and others.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
405.0B
License
Llama 3.1 Community License
Training Data
Unknown
Tags
tuning:instruct

Benchmarks

Llama 3.1 405B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sat Jan 03 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for Llama 3.1 405B Instruct across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
Lambda logo
Lambda
$0.89$0.89128.0K128.0K
0.5
42.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
DeepInfra logo
DeepInfra
$1.79$1.79128.0K128.0K
0.5
27.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Fireworks logo
Fireworks
$3.00$3.00128.0K128.0K
0.5
78.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Bedrock logo
Bedrock
$3.00$3.00128.0K128.0K
0.5
100.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Together logo
Together
$3.50$3.50128.0K128.0K
0.5
35.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Hyperbolic logo
Hyperbolic
$4.00$4.00128.0K128.0K
0.5
40.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Google logo
Google
$5.00$16.00128.0K128.0K
0.4
42.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Replicate logo
Replicate
$9.50$9.50128.0K128.0K
0.5
22.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Price Comparison for Llama 3.1 405B Instruct

Price per 1M input tokens (USD), lower is better

LLM Stats Logollm-stats.com - Sat Jan 03 2026

Throughput Comparison for Llama 3.1 405B Instruct

Tokens per second, higher is better

LLM Stats Logollm-stats.com - Sat Jan 03 2026

Latency Comparison for Llama 3.1 405B Instruct

Time to first token (s), lower is better

LLM Stats Logollm-stats.com - Sat Jan 03 2026

API Access

API Access Coming Soon

API access for Llama 3.1 405B Instruct will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Llama 3.1 405B Instruct

Llama 3.1 405B Instruct was released on July 23, 2024.
Llama 3.1 405B Instruct has 405.0 billion parameters.