Llama 3.1 405B Instruct
Overview
Llama 3.1 405B Instruct is a large language model optimized for multilingual dialogue use cases. It outperforms many available open-source and closed chat models on common industry benchmarks. The model supports 8 languages and has a 128K-token context length.
Llama 3.1 405B Instruct was released on July 23, 2024. API access is available through 8 providers: Lambda, DeepInfra, Fireworks, Bedrock, Together, Hyperbolic, Google, and Replicate.
Benchmarks
[Chart: Llama 3.1 405B Instruct Performance Across Datasets. Scores sourced from the model's scorecard, paper, or official blog posts.]
Pricing
Pricing, performance, and capabilities for Llama 3.1 405B Instruct across different providers:
| Provider | Input price ($/M tokens) | Output price ($/M tokens) | Max input (tokens) | Max output (tokens) | Latency (s) | Throughput | Quantization | Input modalities | Output modalities |
|---|---|---|---|---|---|---|---|---|---|
| Lambda | $0.89 | $0.89 | 128.0K | 128.0K | 0.5 | 42.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
| DeepInfra | $1.79 | $1.79 | 128.0K | 128.0K | 0.5 | 27.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
| Fireworks | $3.00 | $3.00 | 128.0K | 128.0K | 0.5 | 78.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
| Bedrock | $3.00 | $3.00 | 128.0K | 128.0K | 0.5 | 100.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
| Together | $3.50 | $3.50 | 128.0K | 128.0K | 0.5 | 35.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
| Hyperbolic | $4.00 | $4.00 | 128.0K | 128.0K | 0.5 | 40.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
| Google | $5.00 | $16.00 | 128.0K | 128.0K | 0.4 | 42.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
| Replicate | $9.50 | $9.50 | 128.0K | 128.0K | 0.5 | 22.0 c/s | - | Text Image Audio Video | Text Image Audio Video |
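To make the per-million-token prices concrete, the following minimal sketch estimates what a single request might cost at each provider. The prices are copied from the table; the function names `request_cost` and `cheapest_provider` and the example token counts are illustrative assumptions, not part of any provider's API.

```python
# Per-million-token prices in USD, copied from the provider table:
# provider -> (input price, output price)
PRICES = {
    "Lambda": (0.89, 0.89),
    "DeepInfra": (1.79, 1.79),
    "Fireworks": (3.00, 3.00),
    "Bedrock": (3.00, 3.00),
    "Together": (3.50, 3.50),
    "Hyperbolic": (4.00, 4.00),
    "Google": (5.00, 16.00),
    "Replicate": (9.50, 9.50),
}


def request_cost(provider: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed prices."""
    input_price, output_price = PRICES[provider]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def cheapest_provider(input_tokens: int, output_tokens: int) -> str:
    """Provider with the lowest estimated cost for the given token counts."""
    return min(PRICES, key=lambda p: request_cost(p, input_tokens, output_tokens))


if __name__ == "__main__":
    # Hypothetical workload: a 10,000-token prompt with a 1,000-token reply.
    for name in PRICES:
        print(f"{name}: ${request_cost(name, 10_000, 1_000):.5f}")
```

Because Google prices output tokens higher than input tokens, its relative cost depends on how output-heavy the workload is; the other listed providers charge the same rate in both directions.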
[Chart: Price Comparison for Llama 3.1 405B Instruct. Price per 1M input tokens (USD), lower is better.]
[Chart: Throughput Comparison for Llama 3.1 405B Instruct. Tokens per second, higher is better.]
[Chart: Latency Comparison for Llama 3.1 405B Instruct. Time to first token (s), lower is better.]
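Latency and throughput can be combined into a rough end-to-end estimate: time to first token plus the output length divided by the generation rate. The sketch below applies that to the listed figures. It assumes the table's "c/s" throughput values can be read as tokens per second, as the throughput chart caption suggests; the function names are illustrative, and real response times will vary with load and prompt length.

```python
# provider -> (time to first token in seconds, throughput), copied from the table.
# Assumption: the "c/s" throughput values are treated as tokens per second.
PERFORMANCE = {
    "Lambda": (0.5, 42.0),
    "DeepInfra": (0.5, 27.0),
    "Fireworks": (0.5, 78.0),
    "Bedrock": (0.5, 100.0),
    "Together": (0.5, 35.0),
    "Hyperbolic": (0.5, 40.0),
    "Google": (0.4, 42.0),
    "Replicate": (0.5, 22.0),
}


def estimated_seconds(provider: str, output_tokens: int) -> float:
    """Rough response time: time to first token plus generation time."""
    latency, throughput = PERFORMANCE[provider]
    return latency + output_tokens / throughput


def fastest_provider(output_tokens: int) -> str:
    """Provider with the lowest estimated response time for this output length."""
    return min(PERFORMANCE, key=lambda p: estimated_seconds(p, output_tokens))
```

For all but the shortest replies, throughput dominates the estimate, since the listed latencies differ by at most 0.1 s.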
API Access
API access for Llama 3.1 405B Instruct will be available soon through our gateway.
