- Organizations
- Meta
- Llama 3.1 405B Instruct
Llama 3.1 405B Instruct: Benchmarks, Pricing & Context Window
Llama 3.1 405B Instruct is a language model from Meta, released in July 2024.
Llama 3.1 405B Instruct is a large language model optimized for multilingual dialogue use cases. It outperforms many available open source and closed chat models on common industry benchmarks. The model supports 8 languages and has a 128K
Llama 3.1 405B Instruct pricing
Providers
Llama 3.1 405B Instruct starts at $0.890 per million input tokens and $0.890 per million output tokens via Lambda. See all 8 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.890 | $0.890 | 128.0K | 128.0K | 0.50 | 42 c/s | — | |||
| $1.79 | $1.79 | 128.0K | 128.0K | 0.50 | 27 c/s | — | |||
| $3.00 | $3.00 | 128.0K | 128.0K | 0.50 | 78 c/s | — | |||
| $3.00 | $3.00 | 128.0K | 128.0K | 0.50 | 100 c/s | — | |||
| $3.50 | $3.50 | 128.0K | 128.0K | 0.50 | 35 c/s | — | |||
| $4.00 | $4.00 | 128.0K | 128.0K | 0.50 | 40 c/s | — | |||
| $5.00 | $16.00 | 128.0K | 128.0K | 0.40 | 42 c/s | — | |||
| $9.50 | $9.50 | 128.0K | 128.0K | 0.50 | 22 c/s | — |
Llama 3.1 405B Instruct API
API access coming soon
Llama 3.1 405B Instruct will be available through our gateway shortly.
Llama 3.1 405B Instruct examples
Recent arena outputs from Llama 3.1 405B Instruct, picked from the highest-ranked matchups.
Llama 3.1 405B Instruct license
Llama 3.1 405B Instruct is released under the Llama 3.1 Community License license, which restricts commercial use, has 405.0B parameters.
- License
- Llama 3.1 Community License
- Non-commercial
- Parameters
- 405.0B
FAQ
Common questions about Llama 3.1 405B Instruct.