- Organizations
- Meta
- Llama 3.1 70B Instruct
Llama 3.1 70B Instruct: Benchmarks, Pricing & Context Window
Llama 3.1 70B Instruct is a language model from Meta, released in July 2024.
Llama 3.1 70B Instruct is a large language model optimized for multilingual dialogue use cases. It outperforms many available open source and closed chat models on common industry benchmarks.
Llama 3.1 70B Instruct pricing
Providers
Llama 3.1 70B Instruct starts at $0.200 per million input tokens and $0.200 per million output tokens via Lambda. See all 9 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency s | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| $0.200 | $0.200 | 128.0K | 128.0K | 0.50 | 42 c/s | — | |||
| $0.350 | $0.400 | 128.0K | 128.0K | 0.50 | 25 c/s | — | |||
| $0.400 | $0.400 | 128.0K | 128.0K | 0.50 | 100 c/s | — | |||
| $0.590 | $0.780 | 128.0K | 128.0K | 0.50 | 250 c/s | — | |||
| $0.600 | $0.600 | 128.0K | 128.0K | 0.20 | 1204 c/s | — | |||
| $0.890 | $0.890 | 128.0K | 128.0K | 0.50 | 94 c/s | — | |||
| $0.890 | $0.890 | 128.0K | 128.0K | 0.50 | 32 c/s | — | |||
| $0.890 | $0.890 | 128.0K | 128.0K | 0.50 | 100 c/s | — | |||
| $5.00 | $10.00 | 128.0K | 128.0K | 0.50 | 74 c/s | — |
Llama 3.1 70B Instruct API
API access coming soon
Llama 3.1 70B Instruct will be available through our gateway shortly.
Llama 3.1 70B Instruct examples
Recent arena outputs from Llama 3.1 70B Instruct, picked from the highest-ranked matchups.
Llama 3.1 70B Instruct license
Llama 3.1 70B Instruct is released under the Llama 3.1 Community License license, which restricts commercial use, has 70.0B parameters.
- License
- Llama 3.1 Community License
- Non-commercial
- Parameters
- 70.0B
FAQ
Common questions about Llama 3.1 70B Instruct.