Llama 3.2 3B Instruct: Benchmarks, Pricing & Context Window
Llama 3.2 3B Instruct is a language model from Meta, released in September 2024.
Llama 3.2 3B Instruct supports a context length of 128K tokens and is described as state-of-the-art in its class for on-device use cases such as summarization, instruction following, and rewriting tasks running locally.
Llama 3.2 3B Instruct pricing
Providers
Llama 3.2 3B Instruct starts at $0.0100 per million input tokens and $0.0200 per million output tokens via DeepInfra.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency (s) | Throughput | Quant |
|---|---|---|---|---|---|---|---|
| DeepInfra | $0.0100 | $0.0200 | 128.0K | 128.0K | 0.24 | 172 c/s | — |
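The per-million-token prices above translate into a per-request cost with simple arithmetic. The following is a minimal sketch, assuming the listed DeepInfra prices apply linearly with no minimum charge or rounding; the function name `estimate_cost` and the sample workload are hypothetical and used only for illustration.

```python
from decimal import Decimal

# Prices quoted in the listing above, in USD per million tokens.
INPUT_PRICE_PER_M = Decimal("0.0100")
OUTPUT_PRICE_PER_M = Decimal("0.0200")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request at the listed prices.

    Assumption: billing is strictly proportional to token counts.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # Hypothetical workload: 1,000 requests of 2,000 input and 500 output tokens each.
    per_request = estimate_cost(2_000, 500)
    print(f"per request: ${per_request:.6f}")
    print(f"per 1,000 requests: ${per_request * 1000:.4f}")
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens costs $0.00003, so 1,000 such requests come to about $0.03. `Decimal` is used instead of floats so that very small per-request amounts do not pick up binary rounding noise.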
Llama 3.2 3B Instruct API
API access coming soon
Llama 3.2 3B Instruct will be available through our gateway shortly.
Llama 3.2 3B Instruct license
Llama 3.2 3B Instruct is released under the Llama 3.2 Community License, which restricts commercial use. The model has 3.2B parameters.
- License: Llama 3.2 Community License (Non-commercial)
- Parameters: 3.2B