Llama 3.2 3B Instruct
Overview
Overview
Llama 3.2 3B Instruct is a large language model that supports a context length of 128K tokens and are state-of-the-art in their class for on-device use cases like summarization, instruction following, and rewriting tasks running locally at the edge.
Llama 3.2 3B Instruct was released on September 25, 2024. API access is available through DeepInfra.
Performance
Timeline
ReleasedUnknown
Knowledge CutoffUnknown
Specifications
Parameters
3.2B
License
Llama 3.2 Community License
Training Data
Unknown
Tags
tuning:instruct
Benchmarks
Benchmarks
Llama 3.2 3B Instruct Performance Across Datasets
Scores sourced from the model's scorecard, paper, or official blog posts
Notice missing or incorrect data?Start an Issue discussion→
Pricing
Pricing
Pricing, performance, and capabilities for Llama 3.2 3B Instruct across different providers:
| Provider | Input ($/M) | Output ($/M) | Max Input | Max Output | Latency (s) | Throughput | Quantization | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
DeepInfra | $0.01 | $0.02 | 128.0K | 128.0K | 0.24 | 171.5 c/s | — | Text Image Audio Video | Text Image Audio Video |
API Access
API Access Coming Soon
API access for Llama 3.2 3B Instruct will be available soon through our gateway.
Recent Posts
Recent Reviews
FAQ
Common questions about Llama 3.2 3B Instruct
Llama 3.2 3B Instruct was released on September 25, 2024 by Meta.
Llama 3.2 3B Instruct was created by Meta.
Llama 3.2 3B Instruct has 3.2 billion parameters.
Llama 3.2 3B Instruct is released under the Llama 3.2 Community License license. This is an open-source/open-weight license.