Meta logo

Llama 3.2 3B Instruct

Overview

Overview

Llama 3.2 3B Instruct is a large language model that supports a context length of 128K tokens and are state-of-the-art in their class for on-device use cases like summarization, instruction following, and rewriting tasks running locally at the edge.

Llama 3.2 3B Instruct was released on September 25, 2024. API access is available through DeepInfra.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
3.2B
License
Llama 3.2 Community License
Training Data
Unknown
Tags
tuning:instruct

Benchmarks

Benchmarks

Llama 3.2 3B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sun Jan 25 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing

Pricing, performance, and capabilities for Llama 3.2 3B Instruct across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
DeepInfra logo
DeepInfra
$0.01$0.02128.0K128.0K
0.24
171.5 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video

API Access

API Access Coming Soon

API access for Llama 3.2 3B Instruct will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Llama 3.2 3B Instruct

Llama 3.2 3B Instruct was released on September 25, 2024 by Meta.
Llama 3.2 3B Instruct was created by Meta.
Llama 3.2 3B Instruct has 3.2 billion parameters.
Llama 3.2 3B Instruct is released under the Llama 3.2 Community License license. This is an open-source/open-weight license.