Llama 3.2 3B Instruct

Meta
llama-3.2-3b-instruct

Overview

Llama 3.2 3B Instruct is a large language model that supports a context length of 128K tokens and is state-of-the-art in its class for on-device use cases such as summarization, instruction following, and rewriting tasks running locally at the edge.

Llama 3.2 3B Instruct was released on September 25, 2024. API access is available through DeepInfra.

Performance

Timeline

Release Date: September 25, 2024
Knowledge Cutoff: Unknown

Other Details

Parameters: 3.2B
License: Llama 3.2 Community License
Training Data: Unknown
Tags: tuning:instruct

Related Models

Compare Llama 3.2 3B Instruct to other models by quality (GPQA score) vs cost. Higher scores and lower costs represent better value.

Benchmarks

Llama 3.2 3B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

llm-stats.com - Thu Dec 11 2025

Pricing

Pricing, performance, and capabilities for Llama 3.2 3B Instruct across different providers:

Provider: DeepInfra
Input price ($/M tokens): $0.01
Output price ($/M tokens): $0.02
Max Input: 128.0K tokens
Max Output: 128.0K tokens
Latency (s): 0.24
Throughput: 171.5 tok/s
Quantization: not listed
Input modalities: Text
Output modalities: Text
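To make the DeepInfra figures listed above concrete, the following is a minimal sketch of how per-request cost and rough response time could be estimated from them. The function names are hypothetical, and the time estimate assumes the listed latency is the delay before the first output token and that throughput stays constant, neither of which this page confirms.

```python
# Figures copied from the DeepInfra pricing entry; everything else is illustrative.
INPUT_USD_PER_M = 0.01     # dollars per million input tokens
OUTPUT_USD_PER_M = 0.02    # dollars per million output tokens
LATENCY_S = 0.24           # listed latency in seconds
THROUGHPUT_TOK_S = 171.5   # listed throughput in output tokens per second


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimated request cost in US dollars (hypothetical helper)."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000


def estimate_seconds(output_tokens: int) -> float:
    """Rough wall-clock estimate: latency plus generation time.

    Assumes latency means time to first token and throughput is constant.
    """
    return LATENCY_S + output_tokens / THROUGHPUT_TOK_S


if __name__ == "__main__":
    print(f"cost: ${estimate_cost_usd(2_000, 500):.6f}")
    print(f"time: {estimate_seconds(500):.2f} s")
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens costs about $0.00003, so at this price point cost is unlikely to be the limiting factor for most workloads.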

API Access

API Access Coming Soon

API access for Llama 3.2 3B Instruct will be available soon through our gateway.

FAQ

Common questions about Llama 3.2 3B Instruct

Llama 3.2 3B Instruct was released on September 25, 2024.
Llama 3.2 3B Instruct has 3.2 billion parameters.