Meta logo

Llama 3.2 3B Instruct

Overview

Llama 3.2 3B Instruct is a large language model that supports a context length of 128K tokens and are state-of-the-art in their class for on-device use cases like summarization, instruction following, and rewriting tasks running locally at the edge.

Llama 3.2 3B Instruct was released on September 25, 2024. API access is available through DeepInfra.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
3.2B
License
Llama 3.2 Community License
Training Data
Unknown
Tags
tuning:instruct

Benchmarks

Llama 3.2 3B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Fri Dec 26 2025
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing, performance, and capabilities for Llama 3.2 3B Instruct across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
DeepInfra logo
DeepInfra
$0.01$0.02128.0K128.0K0.24171.5 tok/s
Text
Image
Audio
Video
Text
Image
Audio
Video

API Access

API Access Coming Soon

API access for Llama 3.2 3B Instruct will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Llama 3.2 3B Instruct

Llama 3.2 3B Instruct was released on September 25, 2024.
Llama 3.2 3B Instruct has 3.2 billion parameters.