Meta logo

Llama 3.2 90B Instruct

Overview

Overview

Llama 3.2 90B is a large multimodal language model optimized for visual recognition, image reasoning, and captioning tasks. It supports a context length of 128,000 tokens and is designed for deployment on edge and mobile devices, offering state-of-the-art performance in image understanding and generative tasks.

Llama 3.2 90B Instruct was released on September 25, 2024. API access is available through 5 providers, including DeepInfra, Bedrock and others.

Performance

Timeline

ReleasedUnknown
Knowledge CutoffUnknown

Specifications

Parameters
90.0B
License
Llama 3.2
Training Data
Unknown
Tags
tuning:instruct

Benchmarks

Benchmarks

Llama 3.2 90B Instruct Performance Across Datasets

Scores sourced from the model's scorecard, paper, or official blog posts

LLM Stats Logollm-stats.com - Sat Feb 21 2026
Notice missing or incorrect data?Start an Issue discussion

Pricing

Pricing

Pricing, performance, and capabilities for Llama 3.2 90B Instruct across different providers:

ProviderInput ($/M)Output ($/M)Max InputMax OutputLatency (s)ThroughputQuantizationInputOutput
DeepInfra logo
DeepInfra
$0.35$0.40128.0K128.0K
0.5
24.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Bedrock logo
Bedrock
$0.72$0.72128.0K128.0K
0.5
100.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Fireworks logo
Fireworks
$0.89$0.89128.0K128.0K
0.5
50.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Together logo
Together
$1.20$1.20128.0K128.0K
0.5
57.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video
Hyperbolic logo
Hyperbolic
$2.00$2.00128.0K128.0K
0.5
42.0 c/s
Text
Image
Audio
Video
Text
Image
Audio
Video

Price Comparison for Llama 3.2 90B Instruct

Price per 1M input tokens (USD), lower is better

LLM Stats Logollm-stats.com - Sat Feb 21 2026

Throughput Comparison for Llama 3.2 90B Instruct

Tokens per second, higher is better

LLM Stats Logollm-stats.com - Sat Feb 21 2026

Latency Comparison for Llama 3.2 90B Instruct

Time to first token (s), lower is better

LLM Stats Logollm-stats.com - Sat Feb 21 2026

API Access

API Access Coming Soon

API access for Llama 3.2 90B Instruct will be available soon through our gateway.

Recent Posts

Recent Reviews

FAQ

Common questions about Llama 3.2 90B Instruct

Llama 3.2 90B Instruct was released on September 25, 2024 by Meta.
Llama 3.2 90B Instruct was created by Meta.
Llama 3.2 90B Instruct has 90.0 billion parameters.
Llama 3.2 90B Instruct is released under the Llama 3.2 license. This is an open-source/open-weight license.
Yes, Llama 3.2 90B Instruct is a multimodal model that can process both text and images as input.