GoogleReleased on Mar 12, 2025

Gemma 3 12B: Benchmarks, Pricing & Context Window

Gemma 3 12B is a language model from Google, released in March 2025, with multimodal input.

Gemma 3 12B is a 12-billion-parameter vision-language model from Google, handling text and image input and generating text output. It features a 128K context window, multilingual support, and open weights. Suitable for question answering,

Input
TextImage
Output
Text

Gemma 3 12B pricing

Providers

Gemma 3 12B starts at $0.0500 per million input tokens and $0.100 per million output tokens via DeepInfra.

ProviderInput $/MOutput $/MMax InputMax OutputLatency sThroughputQuantInputOutput
DeepInfra logoDeepInfra
$0.0500$0.100131.1K131.1K
0.20
33 c/s

Gemma 3 12B API

API access coming soon

Gemma 3 12B will be available through our gateway shortly.

Gemma 3 12B examples

Recent arena outputs from Gemma 3 12B, picked from the highest-ranked matchups.

Gemma 3 12B license

Gemma 3 12B is released under the Gemma license, which permits commercial use, has 12.0B parameters.

License
Gemma
Commercial use allowed
Parameters
12.0B

Google Gemma Terms of Use

FAQ

Common questions about Gemma 3 12B.

What is the Gemma 3 12B release date?

Gemma 3 12B was released on March 12, 2025 by Google.

Who created Gemma 3 12B?

Gemma 3 12B was created by Google.

How many parameters does Gemma 3 12B have?

Gemma 3 12B has 12.0 billion parameters.

What is the license for Gemma 3 12B?

Gemma 3 12B is released under the Gemma license.

Is Gemma 3 12B multimodal?

Yes, Gemma 3 12B is a multimodal model that can process both text and images as input.