Gemma 3 4B: Benchmarks, Pricing & Context Window
Gemma 3 4B is a language model from Google, released in March 2025, with multimodal input.
Gemma 3 4B is a 4-billion-parameter vision-language model from Google, handling text and image input and generating text output. It features a 128K context window, multilingual support, and open weights. Suitable for question answering,
Gemma 3 4B pricing
Providers
Gemma 3 4B starts at $0.0200 per million input tokens and $0.0400 per million output tokens via DeepInfra.
| Provider | Input ($/M tokens) | Output ($/M tokens) | Max Input | Max Output | Latency (s) | Throughput | Quant | Input | Output |
|---|---|---|---|---|---|---|---|---|---|
| DeepInfra | $0.0200 | $0.0400 | 131.1K | 131.1K | 0.20 | 33 c/s | — | | |
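As a rough illustration of what the per-million-token rates mean for a single request, the sketch below multiplies token counts by the listed DeepInfra prices. The function name `estimate_cost` and the example token counts are hypothetical, and the calculation assumes the rates apply linearly with no minimum charge or separate image pricing.

```python
# Listed rates for Gemma 3 4B via DeepInfra (USD per million tokens).
INPUT_PRICE_PER_M = 0.02
OUTPUT_PRICE_PER_M = 0.04


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed rates.

    Hypothetical helper: assumes simple linear per-token billing.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 100,000-token prompt with a 2,000-token reply.
    cost = estimate_cost(100_000, 2_000)
    print(f"Estimated cost: ${cost:.6f}")
```

At these rates even a prompt that fills most of the context window costs a fraction of a cent, so output length and request volume are likely to dominate the bill.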
Gemma 3 4B API
API access coming soon
Gemma 3 4B will be available through our gateway shortly.
Gemma 3 4B examples
Recent arena outputs from Gemma 3 4B, picked from the highest-ranked matchups.
Gemma 3 4B license
Gemma 3 4B is released under the Gemma license, which permits commercial use. The model has 4.0B parameters and a knowledge cutoff of August 2024.
- License: Gemma (commercial use allowed)
- Parameters: 4.0B
- Knowledge cutoff: August 2024
Google Gemma Terms of Use
FAQ
Common questions about Gemma 3 4B.