Gemma 4 31B: Benchmarks, Pricing & Size
Gemma 4 31B is a language model from Google, released in April 2026, with multimodal input.
Gemma 4 31B is Google DeepMind's flagship dense multimodal model, with 31 billion parameters and a 256K context window. It ranks #3 among open models on Arena AI. Built from the same research as Gemini 3, its features include Per-Layer Embeddings.
Gemma 4 31B pricing
Providers
Gemma 4 31B starts at $0.130 per million input tokens and $0.380 per million output tokens via DeepInfra. See all 4 providers below with their per-token pricing, latency, throughput, and modality support.
| Provider | Input $/M | Output $/M | Max input | Max output | Latency p95 (s) | Throughput p95 | Quant | Input modalities | Output modalities |
|---|---|---|---|---|---|---|---|---|---|
| DeepInfra | $0.130 | $0.380 | 262.1K | 131.1K | 0.85 | 17 c/s | fp8 | | |
| | $0.140 | $0.400 | 262.1K | 131.1K | 5.92 | 24 c/s | — | | |
| | $0.140 | $0.400 | 262.1K | 131.1K | 0.98 | 36 c/s | bfloat16 | | |
| | $0.390 | $0.970 | 262.1K | 131.1K | 15.23 | — | — | | |
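To make the per-million-token prices concrete, here is a minimal Python sketch that converts them into a per-request cost. The `Pricing` class, the `request_cost` function, and the 10,000 / 2,000 token workload are hypothetical names and numbers chosen for illustration; only the dollar rates come from the table above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-million-token rates in USD (hypothetical helper type)."""
    input_per_million: float
    output_per_million: float


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at per-million-token rates."""
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# Lowest and highest rates listed in the provider table.
CHEAPEST = Pricing(input_per_million=0.130, output_per_million=0.380)
MOST_EXPENSIVE = Pricing(input_per_million=0.390, output_per_million=0.970)

if __name__ == "__main__":
    # Assumed workload: 10,000 input tokens and 2,000 output tokens.
    for label, pricing in [("cheapest", CHEAPEST), ("most expensive", MOST_EXPENSIVE)]:
        print(f"{label}: ${request_cost(pricing, 10_000, 2_000):.5f}")
```

At these rates the assumed request costs roughly $0.002 at the lowest listed price and roughly $0.006 at the highest, so the provider choice changes the bill by about a factor of three for the same workload.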
Gemma 4 31B model size
Gemma 4 31B has 30.7 billion parameters, which the model name rounds to 31B.
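As a rough guide to what this parameter count means in practice, the sketch below estimates the storage needed for the weights alone under the two quantization formats that appear in the provider table. It assumes 2 bytes per parameter for bfloat16 and 1 byte for fp8, and it ignores the KV cache, activations, and any runtime overhead, so real deployments need more memory than these figures. The function name `weight_memory_gb` is a hypothetical helper.

```python
# Parameter count as stated on this page.
PARAMS = 30.7e9

# Bytes per parameter for each format (bfloat16 is 16 bits, fp8 is 8 bits).
BYTES_PER_PARAM = {"bfloat16": 2, "fp8": 1}


def weight_memory_gb(params: float, quant: str) -> float:
    """Approximate weight storage in GB (10^9 bytes), weights only."""
    return params * BYTES_PER_PARAM[quant] / 1e9


if __name__ == "__main__":
    for quant in BYTES_PER_PARAM:
        print(f"{quant}: ~{weight_memory_gb(PARAMS, quant):.1f} GB")
```

Under these assumptions the weights take about 61.4 GB in bfloat16 and about 30.7 GB in fp8.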
Gemma 4 31B API
API access coming soon
Gemma 4 31B will be available through our gateway shortly.
Gemma 4 31B examples
Recent arena outputs from Gemma 4 31B, picked from the highest-ranked matchups.
Gemma 4 31B license
Gemma 4 31B is released under the Apache 2.0 license, which permits commercial use. The model has 30.7B parameters and a knowledge cutoff of January 2025.
- License: Apache 2.0 (commercial use allowed)
- Parameters: 30.7B
- Knowledge cutoff: January 2025
FAQ
Common questions about Gemma 4 31B.