Gemma 4 26B-A4B: Benchmarks, Pricing & Size
Gemma 4 26B-A4B is a language model from Google, released in April 2026, with multimodal input.
Gemma 4 26B-A4B is Google DeepMind's Mixture-of-Experts multimodal model with 26 billion total parameters, 3.8 billion activated parameters per token, and a 256K context window. It ranks #6 among open models on Arena AI.
Gemma 4 26B-A4B pricing
Providers
Gemma 4 26B-A4B starts at $0.130 per million input tokens and $0.400 per million output tokens via Novita.
| Provider | Input $/M | Output $/M | Max Input | Max Output | Latency p95 (s) | Throughput p95 | Quant |
|---|---|---|---|---|---|---|---|
| Novita | $0.130 | $0.400 | 262.1K | 131.1K | 8.67 | — | bfloat16 |
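As a rough illustration of how the per-million-token prices translate into a per-request cost, the sketch below applies the listed rates ($0.130 input, $0.400 output) to a token count. The function name `estimate_cost` and the constant names are hypothetical, chosen only for this example; actual billing may round differently or add fees not shown on this page.

```python
from decimal import Decimal

# Prices listed on this page, in USD per million tokens.
INPUT_USD_PER_M = Decimal("0.130")
OUTPUT_USD_PER_M = Decimal("0.400")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes simple linear pricing with no minimum charge or rounding.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_M
        + Decimal(output_tokens) * OUTPUT_USD_PER_M
    )
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # A request with 10,000 prompt tokens and 2,000 completion tokens.
    print(f"${estimate_cost(10_000, 2_000)}")
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens costs about $0.0021, and one million tokens in each direction costs $0.53.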
Gemma 4 26B-A4B model size
Gemma 4 26B-A4B has 25.2 billion parameters. See how it compares to other models in the same parameter range.
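To put the parameter count in practical terms, the following back-of-the-envelope sketch estimates the memory needed just to hold the weights at bfloat16 precision (2 bytes per parameter, the quantization listed in the provider table). It assumes every expert is kept resident in memory and ignores the KV cache, activations, and runtime overhead, so real requirements will be higher. The names used are illustrative only.

```python
# Parameter count as listed on this page (25.2 billion).
PARAMS = 25_200_000_000
# bfloat16 stores each value in 16 bits, i.e. 2 bytes.
BYTES_PER_PARAM_BF16 = 2


def weight_memory_bytes(params: int, bytes_per_param: int) -> int:
    """Return the bytes needed to store the raw weights only."""
    return params * bytes_per_param


weight_bytes = weight_memory_bytes(PARAMS, BYTES_PER_PARAM_BF16)
weights_gb = weight_bytes / 1e9        # decimal gigabytes
weights_gib = weight_bytes / 2**30     # binary gibibytes

if __name__ == "__main__":
    print(f"{weights_gb:.1f} GB ({weights_gib:.1f} GiB) for weights alone")
```

By this estimate the weights alone occupy roughly 50.4 GB (about 46.9 GiB). Only a subset of parameters is activated per token, which reduces compute per token but, under the assumption above, not the memory needed to store the full model.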
Gemma 4 26B-A4B API
API access coming soon
Gemma 4 26B-A4B will be available through our gateway shortly.
Gemma 4 26B-A4B examples
Recent arena outputs from Gemma 4 26B-A4B, picked from the highest-ranked matchups.
Gemma 4 26B-A4B license
Gemma 4 26B-A4B is released under the Apache 2.0 license, which permits commercial use. The model has 25.2B parameters and a knowledge cutoff of January 2025.
- License: Apache 2.0 (commercial use allowed)
- Parameters: 25.2B
- Knowledge cutoff: January 2025
FAQ
Common questions about Gemma 4 26B-A4B.