Model Comparison

DeepSeek-V2.5 vs Gemma 3n E4B Instructed

DeepSeek-V2.5 significantly outperforms across most benchmarks. DeepSeek-V2.5 is 142.9x cheaper per token.

Performance Benchmarks

Comparative analysis across standard metrics

2 benchmarks

DeepSeek-V2.5 outperforms in 2 benchmarks (HumanEval, MMLU), while Gemma 3n E4B Instructed is better at 0 benchmarks.

DeepSeek-V2.5 significantly outperforms across most benchmarks.

Mon Apr 20 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

DeepSeek-V2.5 costs less

For input processing, DeepSeek-V2.5 ($0.14/1M tokens) is 142.9x cheaper than Gemma 3n E4B Instructed ($20.00/1M tokens).

For output processing, DeepSeek-V2.5 ($0.28/1M tokens) is 142.9x cheaper than Gemma 3n E4B Instructed ($40.00/1M tokens).

In conclusion, Gemma 3n E4B Instructed is more expensive than DeepSeek-V2.5.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
Mon Apr 20 2026 • llm-stats.com
DeepSeek
DeepSeek-V2.5
Input tokens$0.14
Output tokens$0.28
Best providerDeepSeek
Google
Gemma 3n E4B Instructed
Input tokens$20.00
Output tokens$40.00
Best providerTogether
Notice missing or incorrect data?Start an Issue

Model Size

Parameter count comparison

228.0B diff

DeepSeek-V2.5 has 228.0B more parameters than Gemma 3n E4B Instructed, making it 2850.0% larger.

DeepSeek
DeepSeek-V2.5
236.0Bparameters
Google
Gemma 3n E4B Instructed
8.0Bparameters
236.0B
DeepSeek-V2.5
8.0B
Gemma 3n E4B Instructed

Context Window

Maximum input and output token capacity

Gemma 3n E4B Instructed accepts 32,000 input tokens compared to DeepSeek-V2.5's 8,192 tokens. Gemma 3n E4B Instructed can generate longer responses up to 32,000 tokens, while DeepSeek-V2.5 is limited to 8,192 tokens.

DeepSeek
DeepSeek-V2.5
Input8,192 tokens
Output8,192 tokens
Google
Gemma 3n E4B Instructed
Input32,000 tokens
Output32,000 tokens
Mon Apr 20 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Gemma 3n E4B Instructed supports multimodal inputs, whereas DeepSeek-V2.5 does not.

Gemma 3n E4B Instructed can handle both text and other forms of data like images, making it suitable for multimodal applications.

DeepSeek-V2.5

Text
Images
Audio
Video

Gemma 3n E4B Instructed

Text
Images
Audio
Video

License

Usage and distribution terms

DeepSeek-V2.5 is licensed under deepseek, while Gemma 3n E4B Instructed uses a proprietary license.

License differences may affect how you can use these models in commercial or open-source projects.

DeepSeek-V2.5

deepseek

Open weights

Gemma 3n E4B Instructed

Proprietary

Closed source

Release Timeline

When each model was launched

DeepSeek-V2.5 was released on 2024-05-08, while Gemma 3n E4B Instructed was released on 2025-06-26.

Gemma 3n E4B Instructed is 14 months newer than DeepSeek-V2.5.

DeepSeek-V2.5

May 8, 2024

2.0 years ago

Gemma 3n E4B Instructed

Jun 26, 2025

9 months ago

1.1yr newer

Knowledge Cutoff

When training data ends

Gemma 3n E4B Instructed has a documented knowledge cutoff of 2024-06-01, while DeepSeek-V2.5's cutoff date is not specified.

We can confirm Gemma 3n E4B Instructed's training data extends to 2024-06-01, but cannot make a direct comparison without DeepSeek-V2.5's cutoff date.

DeepSeek-V2.5

Gemma 3n E4B Instructed

Jun 2024

Provider Availability

DeepSeek-V2.5 is available from DeepSeek, DeepInfra, Hyperbolic. Gemma 3n E4B Instructed is available from Together.

DeepSeek-V2.5

deepseek logo
DeepSeek
Input Price:Input: $0.14/1MOutput Price:Output: $0.28/1M
deepinfra logo
Deepinfra
Input Price:Input: $0.70/1MOutput Price:Output: $1.40/1M
hyperbolic logo
Hyperbolic
Input Price:Input: $2.00/1MOutput Price:Output: $2.00/1M

Gemma 3n E4B Instructed

together logo
Together
Input Price:Input: $20.00/1MOutput Price:Output: $40.00/1M
* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Less expensive input tokens
Less expensive output tokens
Has open weights
Higher HumanEval score (89.0% vs 75.0%)
Higher MMLU score (80.4% vs 64.9%)
Larger context window (32,000 tokens)
Supports multimodal inputs

Detailed Comparison

AI Model Comparison Table
Feature
DeepSeek
DeepSeek-V2.5
Google
Gemma 3n E4B Instructed

FAQ

Common questions about DeepSeek-V2.5 vs Gemma 3n E4B Instructed

DeepSeek-V2.5 significantly outperforms across most benchmarks. DeepSeek-V2.5 is made by DeepSeek and Gemma 3n E4B Instructed is made by Google. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.
DeepSeek-V2.5 scores GSM8k: 95.1%, MT-Bench: 90.2%, HumanEval: 89.0%, BBH: 84.3%, AlignBench: 80.4%. Gemma 3n E4B Instructed scores HumanEval: 75.0%, MGSM: 67.0%, MMLU: 64.9%, Global-MMLU-Lite: 64.5%, MBPP: 63.6%.
DeepSeek-V2.5 is 142.9x cheaper for input tokens. DeepSeek-V2.5 costs $0.14/M input and $0.28/M output via deepseek. Gemma 3n E4B Instructed costs $20.00/M input and $40.00/M output via together.
DeepSeek-V2.5 supports 8K tokens and Gemma 3n E4B Instructed supports 32K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.
Key differences include context window (8K vs 32K), input pricing ($0.14 vs $20.00/M), multimodal support (no vs yes), licensing (deepseek vs Proprietary). See the full comparison above for benchmark-by-benchmark results.
DeepSeek-V2.5 is developed by DeepSeek and Gemma 3n E4B Instructed is developed by Google.