Model Comparison

Gemini 2.0 Flash-Lite vs Gemma 3n E4B Instructed

Gemini 2.0 Flash-Lite significantly outperforms across most benchmarks. Gemini 2.0 Flash-Lite is 196.1x cheaper per token.

Want to compare interactively?Try the playground

Performance Benchmarks

Comparative analysis across standard metrics

5 benchmarks

Gemini 2.0 Flash-Lite outperforms in 5 benchmarks (Global-MMLU-Lite, GPQA, HiddenMath, LiveCodeBench v5, MMLU-Pro), while Gemma 3n E4B Instructed is better at 0 benchmarks.

Gemini 2.0 Flash-Lite significantly outperforms across most benchmarks.

Sat May 30 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

Gemini 2.0 Flash-Lite costs less

For input processing, Gemini 2.0 Flash-Lite ($0.07/1M tokens) is 285.7x cheaper than Gemma 3n E4B Instructed ($20.00/1M tokens).

For output processing, Gemini 2.0 Flash-Lite ($0.30/1M tokens) is 133.3x cheaper than Gemma 3n E4B Instructed ($40.00/1M tokens).

In conclusion, Gemma 3n E4B Instructed is more expensive than Gemini 2.0 Flash-Lite.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers

Sat May 30 2026 • llm-stats.com

Gemini 2.0 Flash-Lite

Input tokens$0.07

Output tokens$0.30

Best providerGoogle

Gemma 3n E4B Instructed

Input tokens$20.00

Output tokens$40.00

Best providerTogether

Notice missing or incorrect data?Start an Issue→

Context Window

Maximum input and output token capacity

Gemini 2.0 Flash-Lite accepts 1,048,576 input tokens compared to Gemma 3n E4B Instructed's 32,000 tokens. Gemma 3n E4B Instructed can generate longer responses up to 32,000 tokens, while Gemini 2.0 Flash-Lite is limited to 8,192 tokens.

Gemini 2.0 Flash-Lite

Input1,048,576 tokens

Output8,192 tokens

Gemma 3n E4B Instructed

Input32,000 tokens

Output32,000 tokens

Sat May 30 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Both Gemini 2.0 Flash-Lite and Gemma 3n E4B Instructed support multimodal inputs.

They are both capable of processing various types of data, offering versatility in application.

Gemini 2.0 Flash-Lite

Text

Images

Audio

Video

Gemma 3n E4B Instructed

Text

Images

Audio

Video

License

Usage and distribution terms

Both models are licensed under proprietary licenses.

Both models have usage restrictions defined by their respective organizations.

Gemini 2.0 Flash-Lite

Proprietary

Closed source

Gemma 3n E4B Instructed

Proprietary

Closed source

Release Timeline

When each model was launched

Gemini 2.0 Flash-Lite was released on 2025-02-05, while Gemma 3n E4B Instructed was released on 2025-06-26.

Gemma 3n E4B Instructed is 5 months newer than Gemini 2.0 Flash-Lite.

Gemini 2.0 Flash-Lite

Feb 5, 2025

1.3 years ago

Gemma 3n E4B Instructed

Jun 26, 2025

11 months ago

4mo newer

Knowledge Cutoff

When training data ends

Both models have the same knowledge cutoff date of 2024-06-01.

They should have similar awareness of historical events and information up to this date.

Gemini 2.0 Flash-Lite

Jun 2024

Gemma 3n E4B Instructed

Jun 2024

Provider Availability

Gemini 2.0 Flash-Lite is available from Google. Gemma 3n E4B Instructed is available from Together.

Gemini 2.0 Flash-Lite

Google

Input Price:Input: $0.07/1MOutput Price:Output: $0.30/1M

Gemma 3n E4B Instructed

Together

Input Price:Input: $20.00/1MOutput Price:Output: $40.00/1M

* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion→

Key Takeaways

Gemini 2.0 Flash-Lite

View details

Google

Larger context window (1,048,576 tokens)

Less expensive input tokens

Less expensive output tokens

Higher Global-MMLU-Lite score (78.2% vs 64.5%)

Higher GPQA score (51.5% vs 23.7%)

Higher HiddenMath score (55.3% vs 37.7%)

Higher LiveCodeBench v5 score (28.9% vs 25.7%)

Higher MMLU-Pro score (71.6% vs 50.6%)

Gemma 3n E4B Instructed

View details

Google

No standout differentiators in the data we have for this pair.

Detailed Comparison

AI Model Comparison Table
Feature	Gemini 2.0 Flash-Lite	Gemma 3n E4B Instructed

FAQ

Common questions about Gemini 2.0 Flash-Lite vs Gemma 3n E4B Instructed.

Which is better, Gemini 2.0 Flash-Lite or Gemma 3n E4B Instructed?

Gemini 2.0 Flash-Lite significantly outperforms across most benchmarks. Gemini 2.0 Flash-Lite is made by Google and Gemma 3n E4B Instructed is made by Google. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.

How does Gemini 2.0 Flash-Lite compare to Gemma 3n E4B Instructed in benchmarks?

Gemini 2.0 Flash-Lite scores MATH: 86.8%, FACTS Grounding: 83.6%, Global-MMLU-Lite: 78.2%, MMLU-Pro: 71.6%, MMMU: 68.0%. Gemma 3n E4B Instructed scores HumanEval: 75.0%, MGSM: 67.0%, MMLU: 64.9%, Global-MMLU-Lite: 64.5%, MBPP: 63.6%.

Is Gemini 2.0 Flash-Lite cheaper than Gemma 3n E4B Instructed?

Gemini 2.0 Flash-Lite is 285.7x cheaper for input tokens. Gemini 2.0 Flash-Lite costs $0.07/M input and $0.30/M output via google. Gemma 3n E4B Instructed costs $20.00/M input and $40.00/M output via together.

What are the context window sizes for Gemini 2.0 Flash-Lite and Gemma 3n E4B Instructed?

Gemini 2.0 Flash-Lite supports 1.0M tokens and Gemma 3n E4B Instructed supports 32K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.

What are the main differences between Gemini 2.0 Flash-Lite and Gemma 3n E4B Instructed?

Key differences include context window (1.0M vs 32K), input pricing ($0.07 vs $20.00/M). See the full comparison above for benchmark-by-benchmark results.