Model Comparison

Nova Pro vs Phi-4-multimodal-instruct

Nova Pro significantly outperforms across most benchmarks. Phi-4-multimodal-instruct is 22.4x cheaper per token.

Performance Benchmarks

Comparative analysis across standard metrics

4 benchmarks

Nova Pro outperforms in 4 benchmarks (ChartQA, DocVQA, MMMU, TextVQA), while Phi-4-multimodal-instruct is better at 0 benchmarks.

Nova Pro significantly outperforms across most benchmarks.

Wed Apr 15 2026 • llm-stats.com

Arena Performance

Human preference votes

Pricing Analysis

Price comparison per million tokens

Phi-4-multimodal-instruct costs less

For input processing, Nova Pro ($0.80/1M tokens) is 16.0x more expensive than Phi-4-multimodal-instruct ($0.05/1M tokens).

For output processing, Nova Pro ($3.20/1M tokens) is 32.0x more expensive than Phi-4-multimodal-instruct ($0.10/1M tokens).

In conclusion, Nova Pro is more expensive than Phi-4-multimodal-instruct.*

* Using a 3:1 ratio of input to output tokens

Lowest available price from all providers
Wed Apr 15 2026 • llm-stats.com
Amazon
Nova Pro
Input tokens$0.80
Output tokens$3.20
Best providerAWS Bedrock
Microsoft
Phi-4-multimodal-instruct
Input tokens$0.05
Output tokens$0.10
Best providerDeepinfra
Notice missing or incorrect data?Start an Issue

Context Window

Maximum input and output token capacity

Nova Pro accepts 300,000 input tokens compared to Phi-4-multimodal-instruct's 128,000 tokens. Nova Pro can generate longer responses up to 300,000 tokens, while Phi-4-multimodal-instruct is limited to 128,000 tokens.

Amazon
Nova Pro
Input300,000 tokens
Output300,000 tokens
Microsoft
Phi-4-multimodal-instruct
Input128,000 tokens
Output128,000 tokens
Wed Apr 15 2026 • llm-stats.com

Input Capabilities

Supported data types and modalities

Both Nova Pro and Phi-4-multimodal-instruct support multimodal inputs.

They are both capable of processing various types of data, offering versatility in application.

Nova Pro

Text
Images
Audio
Video

Phi-4-multimodal-instruct

Text
Images
Audio
Video

License

Usage and distribution terms

Nova Pro is licensed under a proprietary license, while Phi-4-multimodal-instruct uses MIT.

License differences may affect how you can use these models in commercial or open-source projects.

Nova Pro

Proprietary

Closed source

Phi-4-multimodal-instruct

MIT

Open weights

Release Timeline

When each model was launched

Nova Pro was released on 2024-11-20, while Phi-4-multimodal-instruct was released on 2025-02-01.

Phi-4-multimodal-instruct is 2 months newer than Nova Pro.

Nova Pro

Nov 20, 2024

1.4 years ago

Phi-4-multimodal-instruct

Feb 1, 2025

1.2 years ago

2mo newer

Knowledge Cutoff

When training data ends

Phi-4-multimodal-instruct has a documented knowledge cutoff of 2024-06-01, while Nova Pro's cutoff date is not specified.

We can confirm Phi-4-multimodal-instruct's training data extends to 2024-06-01, but cannot make a direct comparison without Nova Pro's cutoff date.

Nova Pro

Phi-4-multimodal-instruct

Jun 2024

Provider Availability

Nova Pro is available from Bedrock. Phi-4-multimodal-instruct is available from DeepInfra.

Nova Pro

bedrock logo
AWS Bedrock
Input Price:Input: $0.80/1MOutput Price:Output: $3.20/1M

Phi-4-multimodal-instruct

deepinfra logo
Deepinfra
Input Price:Input: $0.05/1MOutput Price:Output: $0.10/1M
* Prices shown are per million tokens

Outputs Comparison

Notice missing or incorrect data?Start an Issue discussion

Key Takeaways

Larger context window (300,000 tokens)
Higher ChartQA score (89.2% vs 81.4%)
Higher DocVQA score (93.5% vs 93.2%)
Higher MMMU score (61.7% vs 55.1%)
Higher TextVQA score (81.5% vs 75.6%)
Less expensive input tokens
Less expensive output tokens
Has open weights

Detailed Comparison

AI Model Comparison Table
Feature
Amazon
Nova Pro
Microsoft
Phi-4-multimodal-instruct

FAQ

Common questions about Nova Pro vs Phi-4-multimodal-instruct

Nova Pro significantly outperforms across most benchmarks. Nova Pro is made by Amazon and Phi-4-multimodal-instruct is made by Microsoft. The best choice depends on your use case — compare their benchmark scores, pricing, and capabilities above.
Nova Pro scores ARC-C: 94.8%, GSM8k: 94.8%, DocVQA: 93.5%, IFEval: 92.1%, ChartQA: 89.2%. Phi-4-multimodal-instruct scores ScienceQA Visual: 97.5%, DocVQA: 93.2%, MMBench: 86.7%, POPE: 85.6%, OCRBench: 84.4%.
Phi-4-multimodal-instruct is 16.0x cheaper for input tokens. Nova Pro costs $0.80/M input and $3.20/M output via bedrock. Phi-4-multimodal-instruct costs $0.05/M input and $0.10/M output via deepinfra.
Nova Pro supports 300K tokens and Phi-4-multimodal-instruct supports 128K tokens. A larger context window lets you process longer documents, conversations, or codebases in a single request.
Key differences include context window (300K vs 128K), input pricing ($0.80 vs $0.05/M), licensing (Proprietary vs MIT). See the full comparison above for benchmark-by-benchmark results.
Nova Pro is developed by Amazon and Phi-4-multimodal-instruct is developed by Microsoft.