Cohere Agentic Question Answering
Progress Over Time
Interactive timeline showing model performance evolution on Cohere Agentic Question Answering
Cohere Agentic Question Answering Leaderboard
| Context | Cost | License | ||||
|---|---|---|---|---|---|---|
| 1 | Cohere | 218B | — | — |
What is Cohere Agentic Question Answering?
Cohere's internal North evaluation for measuring how well a model answers enterprise questions using MCP-connected cloud file systems. Scores are reported with LLM-as-a-judge techniques.
Cohere Agentic Question Answering is a text benchmark evaluating models on question answering, reasoning, and agents tasks. LLM Stats tracks 1 models on this benchmark, scored on a 0–1 scale. The current average is 0.7, with the leader at 0.7.
Compare leaders on the best AI for question answering, best AI for reasoning and best AI for agents leaderboards.
Current leaders
Command A+ from Cohere currently leads the Cohere Agentic Question Answering leaderboard with a score of 0.650 across 1 evaluated AI models.
FAQ
Common questions about the Cohere Agentic Question Answering benchmark and leaderboard.