Common Voice 15
Common Voice is a massively-multilingual collection of transcribed speech intended for speech technology research and development. Version 15.0 contains 28,750 recorded hours across 114 languages, consisting of crowdsourced voice recordings with corresponding transcriptions.
Common Voice 15 Leaderboard
1 model
| Rank | Model | Organization | Parameters | Score | Context | Cost | License |
|---|---|---|---|---|---|---|---|
| 1 | Qwen2.5-Omni-7B | Alibaba Cloud / Qwen Team | 7B | 0.076 | — | — | — |
FAQ
Common questions about Common Voice 15
The paper describing the Common Voice corpus is available at https://arxiv.org/abs/1912.06670. It provides detailed information about how the dataset was created and the methodology behind it.
The Common Voice 15 leaderboard currently ranks 1 AI model based on its performance on this benchmark. Qwen2.5-Omni-7B by Alibaba Cloud / Qwen Team leads with a score of 0.076. Because it is the only entry, the average score across all models is also 0.076.
The highest Common Voice 15 score is 0.076, achieved by Qwen2.5-Omni-7B from Alibaba Cloud / Qwen Team.
1 model has been evaluated on the Common Voice 15 benchmark, with 0 verified results and 1 self-reported result.
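The summary figures quoted in these answers (entry count, leading model, average score, verified versus self-reported counts) can be reproduced from the leaderboard rows. The following is a minimal sketch, not the site's actual implementation: the `Entry` class and `summarize` function are hypothetical names introduced for illustration, and it assumes a higher score ranks first, matching the page's "highest score" wording. The page does not state the metric; if the score is an error rate, `min` would replace `max`.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Entry:
    """One leaderboard row (hypothetical structure, for illustration only)."""
    model: str
    organization: str
    score: float
    verified: bool  # False means the result is self-reported


def summarize(entries):
    """Compute the summary statistics quoted in the FAQ answers."""
    if not entries:
        raise ValueError("leaderboard is empty")
    # Assumption: a higher score ranks first, as the page's wording implies.
    top = max(entries, key=lambda e: e.score)
    return {
        "count": len(entries),
        "top_model": top.model,
        "top_score": top.score,
        "average": mean(e.score for e in entries),
        "verified": sum(e.verified for e in entries),
        "self_reported": sum(not e.verified for e in entries),
    }


# The single entry currently listed on this page.
entries = [
    Entry("Qwen2.5-Omni-7B", "Alibaba Cloud / Qwen Team", 0.076, verified=False),
]
summary = summarize(entries)
print(summary)
```

With a single entry, the average necessarily equals the top score, which is why both figures on this page are 0.076.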
Common Voice 15 is categorized under audio, language, and speech-to-text. The benchmark evaluates audio models with multilingual support.