Benchmarks/video/ActivityNet

ActivityNet

A large-scale video benchmark for human activity understanding. Provides samples from 203 activity classes with an average of 137 untrimmed videos per class and 1.41 activity instances per video, for a total of 849 video hours. The benchmark covers a wide range of complex human activities that are of interest to people in their daily living and can be used to compare algorithms for three scenarios: untrimmed video classification, trimmed activity classification, and activity detection.

PaperImplementation

Progress Over Time

Interactive timeline showing model performance evolution on ActivityNet

State-of-the-art frontier
Open
Proprietary

ActivityNet Leaderboard

1 models
ContextCostLicense
1
OpenAI
OpenAI
128K$2.50 / $10.00
Notice missing or incorrect data?

FAQ

Common questions about ActivityNet

A large-scale video benchmark for human activity understanding. Provides samples from 203 activity classes with an average of 137 untrimmed videos per class and 1.41 activity instances per video, for a total of 849 video hours. The benchmark covers a wide range of complex human activities that are of interest to people in their daily living and can be used to compare algorithms for three scenarios: untrimmed video classification, trimmed activity classification, and activity detection.
The ActivityNet paper is available at https://openaccess.thecvf.com/content_cvpr_2015/html/Heilbron_ActivityNet_A_Large-Scale_2015_CVPR_paper.html. This paper provides detailed information about the benchmark methodology, dataset creation, and evaluation criteria.
The ActivityNet dataset is available at https://github.com/activitynet/ActivityNet.
The ActivityNet leaderboard ranks 1 AI models based on their performance on this benchmark. Currently, GPT-4o by OpenAI leads with a score of 0.619. The average score across all models is 0.619.
The highest ActivityNet score is 0.619, achieved by GPT-4o from OpenAI.
1 models have been evaluated on the ActivityNet benchmark, with 0 verified results and 1 self-reported results.
ActivityNet is categorized under video and vision. The benchmark evaluates video models.