RULER 64k
RULER 64k evaluates the official 13-task RULER v1 suite at a 65536-token context budget.
Progress Over Time
Interactive timeline showing model performance evolution on RULER 64k
No timeline data available
RULER 64k Leaderboard
0 models • 0 verified
| Context | Cost | License |
|---|
Notice missing or incorrect data?
FAQ
Common questions about RULER 64k
RULER 64k evaluates the official 13-task RULER v1 suite at a 65536-token context budget.
The RULER 64k paper is available at https://arxiv.org/abs/2404.06654. This paper provides detailed information about the benchmark methodology, dataset creation, and evaluation criteria.
The RULER 64k dataset is available at https://github.com/NVIDIA/RULER.
RULER 64k is categorized under long context and reasoning. The benchmark evaluates text models.