RULER 16k
RULER 16k evaluates the official 13-task RULER v1 suite at a 16384-token context budget.
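As a rough illustration of how a single headline number for a 13-task suite might be produced, the sketch below averages per-task scores. It assumes an unweighted mean; the task identifiers and the `aggregate_score` helper are hypothetical names chosen for this example, so check the RULER repository for the authoritative task list and scoring rules.

```python
from statistics import mean

# Context budget stated for RULER 16k.
CONTEXT_BUDGET = 16384

# Assumed task identifiers, for illustration only; the authoritative
# list lives in the RULER repository.
TASKS = [
    "niah_single_1", "niah_single_2", "niah_single_3",
    "niah_multikey_1", "niah_multikey_2", "niah_multikey_3",
    "niah_multivalue", "niah_multiquery",
    "vt", "cwe", "fwe", "qa_1", "qa_2",
]


def aggregate_score(task_scores: dict[str, float]) -> float:
    """Return the unweighted mean over all 13 tasks (an assumed aggregation)."""
    missing = [task for task in TASKS if task not in task_scores]
    if missing:
        # Refuse to report a headline number from a partial run.
        raise ValueError(f"missing scores for tasks: {missing}")
    return mean(task_scores[task] for task in TASKS)
```

Raising on missing tasks keeps a partial run from being reported as a full-suite score.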
RULER 16k Leaderboard
0 models • 0 verified
FAQ
Common questions about RULER 16k
The RULER paper is available at https://arxiv.org/abs/2404.06654. It describes the benchmark methodology, dataset creation, and evaluation criteria.
The RULER code, including the scripts that generate the benchmark data, is available at https://github.com/NVIDIA/RULER.
RULER 16k is categorized under long context and reasoning. The benchmark evaluates text models.