
November 28, 2024
Analyzing LLM Contamination in the Wild
Some LLMs are scoring suspiciously high on benchmarks. A data-driven analysis of which models likely saw test data during training and how to spot it.

Jonathan Chavez
Co-Founder @ LLM Stats

Some LLMs are scoring suspiciously high on benchmarks. A data-driven analysis of which models likely saw test data during training and how to spot it.
