Sources

Where the data comes from

Every score in the catalog is sourced from a public benchmark or live API metric. Here are the primary evaluations and the LLM Stats Score that aggregates them.