A Thorough Analysis of LLM Biases
In this post, we'll look at biases from a quantitative perspective across a variety of dimensions and models.
Benchmarks & Analysis
Independent analysis and practical insights into language models, exploring real-world performance, limitations, and capabilities.
In this post, we'll look at biases from a quantitative perspective across a variety of dimensions and models.
An introduction to applied LLMs, including a discussion of the current state of the field and some of the most important applications.
A detailed analysis of the pros and cons of fine-tuning vs RAG for building a custom LLM.
Same LLM, different performance scores. Here's how quantization choices by providers like Anthropic, OpenAI, and others affect model behavior and speed.
Some LLMs are scoring suspiciously high on benchmarks. A data-driven analysis of which models likely saw test data during training and how to spot it.