AI Updates Today
Track AI model updates and LLM releases in real time. Version releases, API changes, and improvements for GPT, Claude, Gemini, Llama, and 500+ language models.
Model Version Timeline
Track all LLM releases and version updates
Understanding LLM Versioning
AI model versioning follows patterns that help developers understand capabilities and stability. Major versions (GPT-3 → GPT-4, Claude 2 → Claude 3) indicate significant capability improvements and may require prompt adjustments. Minor updates (GPT-4 → GPT-4 Turbo) offer performance optimizations, cost reductions, or context window expansions while maintaining compatibility.
Organizations use various naming conventions: OpenAI uses dated snapshots (gpt-4-0613), Anthropic uses descriptive tiers (Claude 3.5 Sonnet), and Google uses generation markers (Gemini 1.5 Pro). Understanding these patterns helps you make informed decisions about when to upgrade and how to manage deprecations.
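A practical consequence of these naming conventions: dated snapshots stay frozen, while alias names (like a bare model family name) can silently change behavior when the provider ships an update. The sketch below illustrates distinguishing the two; the function name, field names, and the four-digit-suffix convention are assumptions modeled on OpenAI-style identifiers, not an official schema.

```python
import re

def parse_model_name(name: str) -> dict:
    """Split a model identifier into its family and an optional dated snapshot.

    Treats a trailing four-digit segment (e.g. 'gpt-4-0613') as a pinned
    snapshot; anything else is considered a floating alias. Illustrative only.
    """
    m = re.match(r"^(?P<family>.+?)-(?P<snapshot>\d{4})$", name)
    if m:
        return {"family": m.group("family"), "snapshot": m.group("snapshot")}
    return {"family": name, "snapshot": None}

print(parse_model_name("gpt-4-0613"))   # pinned: behavior frozen at that date
print(parse_model_name("gpt-4-turbo"))  # alias: may change under you
```

Pinning a snapshot in production trades automatic improvements for reproducibility; teams commonly pin, then re-evaluate prompts against each new snapshot before upgrading.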
Active AI Organizations
Track model releases from leading AI labs
OpenAI
Anthropic
StepFun
Moonshot AI
Zhipu AI
LG AI Research
MiniMax
Xiaomi
Compare AI models
Free side-by-side comparisons
The Pace of AI Development
The AI industry is releasing new models at an unprecedented rate. We track 242+ model releases across major organizations. Capabilities that seemed cutting-edge months ago are now baseline expectations.
Key trends include reasoning models (OpenAI o1, DeepSeek-R1) trading speed for accuracy, multimodal capabilities becoming standard across frontier models, and efficiency improvements delivering GPT-4-level performance at dramatically lower costs.
API Provider Updates
Pricing, latency, and feature updates from inference providers
OpenAI
Replicate
DeepInfra
Novita
Fireworks
Bedrock
xAI
Anthropic
Choosing an API Provider
Key factors for selecting an inference provider
Pricing models
Providers charge per token (input and output priced separately), per request, or offer committed-use discounts. For high-volume apps, a $0.50-per-million-tokens price difference can translate into thousands of dollars in monthly savings.
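The arithmetic behind that claim is simple to check. A minimal sketch, with hypothetical per-million-token prices and volumes (none of these figures come from a real provider's price sheet):

```python
def monthly_cost(input_m: float, output_m: float,
                 in_price: float, out_price: float) -> float:
    """Estimated monthly bill in USD.

    input_m / output_m: tokens per month, in millions.
    in_price / out_price: USD per million tokens.
    """
    return input_m * in_price + output_m * out_price

# Two hypothetical providers differing by $0.50/M on output tokens only.
provider_a = monthly_cost(2000, 500, in_price=3.00, out_price=15.00)
provider_b = monthly_cost(2000, 500, in_price=3.00, out_price=14.50)

print(provider_a - provider_b)  # 250.0 USD/month at this volume
```

At 500M output tokens a month, a $0.50/M gap is $250; multiply the volume tenfold and the same gap reaches thousands of dollars.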
Latency & throughput
First-token latency matters for interactive apps; total generation time for batch processing. Throughput (tokens/sec) is critical for real-time applications and agent workflows.
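Time-to-first-token (TTFT) and total generation time can be measured against any provider whose SDK exposes a streaming iterator. The helper below is a generic sketch; `fake_stream` is a stand-in for a real streaming API call, and the delays are invented for demonstration.

```python
import time

def time_to_first_token(stream):
    """Consume a token iterator; return (ttft_s, total_s, tokens)."""
    start = time.perf_counter()
    first = None
    tokens = []
    for tok in stream:
        if first is None:
            # Latency until the very first token arrives.
            first = time.perf_counter() - start
        tokens.append(tok)
    return first, time.perf_counter() - start, tokens

def fake_stream():
    """Simulated streaming response standing in for a provider SDK."""
    time.sleep(0.05)   # network round-trip + prefill before the first token
    yield "Hello"
    for tok in [",", " world"]:
        time.sleep(0.01)  # per-token decode time
        yield tok

ttft, total, toks = time_to_first_token(fake_stream())
print(f"TTFT={ttft:.3f}s  total={total:.3f}s  tokens={len(toks)}")
```

For interactive apps, optimize TTFT; for batch jobs, total time (or tokens/sec over the whole response) is the number that matters.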
Model selection
First-party providers (OpenAI, Anthropic) offer the latest models first. Third-party providers (Together, Fireworks, Groq) often deliver the same quality at lower cost, plus open-source alternatives.
Reliability & support
Uptime, rate limits, and SLAs vary significantly. For production workloads, consider multi-provider strategies with automatic failover. Check our provider rankings.
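A multi-provider failover strategy can be as simple as an ordered list of backends tried in sequence. The sketch below shows the pattern; `ProviderError`, the provider names, and the stand-in callables are all hypothetical, where a real setup would wrap each provider's SDK call.

```python
class ProviderError(Exception):
    """Placeholder for the transient errors a real SDK would raise
    (rate limits, timeouts, 5xx responses)."""

def call_with_failover(prompt, providers):
    """Try each (name, callable) pair in priority order.

    Returns (provider_name, reply) from the first success; raises only
    if every provider fails.
    """
    errors = {}
    for name, call in providers:
        try:
            return name, call(prompt)
        except ProviderError as exc:
            errors[name] = str(exc)  # record and fall through to the next
    raise RuntimeError(f"all providers failed: {errors}")

def flaky(prompt):
    raise ProviderError("rate limited")

def stable(prompt):
    return f"echo: {prompt}"

name, reply = call_with_failover("hi", [("primary", flaky), ("fallback", stable)])
print(name, reply)  # fallback echo: hi
```

Production versions typically add retries with backoff on the primary before failing over, and health checks so a recovered primary is promoted back.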
Frequently Asked Questions
Common questions about LLM updates, version releases, and API changes
What are the latest LLM version updates?
How do I track API changes and pricing updates?
How do I compare different versions of an LLM?
Which organizations release the most LLM updates?
How often is this page updated?
Explore More
Dive deeper into LLM data, benchmarks, and comparisons