AI Flash Report

The frontier AI model tracker

Every major model release in one place — benchmarks, context windows, pricing, and the launch coverage that matters. Updated as models ship.

Top 10 on the leaderboard

All benchmarks →
# Model Company Score
1 Gemini 3.1 Pro Google 77.1%

All tracked models

See full hub →
All 59 tracked models
Model Company Released
Gemini 3.1 Pro
Google's flagship reasoning model with a 2x jump on hard multi-step tasks.
Google 2026-02-19
Claude Sonnet 4.6
Near-Opus quality at a fraction of the cost, with Agent Teams orchestration.
Anthropic 2026-02-17
DeepSeek V3.2
Open-weight MoE with a 1M+ token context window and strong coding.
DeepSeek 2026-02-12
GLM-5
First frontier model trained entirely on Huawei Ascend silicon.
Zhipu AI 2026-02-11
Seedance 2.0
ByteDance's video generation flagship — cinematic quality at scale
ByteDance 2026-02-10
GPT-5.3 Codex
Coding-specialized variant of GPT-5.3, tuned for agentic IDE workflows.
OpenAI 2026-02-05
Kimi K2
Moonshot's open-weight frontier MoE with strong agentic benchmarks.
Moonshot AI 2026-01-20
GPT-5.2 Codex
Prior-gen Codex variant of GPT-5.2 for agentic coding.
OpenAI 2025-12-18
Mistral Large 3
Mistral's flagship proprietary model, tuned for European enterprise.
Mistral 2025-12-15
GPT-5.2
Late-2025 GPT-5 refresh with improved reasoning and steerability.
OpenAI 2025-12-11
Claude Opus 4.5
Anthropic's top-tier reasoning model for complex research and agents.
Anthropic 2025-11-24
Gemini 3 Pro
First Gemini 3 tier release; strong multimodal + long-context.
Google 2025-11-18
GPT-5.1
Maintenance update to GPT-5 with steerability + latency improvements.
OpenAI 2025-11-12
GPT-5
OpenAI's flagship unified reasoning + chat model replacing the GPT-4 line.
OpenAI 2025-08-15
Claude Opus 4.1
Mid-2025 Opus refresh focused on agentic coding reliability.
Anthropic 2025-07-15
Gemini 2.5 Flash
Google's cost-optimized multimodal model with thinking mode.
Google 2025-06-20
Claude Sonnet 4
Claude 4 mid-tier with strong coding and long-horizon agentic reliability.
Anthropic 2025-05-22
Claude Sonnet 3.7
Extended-thinking update to Sonnet 3.5 with visible reasoning toggle.
Anthropic 2025-02-24
DeepSeek-V3
DeepSeek's breakthrough open-weight MoE rivaling GPT-4-class quality.
DeepSeek 2024-12-26
Gemini 2.0 Flash
First Gemini 2 model — fast, cheap, multimodal, with tool use native.
Google 2024-12-11
Moonshot Kimi
China's long-context pioneer — 1M token window at launch
Moonshot AI 2024-10-16
GPT-o1-preview
OpenAI's first public reasoning model — 'thinking before answering'
OpenAI 2024-09-12
Grok-2
Elon's second-gen Grok — real-time X/Twitter data access
xAI 2024-08-13
GitHub Copilot Chat
AI pair programmer embedded in VS Code and GitHub — 1M+ enterprise seats
Microsoft 2024-07-25
Claude 3.5 Sonnet
The mid-2024 Sonnet release that set the SOTA bar for coding and agents.
Anthropic 2024-06-20
Grok-1.5
Grok's reasoning upgrade — first xAI model with strong STEM benchmarks
xAI 2024-03-28
Claude 3 Opus
Anthropic's most powerful pre-Claude 4 model — tops GPT-4 on reasoning
Anthropic 2024-03-04
Claude 3 Sonnet
Balanced Claude 3 variant — best price/performance in the family
Anthropic 2024-03-04
Claude 3 Haiku
Fastest and cheapest Claude 3 — sub-second latency at $0.25/M
Anthropic 2024-03-04
Mistral Large Mistral 2024-02-26
Sora
OpenAI's text-to-video model — up to 60-second photorealistic clips
OpenAI 2024-02-15
Gemini 1.5 Pro
Google's first 1M-context model — multimodal needle-in-haystack champion
Google 2024-02-15
text-embedding-3-large
OpenAI's best embedding model — 3× cheaper than ada-002 with better MTEB
OpenAI 2024-01-25
GPT-4 Turbo
GPT-4 with 128K context and knowledge through April 2023 — 3× cheaper than GPT-4
OpenAI 2024-01-25
Midjourney v6
Midjourney's photorealism leap — text rendering finally works
Midjourney 2023-12-21
Grok-1
xAI's open-source release — 314B MoE, first frontier model fully open-sourced
xAI 2023-12-07
AlphaCode 2
DeepMind's coding specialist — top 15% of competitive programmers
Google DeepMind 2023-12-06
Gemini Ultra
Google's first model to beat GPT-4 on MMLU — 90%+ with CoT
Google 2023-12-06
Gemini Pro
Google's workhorse Gemini model — free tier in Google AI Studio
Google 2023-12-06
Claude 2.1
Claude 2 update — 200K context and reduced hallucinations
Anthropic 2023-11-21
Whisper v3
State-of-the-art open speech recognition — 99 languages, open weights
OpenAI 2023-11-06
DALL-E 3
OpenAI's best image model — coherent text in images, integrated into ChatGPT
OpenAI 2023-10-04
GPT-4V (Vision)
GPT-4 gains eyes — the model that made multimodal mainstream
OpenAI 2023-09-25
Code Llama 34B
Meta's code-specialized open model — top open-source coding at launch
Meta 2023-08-24
RT-2 (Robotics Transformer)
Google's vision-language-action model — robots that reason from web knowledge
Google 2023-07-28
Stable Diffusion XL
Open-weight image generation flagship — photorealistic, runs locally
Stability AI 2023-07-26
Llama 2 70B
Meta's open-weight landmark — 70B that matched GPT-3.5 and ignited open AI
Meta 2023-07-18
Claude 2
Claude's first major leap — 100K context and better at instructions
Anthropic 2023-07-11
PaLM 2
Google's multilingual flagship — powers Bard 2023, 100+ languages
Google 2023-05-10
GPT-4
The model that changed everything — GPT-4 set the standard for capable AI
OpenAI 2023-03-14
Claude 1.3
Anthropic's first public model — 100K context ahead of its time
Anthropic 2023-03-14
MusicLM
Google's text-to-music model — high-fidelity, 24kHz stereo from text prompts
Google 2023-01-26
ChatGPT (GPT-3.5 Turbo)
The product that launched the AI era — 100M users in 2 months
OpenAI 2022-11-30
DALL-E 2
OpenAI's image-generation breakthrough — photorealism and inpainting
OpenAI 2022-04-06
PaLM
Google's 540B pathways model — first to demonstrate chain-of-thought at scale
Google 2022-04-04
InstructGPT
RLHF alignment applied to GPT-3 — the precursor to ChatGPT
OpenAI 2022-01-27
Codex
GPT-3 trained on code — the engine behind GitHub Copilot v1
OpenAI 2021-08-10
DALL-E
The original image-generating AI — multimodal GPT-3 that shocked the world
OpenAI 2021-01-05
GPT-3
The model that showed scaling works — 175B parameters, few-shot learning pioneer
OpenAI 2020-06-11

Daily AI news, condensed

A curated daily roundup of frontier-AI news — model launches, research, deals, and product moves.

Read today's brief →