February
Major Release
Google's latest flagship model, with a 2x jump in reasoning capabilities
✨ Key Features
- 2x reasoning improvement
- ARC-AGI-2 score of 77.1%
- Enhanced multimodal understanding
- Deep Think mode
Major Release
Anthropic's latest Sonnet with Agent Teams capability and near-Opus performance at a fraction of the cost
✨ Key Features
- Agent Teams: orchestrate 2-16 Claude instances
- Near-Opus performance at one-fifth the cost
- 80.8% SWE-bench Verified
- Fast mode research preview
Update
Major update that expands the context window 10x, to over 1 million tokens
✨ Key Features
- 1M+ token context window (10x expansion)
- Improved reasoning capabilities
- Open source release
- Cost-effective inference
Major Release
First frontier AI model trained entirely without NVIDIA GPUs, using Huawei Ascend chips
✨ Key Features
- First frontier model trained on Huawei Ascend chips (no NVIDIA)
- #1 HLE score (50.4%)
- 1.2% hallucination rate via Slime RL
- 136x cheaper than Claude Opus 4.5
Major Release
First commercial video model generating synchronized audio and video in a single pass
✨ Key Features
- Synchronized audio + video generation in one pass
- Lip-sync in 8+ languages
- 10-30x cheaper than Sora 2
- Commercial pricing: $0.10-$0.80/min
Major Release
OpenAI's specialized self-improving coding model with state-of-the-art software engineering performance
✨ Key Features
- Self-improving agentic coding
- 25% faster than GPT-5.2-Codex
- 1,000+ tokens/sec generation
- First OpenAI model rated 'high' on the company's cybersecurity risk framework