AI Flash Report

Claude Sonnet 4.6 vs Mistral Large 3: Benchmarks, Pricing & Capabilities Compared

TL;DR — Claude Sonnet 4.6 wins for long-context · Mistral Large 3 wins for cost.

Claude Sonnet 4.6 Anthropic
Released
2026-02-17
Context window
500K tokens
Input price
$3.00 / Mtok
Output price
$15.00 / Mtok
Key features
  • Agent Teams: orchestrate 2-16 Claude instances
  • Near-Opus performance at 1/5th cost
  • 80.8% SWE-bench Verified
Mistral Large 3 Mistral
Released
2025-12-15
Context window
256K tokens
Input price
$2.00 / Mtok
Output price
$6.00 / Mtok
Key features
  • 128K context window
  • Improved multilingual capabilities
  • Enhanced function calling

Benchmark comparison

Benchmark Claude Sonnet 4.6 Mistral Large 3
HumanEval 95.2% 91.2%
MMLU 92.1% 89.4%

Pricing comparison

Metric Claude Sonnet 4.6 Mistral Large 3
Input ($/Mtok) $3.00 $2.00
Output ($/Mtok) $15.00 $6.00
Cached input ($/Mtok) $0.30
Cost per 1M-token roundtrip (1M in + 1M out) $18.00 $8.00

Context window & modalities

Attribute Claude Sonnet 4.6 Mistral Large 3
Context window 500K tokens 256K tokens
Input modalities text, image, PDF text, image
Output modalities text text
Knowledge cutoff 2025-10 2025-07

Verdict by use case

Coding
→ Claude Sonnet 4.6
Basis: HumanEval

Claude Sonnet 4.6 95.2% vs Mistral Large 3 91.2% on HumanEval.

Reasoning
→ Claude Sonnet 4.6
Basis: MMLU-Pro

Claude Sonnet 4.6 92.1% vs Mistral Large 3 89.4% on MMLU-Pro.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ Claude Sonnet 4.6
Basis: Context window

Claude Sonnet 4.6 500K tokens vs Mistral Large 3 256K tokens.

Cost
→ Mistral Large 3
Basis: Input $/Mtok

Claude Sonnet 4.6 $3/Mtok vs Mistral Large 3 $2/Mtok input.

Changelog & releases

Claude Sonnet 4.6
Released 2026-02-17
  • Agent Teams: orchestrate 2–16 Claude instances in parallel
  • +8.5pt on SWE-bench Verified vs Sonnet 4
  • 1/5 the cost of Opus 4.5 at ~95% of coding quality
  • Fast mode research preview for lower-latency inference
Mistral Large 3
Released 2025-12-15
Predecessor: mistral-mistral-large

Related comparisons