AI Flash Report

Gemma 4 12B vs MiniMax-M3: Benchmarks, Pricing & Capabilities Compared

TL;DR — Gemma 4 12B wins for general use · MiniMax-M3 wins for reasoning + long-context.

Gemma 4 12B Google
Released
2026-06-03
Context window
131K tokens
Input price
$0.00 / Mtok
Output price
$0.00 / Mtok
MiniMax-M3 MiniMax
Released
2026-06-01
Context window
1M tokens
Input price
$0.30 / Mtok
Output price
$1.20 / Mtok

Benchmark comparison

Benchmark Gemma 4 12B MiniMax-M3
AA Intelligence Index 29.0 54.7
GPQA Diamond 75.3% 92.9%
HLE 14.6% 37.1%
IF-Bench 73.5% 82.9%
LiveCodeBench Reasoning 55.3% 74.0%
SciCode 38.2% 45.4%
TAU2-bench 34.8% 88.9%
TerminalBench-Hard 18.2% 42.4%

Pricing comparison

Metric Gemma 4 12B MiniMax-M3
Input ($/Mtok) $0.00 $0.30
Output ($/Mtok) $0.00 $1.20
Cached input ($/Mtok)
Cost per 1M-token roundtrip (1M in + 1M out) $0.00 $1.50

Context window & modalities

Attribute Gemma 4 12B MiniMax-M3
Context window 131K tokens 1M tokens
Input modalities text, image, video text, image, video
Output modalities text text
Knowledge cutoff

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

No shared coding benchmark.

Reasoning
→ MiniMax-M3
Basis: GPQA Diamond

Gemma 4 12B 75.3% vs MiniMax-M3 92.9% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ MiniMax-M3
Basis: Context window

Gemma 4 12B 131K tokens vs MiniMax-M3 1M tokens.

Cost
→ Gemma 4 12B
Basis: Input $/Mtok

Gemma 4 12B $0/Mtok vs MiniMax-M3 $0.3/Mtok input.

Changelog & releases

Gemma 4 12B
Released 2026-06-03
MiniMax-M3
Released 2026-06-01

Related comparisons