AI Flash Report

Granite 4.1 3B vs Grok 4.3: Benchmarks, Pricing & Capabilities Compared

TL;DR: Granite 4.1 3B wins on cost · Grok 4.3 wins on reasoning and long context.

Granite 4.1 3B IBM
Released: 2026-04-29
Context window: 131K tokens
Input price: $0.00 / Mtok
Output price: $0.00 / Mtok

Grok 4.3 xAI
Released: 2026-04-30
Context window: 1M tokens
Input price: $1.25 / Mtok
Output price: $2.50 / Mtok

Benchmark comparison

Benchmark                  Granite 4.1 3B   Grok 4.3
AA Intelligence Index      8.5              53.2
GPQA Diamond               31.4%            90.1%
HLE                        3.4%             35.0%
IF-Bench                   33.7%            81.3%
LiveCodeBench Reasoning    3.0%             64.3%
SciCode                    11.9%            47.3%
TAU2-bench                 19.6%            97.7%
TerminalBench-Hard         2.3%             37.9%
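
To make the size of each gap easier to read, the table can be reduced to per-benchmark differences. The following is a minimal sketch in Python; the scores are copied from the table above, while the names `SCORES` and `score_gaps` are hypothetical and introduced only for illustration. The AA Intelligence Index is an index score rather than a percentage, so its gap is in index points.

```python
# Scores copied from the benchmark table: (Granite 4.1 3B, Grok 4.3).
SCORES = {
    "AA Intelligence Index": (8.5, 53.2),
    "GPQA Diamond": (31.4, 90.1),
    "HLE": (3.4, 35.0),
    "IF-Bench": (33.7, 81.3),
    "LiveCodeBench Reasoning": (3.0, 64.3),
    "SciCode": (11.9, 47.3),
    "TAU2-bench": (19.6, 97.7),
    "TerminalBench-Hard": (2.3, 37.9),
}


def score_gaps(scores):
    """Return {benchmark: Grok 4.3 score minus Granite 4.1 3B score}, rounded to 0.1."""
    return {
        name: round(grok - granite, 1)
        for name, (granite, grok) in scores.items()
    }


gaps = score_gaps(SCORES)
largest = max(gaps, key=gaps.get)   # benchmark with the widest gap
smallest = min(gaps, key=gaps.get)  # benchmark with the narrowest gap

for name, gap in gaps.items():
    print(f"{name}: {gap:+.1f}")
```

On these figures Grok 4.3 leads on every row; the widest gap is TAU2-bench (78.1 points) and the narrowest is HLE (31.6 points).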

Pricing comparison

Metric                                          Granite 4.1 3B   Grok 4.3
Input ($/Mtok)                                  $0.00            $1.25
Output ($/Mtok)                                 $0.00            $2.50
Cached input ($/Mtok)                           not listed       not listed
Cost per 1M-token roundtrip (1M in + 1M out)    $0.00            $3.75
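
The roundtrip figure is input tokens times the input price plus output tokens times the output price, each scaled per million tokens. A minimal Python sketch of that arithmetic follows; the prices are taken from the table above, and `PRICES` and `request_cost` are hypothetical names. It ignores cached-input pricing, which the table does not list.

```python
# List prices in USD per million tokens, copied from the pricing table.
PRICES = {
    "Granite 4.1 3B": {"input": 0.00, "output": 0.00},
    "Grok 4.3": {"input": 1.25, "output": 2.50},
}


def request_cost(model, input_tokens, output_tokens):
    """Estimated USD cost of one request at list price (no caching discount)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Reproduces the roundtrip row: 1M tokens in + 1M tokens out.
print(request_cost("Grok 4.3", 1_000_000, 1_000_000))        # 3.75
print(request_cost("Granite 4.1 3B", 1_000_000, 1_000_000))  # 0.0
```

For a smaller request, say 20,000 input tokens and 1,000 output tokens, the same function gives $0.0275 for Grok 4.3.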

Context window & modalities

Attribute            Granite 4.1 3B   Grok 4.3
Context window       131K tokens      1M tokens
Input modalities     text             text, image
Output modalities    text             text
Knowledge cutoff     not listed       not listed

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

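No shared SWE-bench score. On the coding-adjacent benchmarks listed above, Grok 4.3 leads on all three: LiveCodeBench Reasoning (64.3% vs 3.0%), SciCode (47.3% vs 11.9%), and TerminalBench-Hard (37.9% vs 2.3%).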

Reasoning
→ Grok 4.3
Basis: GPQA Diamond

Granite 4.1 3B 31.4% vs Grok 4.3 90.1% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ Grok 4.3
Basis: Context window

Granite 4.1 3B 131K tokens vs Grok 4.3 1M tokens.

Cost
→ Granite 4.1 3B
Basis: Input $/Mtok

Granite 4.1 3B $0.00/Mtok vs Grok 4.3 $1.25/Mtok input; output is $0.00/Mtok vs $2.50/Mtok.
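
The verdicts above appear to follow one rule: compare the two models on a single basis metric, and report "Insufficient data" when either value is missing. The Python sketch below is one way to express that rule, not the site's actual logic. The function name `verdict`, the `lower_is_better` flag, and the "Tie" outcome are assumptions added for illustration.

```python
def verdict(granite, grok, lower_is_better=False):
    """Pick a winner on one shared metric; None means the score is missing.

    Hypothetical helper: the "Tie" result is an assumption, since the
    page shows no tied verdicts.
    """
    if granite is None or grok is None:
        return "Insufficient data"
    if granite == grok:
        return "Tie"
    granite_wins = (granite < grok) if lower_is_better else (granite > grok)
    return "Granite 4.1 3B" if granite_wins else "Grok 4.3"


# Basis values taken from the verdict blocks above.
print(verdict(None, None))                         # Coding (SWE-bench missing)
print(verdict(31.4, 90.1))                         # Reasoning (GPQA Diamond)
print(verdict(131_000, 1_000_000))                 # Long context (tokens, approximate)
print(verdict(0.00, 1.25, lower_is_better=True))   # Cost (input $/Mtok)
```

Cost is the only basis where a lower value wins, which is why the flag is needed.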

Changelog & releases

Granite 4.1 3B: released 2026-04-29
Grok 4.3: released 2026-04-30
