AI Flash Report

Granite 4.1 8B vs Grok 4.3: Benchmarks, Pricing & Capabilities Compared

TL;DR — Granite 4.1 8B wins for cost · Grok 4.3 wins for reasoning + long-context.

Released
2026-04-29
Context window
131K tokens
Input price
$0.05 / Mtok
Output price
$0.10 / Mtok
Grok 4.3 xAI
Released
2026-04-30
Context window
1M tokens
Input price
$1.25 / Mtok
Output price
$2.50 / Mtok

Benchmark comparison

Benchmark Granite 4.1 8B Grok 4.3
AA Intelligence Index 12.4 53.2
GPQA Diamond 43.3% 90.1%
HLE 3.8% 35.0%
IF-Bench 38.6% 81.3%
LiveCodeBench Reasoning 12.0% 64.3%
SciCode 21.8% 47.3%
TAU2-bench 27.8% 97.7%
TerminalBench-Hard 0.0% 37.9%

Pricing comparison

Metric Granite 4.1 8B Grok 4.3
Input ($/Mtok) $0.05 $1.25
Output ($/Mtok) $0.10 $2.50
Cached input ($/Mtok)
Cost per 1M-token roundtrip (1M in + 1M out) $0.15 $3.75

Context window & modalities

Attribute Granite 4.1 8B Grok 4.3
Context window 131K tokens 1M tokens
Input modalities text text, image
Output modalities text text
Knowledge cutoff

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

No shared coding benchmark.

Reasoning
→ Grok 4.3
Basis: GPQA Diamond

Granite 4.1 8B 43.3% vs Grok 4.3 90.1% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ Grok 4.3
Basis: Context window

Granite 4.1 8B 131K tokens vs Grok 4.3 1M tokens.

Cost
→ Granite 4.1 8B
Basis: Input $/Mtok

Granite 4.1 8B $0.05/Mtok vs Grok 4.3 $1.25/Mtok input.

Changelog & releases

Granite 4.1 8B
Released 2026-04-29
Grok 4.3
Released 2026-04-30

Related comparisons