AI Flash Report

DeepSeek V4 Flash vs Granite 4.1 30B: Benchmarks, Pricing & Capabilities Compared

TL;DR — DeepSeek V4 Flash wins for reasoning + long-context · Granite 4.1 30B wins for general use.

Released
2026-04-24
Context window
1M tokens
Input price
$0.14 / Mtok
Output price
$0.28 / Mtok
Released
2026-04-29
Context window
131K tokens
Input price
$0.00 / Mtok
Output price
$0.00 / Mtok

Benchmark comparison

Benchmark DeepSeek V4 Flash Granite 4.1 30B
AA Intelligence Index 46.5 14.7
GPQA Diamond 89.4% 48.1%
HLE 32.1% 4.2%
IF-Bench 79.2% 44.4%
LiveCodeBench Reasoning 63.0% 18.7%
SciCode 44.9% 25.8%
TAU2-bench 95.0% 42.1%
TerminalBench-Hard 35.6% 2.3%

Pricing comparison

Metric DeepSeek V4 Flash Granite 4.1 30B
Input ($/Mtok) $0.14 $0.00
Output ($/Mtok) $0.28 $0.00
Cached input ($/Mtok)
Cost per 1M-token roundtrip (1M in + 1M out) $0.42 $0.00

Context window & modalities

Attribute DeepSeek V4 Flash Granite 4.1 30B
Context window 1M tokens 131K tokens
Input modalities text text
Output modalities text text
Knowledge cutoff

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

No shared coding benchmark.

Reasoning
→ DeepSeek V4 Flash
Basis: GPQA Diamond

DeepSeek V4 Flash 89.4% vs Granite 4.1 30B 48.1% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ DeepSeek V4 Flash
Basis: Context window

DeepSeek V4 Flash 1M tokens vs Granite 4.1 30B 131K tokens.

Cost
→ Granite 4.1 30B
Basis: Input $/Mtok

DeepSeek V4 Flash $0.14/Mtok vs Granite 4.1 30B $0/Mtok input.

Changelog & releases

DeepSeek V4 Flash
Released 2026-04-24
Granite 4.1 30B
Released 2026-04-29

Related comparisons