AI Flash Report

DeepSeek V4 Flash vs Qwen3.7 Max: Benchmarks, Pricing & Capabilities Compared

TL;DR — DeepSeek V4 Flash wins for cost · Qwen3.7 Max wins for general use.

Released
2026-04-24
Context window
1M tokens
Input price
$0.14 / Mtok
Output price
$0.28 / Mtok
Qwen3.7 Max Alibaba
Released
2026-05-19
Context window
1M tokens
Input price
$2.50 / Mtok
Output price
$7.50 / Mtok

Benchmark comparison

Benchmark DeepSeek V4 Flash Qwen3.7 Max
AA Intelligence Index 46.5 56.6
GPQA Diamond 89.4% 92.3%
HLE 32.1% 38.1%
IF-Bench 79.2% 80.5%
LiveCodeBench Reasoning 63.0% 69.0%
SciCode 44.9% 48.8%
TAU2-bench 95.0% 94.7%
TerminalBench-Hard 35.6% 50.8%

Pricing comparison

Metric DeepSeek V4 Flash Qwen3.7 Max
Input ($/Mtok) $0.14 $2.50
Output ($/Mtok) $0.28 $7.50
Cached input ($/Mtok)
Cost per 1M-token roundtrip (1M in + 1M out) $0.42 $10.00

Context window & modalities

Attribute DeepSeek V4 Flash Qwen3.7 Max
Context window 1M tokens 1M tokens
Input modalities text text
Output modalities text text
Knowledge cutoff

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

No shared coding benchmark.

Reasoning
→ Qwen3.7 Max
Basis: GPQA Diamond

DeepSeek V4 Flash 89.4% vs Qwen3.7 Max 92.3% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
Tie
Basis: Context window

DeepSeek V4 Flash 1M tokens vs Qwen3.7 Max 1M tokens.

Cost
→ DeepSeek V4 Flash
Basis: Input $/Mtok

DeepSeek V4 Flash $0.14/Mtok vs Qwen3.7 Max $2.5/Mtok input.

Changelog & releases

DeepSeek V4 Flash
Released 2026-04-24
Qwen3.7 Max
Released 2026-05-19

Related comparisons