AI Flash Report

Gemini 3.5 Flash vs MiniCPM5-1B: Benchmarks, Pricing & Capabilities Compared

TL;DR: Gemini 3.5 Flash wins on reasoning and long context; MiniCPM5-1B wins on cost.

Gemini 3.5 Flash
Released: 2026-05-19
Context window: 1M tokens
Input price: $1.50 / Mtok
Output price: $9.00 / Mtok

MiniCPM5-1B (OpenBMB)
Released: 2026-05-25
Context window: 128K tokens
Input price: $0.00 / Mtok
Output price: $0.00 / Mtok

Benchmark comparison

| Benchmark | Gemini 3.5 Flash | MiniCPM5-1B |
|---|---|---|
| AA Intelligence Index | 43.3 | 17.9 |
| GPQA Diamond | 82.8% | 26.9% |
| HLE | 23.1% | 4.6% |
| IF-Bench | 47.3% | 35.2% |
| LiveCodeBench Reasoning | 53.3% | 4.7% |
| SciCode | 48.8% | 1.4% |
| TAU2-bench | 58.8% | 82.5% |
| TerminalBench-Hard | 46.2% | 0.0% |
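
The gaps between the two columns can be tallied with a short script. This is a minimal sketch in Python: the scores are copied from the table above, while the `SCORES` layout and the `score_gaps` and `leaders` helpers are illustrative names, not part of any published tooling.

```python
# Scores copied from the benchmark table: (Gemini 3.5 Flash, MiniCPM5-1B).
# AA Intelligence Index is an index value; the other rows are percentages.
SCORES = {
    "AA Intelligence Index": (43.3, 17.9),
    "GPQA Diamond": (82.8, 26.9),
    "HLE": (23.1, 4.6),
    "IF-Bench": (47.3, 35.2),
    "LiveCodeBench Reasoning": (53.3, 4.7),
    "SciCode": (48.8, 1.4),
    "TAU2-bench": (58.8, 82.5),
    "TerminalBench-Hard": (46.2, 0.0),
}


def score_gaps(scores):
    """Return {benchmark: Gemini score minus MiniCPM score}, in points."""
    return {name: round(g - m, 1) for name, (g, m) in scores.items()}


def leaders(scores):
    """Return {benchmark: name of the higher-scoring model, or 'tie'}."""
    result = {}
    for name, (g, m) in scores.items():
        if g == m:
            result[name] = "tie"
        else:
            result[name] = "Gemini 3.5 Flash" if g > m else "MiniCPM5-1B"
    return result


if __name__ == "__main__":
    for name, gap in score_gaps(SCORES).items():
        print(f"{name}: {gap:+.1f} points ({leaders(SCORES)[name]} leads)")
```

On these figures Gemini 3.5 Flash leads on seven of the eight rows; TAU2-bench is the one row where MiniCPM5-1B scores higher.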

Pricing comparison

| Metric | Gemini 3.5 Flash | MiniCPM5-1B |
|---|---|---|
| Input ($/Mtok) | $1.50 | $0.00 |
| Output ($/Mtok) | $9.00 | $0.00 |
| Cached input ($/Mtok) | not listed | not listed |
| Cost per 1M-token roundtrip (1M in + 1M out) | $10.50 | $0.00 |
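
The roundtrip figure is the input price plus the output price, each applied to 1M tokens. The same arithmetic extends to any request size. The sketch below assumes simple linear per-token billing at the listed prices, with no caching or tiered rates; `PRICES` and `request_cost` are hypothetical names used only for illustration.

```python
from decimal import Decimal

# Listed prices in USD per million tokens: (input, output).
PRICES = {
    "Gemini 3.5 Flash": (Decimal("1.50"), Decimal("9.00")),
    "MiniCPM5-1B": (Decimal("0.00"), Decimal("0.00")),
}

TOKENS_PER_MTOK = Decimal(1_000_000)


def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request, assuming linear per-token billing."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Reproduces the roundtrip row: 1M tokens in, 1M tokens out.
    print(request_cost("Gemini 3.5 Flash", 1_000_000, 1_000_000))
    # A smaller, hypothetical request: 20,000 tokens in, 2,000 tokens out.
    print(request_cost("Gemini 3.5 Flash", 20_000, 2_000))
```

`Decimal` is used instead of floats so that currency amounts compare exactly.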

Context window & modalities

| Attribute | Gemini 3.5 Flash | MiniCPM5-1B |
|---|---|---|
| Context window | 1M tokens | 128K tokens |
| Input modalities | text, image, audio, video | text |
| Output modalities | text | text |
| Knowledge cutoff | not listed | not listed |
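
In practice the context window has to hold both the prompt and the generated output. A rough fit check might look like the sketch below. It assumes "1M" means 1,000,000 tokens and "128K" means 128,000 tokens (the exact limits may differ), and that token counts come from whatever tokenizer each model uses; `CONTEXT_WINDOW` and `fits` are hypothetical names.

```python
# Context windows from the table, under the assumption that
# 1M = 1,000,000 tokens and 128K = 128,000 tokens.
CONTEXT_WINDOW = {
    "Gemini 3.5 Flash": 1_000_000,
    "MiniCPM5-1B": 128_000,
}


def fits(model, prompt_tokens, reserved_output_tokens=0):
    """Return True if the prompt plus reserved output fits in the window."""
    return prompt_tokens + reserved_output_tokens <= CONTEXT_WINDOW[model]


if __name__ == "__main__":
    # A hypothetical 300,000-token prompt with 8,000 tokens reserved for output.
    for model in CONTEXT_WINDOW:
        print(model, fits(model, 300_000, 8_000))
```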

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

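No SWE-bench result is listed for either model, so no verdict is given on that basis. The benchmark comparison does include coding-related results for both models (LiveCodeBench Reasoning 53.3% vs 4.7%, SciCode 48.8% vs 1.4%, TerminalBench-Hard 46.2% vs 0.0%), all favoring Gemini 3.5 Flash.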

Reasoning
→ Gemini 3.5 Flash
Basis: GPQA Diamond

Gemini 3.5 Flash 82.8% vs MiniCPM5-1B 26.9% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ Gemini 3.5 Flash
Basis: Context window

Gemini 3.5 Flash 1M tokens vs MiniCPM5-1B 128K tokens.

Cost
→ MiniCPM5-1B
Basis: Input $/Mtok

Gemini 3.5 Flash $1.50/Mtok vs MiniCPM5-1B $0.00/Mtok input.
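
The verdicts in this section appear to follow a single rule: compare the two models on one basis metric, and report "Insufficient data" when either value is missing. The sketch below expresses that rule as inferred from the entries above, not as the site's actual method; the `verdict` function and its parameters are hypothetical.

```python
from typing import Optional


def verdict(
    name_a: str,
    value_a: Optional[float],
    name_b: str,
    value_b: Optional[float],
    higher_is_better: bool = True,
) -> str:
    """Pick a winner on one metric, or report that data is missing."""
    if value_a is None or value_b is None:
        return "Insufficient data"
    if value_a == value_b:
        return "Tie"
    if higher_is_better:
        a_wins = value_a > value_b
    else:
        a_wins = value_a < value_b
    return name_a if a_wins else name_b


if __name__ == "__main__":
    g, m = "Gemini 3.5 Flash", "MiniCPM5-1B"
    print("Reasoning:", verdict(g, 82.8, m, 26.9))            # GPQA Diamond
    print("Long context:", verdict(g, 1_000_000, m, 128_000))  # context window
    print("Cost:", verdict(g, 1.50, m, 0.00, higher_is_better=False))
    print("Math:", verdict(g, None, m, None))                 # no scores listed
```

Cost is the one metric here where the lower value wins, which is why the rule takes a `higher_is_better` flag.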

Changelog & releases

Gemini 3.5 Flash
Released 2026-05-19
MiniCPM5-1B
Released 2026-05-25
