AI Flash Report

Gemini 3.5 Flash vs MiniCPM-V 4.6 1.3B: Benchmarks, Pricing & Capabilities Compared

TL;DR: Gemini 3.5 Flash wins on reasoning and long context; MiniCPM-V 4.6 1.3B wins on cost.

Gemini 3.5 Flash
Released: 2026-05-19
Context window: 1M tokens
Input price: $1.50 / Mtok
Output price: $9.00 / Mtok

MiniCPM-V 4.6 1.3B
Released: 2026-05-11
Context window: 262K tokens
Input price: $0.00 / Mtok
Output price: $0.00 / Mtok

Benchmark comparison

Benchmark | Gemini 3.5 Flash | MiniCPM-V 4.6 1.3B
AA Intelligence Index | 43.3 | 12.7
GPQA Diamond | 82.8% | 30.5%
HLE | 23.1% | 4.9%
IF-Bench | 47.3% | 26.7%
LiveCodeBench Reasoning | 53.3% | 6.3%
SciCode | 48.8% | 2.1%
TAU2-bench | 58.8% | 87.7%
TerminalBench-Hard | 46.2% | 0.0%
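
The table can be summarized by computing, for each benchmark, which model leads and by how many points. The following is a minimal sketch: the scores are copied from the table above, while the `SCORES` layout and the `leaders` helper are illustrative names, not part of any published tooling.

```python
# Scores copied from the benchmark table: (Gemini 3.5 Flash, MiniCPM-V 4.6 1.3B).
# All values are percentages except the AA Intelligence Index, which is an index score.
SCORES = {
    "AA Intelligence Index": (43.3, 12.7),
    "GPQA Diamond": (82.8, 30.5),
    "HLE": (23.1, 4.9),
    "IF-Bench": (47.3, 26.7),
    "LiveCodeBench Reasoning": (53.3, 6.3),
    "SciCode": (48.8, 2.1),
    "TAU2-bench": (58.8, 87.7),
    "TerminalBench-Hard": (46.2, 0.0),
}

GEMINI = "Gemini 3.5 Flash"
MINICPM = "MiniCPM-V 4.6 1.3B"


def leaders(scores):
    """Return a list of (benchmark, leading model, gap in points)."""
    rows = []
    for name, (gemini, minicpm) in scores.items():
        if gemini == minicpm:
            leader = "tie"
        else:
            leader = GEMINI if gemini > minicpm else MINICPM
        rows.append((name, leader, round(abs(gemini - minicpm), 1)))
    return rows


for name, leader, gap in leaders(SCORES):
    print(f"{name}: {leader} (gap {gap} points)")
```

On these figures Gemini 3.5 Flash leads on seven of the eight benchmarks, and MiniCPM-V 4.6 1.3B leads only on TAU2-bench.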

Pricing comparison

Metric | Gemini 3.5 Flash | MiniCPM-V 4.6 1.3B
Input ($/Mtok) | $1.50 | $0.00
Output ($/Mtok) | $9.00 | $0.00
Cached input ($/Mtok) | not listed | not listed
Cost per 1M-token roundtrip (1M in + 1M out) | $10.50 | $0.00
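
The roundtrip figure is the input price plus the output price, each applied to one million tokens. As a rough sketch, the same arithmetic can be applied to any request size. The prices come from the table above; the `PRICES` mapping and the `request_cost` function are hypothetical names chosen for illustration, and cached-input pricing is ignored because the page does not list it.

```python
from decimal import Decimal

# Listed prices in USD per million tokens: (input, output).
PRICES = {
    "Gemini 3.5 Flash": (Decimal("1.50"), Decimal("9.00")),
    "MiniCPM-V 4.6 1.3B": (Decimal("0.00"), Decimal("0.00")),
}

TOKENS_PER_MTOK = Decimal(1_000_000)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Cost in USD of one request at the listed per-Mtok prices."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MTOK


# Reproduces the roundtrip row: 1M tokens in plus 1M tokens out.
print(request_cost("Gemini 3.5 Flash", 1_000_000, 1_000_000))
# A smaller, more typical request: 20,000 tokens in, 1,000 tokens out.
print(request_cost("Gemini 3.5 Flash", 20_000, 1_000))
```

Because output tokens cost six times as much as input tokens for Gemini 3.5 Flash, the output length dominates the bill for generation-heavy workloads.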

Context window & modalities

Attribute | Gemini 3.5 Flash | MiniCPM-V 4.6 1.3B
Context window | 1M tokens | 262K tokens
Input modalities | text, image, audio, video | text, image, video
Output modalities | text | text
Knowledge cutoff | not listed | not listed

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

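Neither model has a listed SWE-bench score. The benchmark table does include shared LiveCodeBench Reasoning, SciCode, and TerminalBench-Hard results, all of which favor Gemini 3.5 Flash.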

Reasoning
→ Gemini 3.5 Flash
Basis: GPQA Diamond

Gemini 3.5 Flash 82.8% vs MiniCPM-V 4.6 1.3B 30.5% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ Gemini 3.5 Flash
Basis: Context window

Gemini 3.5 Flash 1M tokens vs MiniCPM-V 4.6 1.3B 262K tokens.

Cost
→ MiniCPM-V 4.6 1.3B
Basis: Input $/Mtok

Gemini 3.5 Flash $1.50/Mtok vs MiniCPM-V 4.6 1.3B $0.00/Mtok on input price.
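
The page does not state how its verdicts are derived. The pattern in the cards is consistent with a simple rule: compare the two models on one basis metric, and report "Insufficient data" when either value is missing. The sketch below encodes that assumed rule; the `verdict` function and its parameters are hypothetical, not the site's actual method.

```python
from typing import Optional


def verdict(
    name_a: str,
    value_a: Optional[float],
    name_b: str,
    value_b: Optional[float],
    higher_is_better: bool = True,
) -> str:
    """Pick a winner on a single basis metric (assumed rule, see lead-in)."""
    # A missing value on either side means the metric is not shared.
    if value_a is None or value_b is None:
        return "Insufficient data"
    if value_a == value_b:
        return "Tie"
    a_wins = (value_a > value_b) == higher_is_better
    return name_a if a_wins else name_b


A, B = "Gemini 3.5 Flash", "MiniCPM-V 4.6 1.3B"

print("Coding:", verdict(A, None, B, None))        # no SWE-bench scores listed
print("Reasoning:", verdict(A, 82.8, B, 30.5))     # GPQA Diamond, percent
print("Long context:", verdict(A, 1_000_000, B, 262_000))  # tokens
print("Cost:", verdict(A, 1.50, B, 0.00, higher_is_better=False))  # input $/Mtok
```

A single-metric rule like this explains why Coding is marked "Insufficient data" even though other coding-related benchmarks are shared: only the named basis metric is consulted.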

Changelog & releases

Gemini 3.5 Flash
Released 2026-05-19
MiniCPM-V 4.6 1.3B
Released 2026-05-11
