AI Flash Report

MiniCPM5-1B vs Nemotron 3 Ultra 550B A55B: Benchmarks, Pricing & Capabilities Compared

TL;DR — MiniCPM5-1B wins for general use · Nemotron 3 Ultra 550B A55B wins for reasoning + long-context.

MiniCPM5-1B OpenBMB
Released
2026-05-25
Context window
128K tokens
Input price
$0.00 / Mtok
Output price
$0.00 / Mtok
Released
2026-06-04
Context window
262K tokens
Input price
$0.60 / Mtok
Output price
$2.60 / Mtok

Benchmark comparison

Benchmark MiniCPM5-1B Nemotron 3 Ultra 550B A55B
GPQA Diamond 27.8% 86.7%
HLE 6.5% 26.6%
IF-Bench 49.3% 81.4%
LiveCodeBench Reasoning 3.7% 67.0%
SciCode 4.4% 39.9%
TAU2-bench 81.0% 83.3%
TerminalBench-Hard 0.0% 36.4%

Pricing comparison

Metric MiniCPM5-1B Nemotron 3 Ultra 550B A55B
Input ($/Mtok) $0.00 $0.60
Output ($/Mtok) $0.00 $2.60
Cached input ($/Mtok)
Cost per 1M-token roundtrip (1M in + 1M out) $0.00 $3.20

Context window & modalities

Attribute MiniCPM5-1B Nemotron 3 Ultra 550B A55B
Context window 128K tokens 262K tokens
Input modalities text text
Output modalities text text
Knowledge cutoff

Verdict by use case

Coding
Insufficient data
Basis: SWE-bench

No shared coding benchmark.

Reasoning
→ Nemotron 3 Ultra 550B A55B
Basis: GPQA Diamond

MiniCPM5-1B 27.8% vs Nemotron 3 Ultra 550B A55B 86.7% on GPQA Diamond.

Math
Insufficient data
Basis: MATH / AIME

No shared math benchmark.

Long context
→ Nemotron 3 Ultra 550B A55B
Basis: Context window

MiniCPM5-1B 128K tokens vs Nemotron 3 Ultra 550B A55B 262K tokens.

Cost
→ MiniCPM5-1B
Basis: Input $/Mtok

MiniCPM5-1B $0/Mtok vs Nemotron 3 Ultra 550B A55B $0.6/Mtok input.

Changelog & releases

MiniCPM5-1B
Released 2026-05-25
Nemotron 3 Ultra 550B A55B
Released 2026-06-04

Related comparisons