GPT-5.5 Instant vs Qwen3.7 Max: Benchmarks, Pricing & Capabilities Compared
TL;DR: GPT-5.5 Instant wins for general use; Qwen3.7 Max wins for reasoning, cost, and long context.
GPT-5.5 Instant (OpenAI)
- Released: 2026-05-05
- Context window: 400K tokens
- Input price: $5.00 / Mtok
- Output price: $30.00 / Mtok
Qwen3.7 Max (Alibaba)
- Released: 2026-05-19
- Context window: 1M tokens
- Input price: $2.50 / Mtok
- Output price: $7.50 / Mtok
Benchmark comparison
| Benchmark | GPT-5.5 Instant | Qwen3.7 Max |
|---|---|---|
| AA Intelligence Index | 41.8 | 56.6 ✓ |
| Chatbot Arena Elo | 1474 | 1541 ✓ |
| GPQA Diamond | 84.6% | 92.3% ✓ |
| HLE | 20.3% | 38.1% ✓ |
| IF-Bench | 71.5% | 80.5% ✓ |
| LiveCodeBench Reasoning | 55.7% | 69.0% ✓ |
| SciCode | 50.3% ✓ | 48.8% |
| TAU2-bench | 49.4% | 94.7% ✓ |
| TerminalBench-Hard | 42.4% | 50.8% ✓ |
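The checkmarks in the table can be tallied mechanically. The sketch below copies the scores from the table as-is and counts which model scores higher on each row; the names `SCORES` and `tally_wins` are illustrative, and the count treats every benchmark as equally important, which is a simplifying assumption rather than a ranking method.

```python
# Scores copied from the benchmark table: (GPT-5.5 Instant, Qwen3.7 Max).
# Units differ per row (index, Elo, percent), so only within-row comparison is meaningful.
SCORES = {
    "AA Intelligence Index": (41.8, 56.6),
    "Chatbot Arena Elo": (1474, 1541),
    "GPQA Diamond": (84.6, 92.3),
    "HLE": (20.3, 38.1),
    "IF-Bench": (71.5, 80.5),
    "LiveCodeBench Reasoning": (55.7, 69.0),
    "SciCode": (50.3, 48.8),
    "TAU2-bench": (49.4, 94.7),
    "TerminalBench-Hard": (42.4, 50.8),
}


def tally_wins(scores):
    """Count, per model, the benchmarks on which it has the higher score."""
    wins = {"GPT-5.5 Instant": 0, "Qwen3.7 Max": 0}
    for gpt_score, qwen_score in scores.values():
        if gpt_score > qwen_score:
            wins["GPT-5.5 Instant"] += 1
        elif qwen_score > gpt_score:
            wins["Qwen3.7 Max"] += 1
        # Ties (none in this table) count for neither model.
    return wins


print(tally_wins(SCORES))
```

On these nine rows, Qwen3.7 Max leads on eight and GPT-5.5 Instant on one (SciCode).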
Pricing comparison
| Metric | GPT-5.5 Instant | Qwen3.7 Max |
|---|---|---|
| Input ($/Mtok) | $5.00 | $2.50 |
| Output ($/Mtok) | $30.00 | $7.50 |
| Cached input ($/Mtok) | — | — |
| Cost for 1M input + 1M output tokens | $35.00 | $10.00 |
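The combined figure in the last row is the input price plus the output price, each applied to one million tokens. A minimal sketch of the same arithmetic for arbitrary request sizes follows; it assumes list prices only (no cached-input rate, since the table lists none), and the names `PRICES` and `request_cost` are illustrative.

```python
# List prices in USD per million tokens, copied from the pricing table.
PRICES = {
    "GPT-5.5 Instant": {"input": 5.00, "output": 30.00},
    "Qwen3.7 Max": {"input": 2.50, "output": 7.50},
}


def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request at list prices (no caching or discounts)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


for name in PRICES:
    # Reproduces the last table row: 1M input tokens plus 1M output tokens.
    print(name, request_cost(name, 1_000_000, 1_000_000))
```

Because output tokens cost 6x the input rate on GPT-5.5 Instant and 3x on Qwen3.7 Max, the gap between the two models widens as responses get longer relative to prompts.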
Context window & modalities
| Attribute | GPT-5.5 Instant | Qwen3.7 Max |
|---|---|---|
| Context window | 400K tokens | 1M tokens |
| Input modalities | text, image | text |
| Output modalities | text | text |
| Knowledge cutoff | 2025-08-31 | — |
Verdict by use case
Coding
Insufficient data
Basis: SWE-bench
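No shared SWE-bench score. The coding-related benchmarks in the table above are split: LiveCodeBench Reasoning and TerminalBench-Hard favor Qwen3.7 Max, while SciCode favors GPT-5.5 Instant.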
Reasoning
→ Qwen3.7 Max
Basis: GPQA Diamond
GPT-5.5 Instant 84.6% vs Qwen3.7 Max 92.3% on GPQA Diamond.
Math
Insufficient data
Basis: MATH / AIME
No shared MATH or AIME score.
Long context
→ Qwen3.7 Max
Basis: Context window
GPT-5.5 Instant 400K tokens vs Qwen3.7 Max 1M tokens.
Cost
→ Qwen3.7 Max
Basis: Input $/Mtok
GPT-5.5 Instant $5.00/Mtok vs Qwen3.7 Max $2.50/Mtok input; output is $30.00/Mtok vs $7.50/Mtok.
Changelog & releases
GPT-5.5 Instant
Released 2026-05-05
Qwen3.7 Max
Released 2026-05-19