Claude Haiku 4.5 vs Gemini 2.5 Flash: Detailed Comparison

Choosing between Claude Haiku 4.5 (Anthropic) and Gemini 2.5 Flash (Google) comes down to three things: per-token pricing, context window, and which capability matters most for your workload. Claude Haiku 4.5 costs $1.00/M input vs $0.30/M for Gemini 2.5 Flash; context windows are 200K vs 1.0M tokens. Detailed breakdown below.

Side-by-side specs

SpecClaude Haiku 4.5Gemini 2.5 Flash
ProviderAnthropicGoogle
Released2025-10-012025-04-09
Input price $1.00/M $0.30/M
Output price $5.00/M $2.50/M
Cached input $0.1000/M $0.0750/M
Context window 200K 1.0M
Max output 16K 66K
Modalities text image text image audio video
Tokenizer claude-3 gemini

Capability matrix

CapabilityClaude Haiku 4.5Gemini 2.5 Flash
function calling Yes Yes
json mode Yes Yes
vision Yes Yes
streaming Yes Yes
tool use Yes No
audio No Yes
video No Yes

Per-call cost on typical workloads

Workload (in/out tokens)Claude Haiku 4.5Gemini 2.5 FlashCheaper by
Standard chat (1K / 500) $0.003500 $0.001550 Gemini 2.5 Flash by $0.001950
RAG (4K / 500) $0.006500 $0.002450 Gemini 2.5 Flash by $0.004050
Long doc (20K / 1K) $0.025000 $0.008500 Gemini 2.5 Flash by $0.016500
Very long context (100K / 2K) $0.107500 $0.033750 Gemini 2.5 Flash by $0.073750

When to choose Claude Haiku 4.5 over Gemini 2.5 Flash

  • Supports tool use — Gemini 2.5 Flash does not.

When to choose Gemini 2.5 Flash over Claude Haiku 4.5

  • Per-token input cost is 70% lower than Claude Haiku 4.5.
  • Larger context window (1.0M vs 200K).
  • Supports audio — Claude Haiku 4.5 does not.
  • Supports video — Claude Haiku 4.5 does not.

Related comparisons