Model Comparisons
Side-by-side comparisons of LLMs from across the industry. Each page pulls its specs and benchmark data from the model database, so the numbers stay in sync with the database. Pick any two models to compare; the pairs below are some of the most commonly searched.
Popular comparisons
- gpt-5 vs claude-opus-4-7: pricing, context, capabilities, benchmarks
- claude-sonnet-4-6 vs gpt-5: pricing, context, capabilities, benchmarks
- gemini-2-5-pro vs claude-sonnet-4-6: pricing, context, capabilities, benchmarks
- gpt-5-mini vs claude-haiku-4-5: pricing, context, capabilities, benchmarks
- deepseek-v3 vs gpt-4o: pricing, context, capabilities, benchmarks
- claude-opus-4-7 vs gemini-2-5-pro: pricing, context, capabilities, benchmarks
- llama-3-3-70b vs deepseek-v3: pricing, context, capabilities, benchmarks
- qwen3-coder-480b vs claude-sonnet-4-6: pricing, context, capabilities, benchmarks
- o3 vs claude-opus-4-7: pricing, context, capabilities, benchmarks
- gemini-2-5-flash vs gpt-5-mini: pricing, context, capabilities, benchmarks
- grok-3 vs gpt-5: pricing, context, capabilities, benchmarks
- gpt-4-1 vs gemini-2-5-pro: pricing, context, capabilities, benchmarks
How comparisons work
Every comparison page is generated from the underlying database. The tables show the actual published specs and benchmark scores for each model, with sources. The "when to choose A vs B" section is derived from the capability differences and ranking weights, not from generic copy.
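As a rough illustration of how a "when to choose A vs B" section could be derived from capability differences and ranking weights, here is a minimal Python sketch. The record shape (`ModelSpec`), the field names, the weight values, and the function `choose_reasons` are all hypothetical; the actual database schema and weighting scheme are not described on this page, and the numbers in the usage example are placeholders rather than real model data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    # Hypothetical record shape; the real database schema is not shown here.
    name: str
    input_price_per_mtok: float  # USD per million input tokens (lower is better)
    context_window: int          # tokens (higher is better)
    benchmark_score: float       # 0-100 scale (higher is better)


# Hypothetical ranking weights: how much each kind of difference matters.
WEIGHTS = {"price": 0.3, "context": 0.2, "benchmark": 0.5}


def choose_reasons(a: ModelSpec, b: ModelSpec) -> dict[str, list[str]]:
    """Map each model name to the dimensions on which it beats the other."""
    won: dict[str, list[tuple[float, str]]] = {a.name: [], b.name: []}
    # (label, value for a, value for b, True if a lower value is better)
    fields = [
        ("price", a.input_price_per_mtok, b.input_price_per_mtok, True),
        ("context", a.context_window, b.context_window, False),
        ("benchmark", a.benchmark_score, b.benchmark_score, False),
    ]
    for label, va, vb, lower_is_better in fields:
        if va == vb:
            continue  # no difference on this dimension, so no recommendation
        a_wins = (va < vb) if lower_is_better else (va > vb)
        winner = a if a_wins else b
        won[winner.name].append((WEIGHTS[label], label))
    # List each model's advantages with the most heavily weighted first.
    return {
        name: [label for _, label in sorted(items, reverse=True)]
        for name, items in won.items()
    }


# Placeholder specs, not real model data.
model_a = ModelSpec("model-a", input_price_per_mtok=2.0,
                    context_window=200_000, benchmark_score=80.0)
model_b = ModelSpec("model-b", input_price_per_mtok=0.5,
                    context_window=200_000, benchmark_score=70.0)
reasons = choose_reasons(model_a, model_b)
print(reasons)
```

In this sketch a dimension produces a recommendation only when the two models actually differ on it, which is one way the generated text could avoid generic copy: identical context windows, for example, yield no "choose A for context" line at all.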