Claude Sonnet 4.6 vs Mistral Large 2: Detailed Comparison

Choosing between Claude Sonnet 4.6 (Anthropic) and Mistral Large 2 (Mistral AI) comes down to three things: per-token pricing, context window, and which capability matters most for your workload. Claude Sonnet 4.6 costs $3.00/M input vs $2.00/M for Mistral Large 2; context windows are 200K vs 128K tokens. Detailed breakdown below.

Side-by-side specs

Spec	Claude Sonnet 4.6	Mistral Large 2
Provider	Anthropic	Mistral AI
Released	2026-03-12	2024-07-24
Input price	$3.00/M	$2.00/M
Output price	$15.00/M	$6.00/M
Cached input	$0.3000/M	—
Context window	200K	128K
Max output	64K	8K
Modalities	text image	text
Tokenizer	`claude-3`	`mistral`

Capability matrix

Capability	Claude Sonnet 4.6	Mistral Large 2
function calling	Yes	Yes
json mode	Yes	Yes
vision	Yes	No
streaming	Yes	Yes
tool use	Yes	No

Benchmark comparison

Higher is better for all benchmarks shown.

Benchmark	Category	Claude Sonnet 4.6	Mistral Large 2	Δ
HumanEval	coding	92.0	92.0	0.0
SWE-bench Verified	coding	72.5	—	—
Aider Polyglot	coding	74.0	—	—
MMLU	general	—	84.0	—

Per-call cost on typical workloads

Workload (in/out tokens)	Claude Sonnet 4.6	Mistral Large 2	Cheaper by
Standard chat (1K / 500)	$0.010500	$0.005000	Mistral Large 2 by $0.005500
RAG (4K / 500)	$0.019500	$0.011000	Mistral Large 2 by $0.008500
Long doc (20K / 1K)	$0.075000	$0.046000	Mistral Large 2 by $0.029000
Very long context (100K / 2K)	$0.322500	$0.209000	Mistral Large 2 by $0.113500

When to choose Claude Sonnet 4.6 over Mistral Large 2

Larger context window (200K vs 128K) — relevant when whole documents or long histories must fit in a single call.
Supports vision — Mistral Large 2 does not.
Supports tool use — Mistral Large 2 does not.

When to choose Mistral Large 2 over Claude Sonnet 4.6

Per-token input cost is 33% lower than Claude Sonnet 4.6.