Prompt Token Cost Calculator

Paste a prompt and see exactly what it costs to send through every major LLM — uncached and cached. Side-by-side comparison ranks the cheapest option for your specific text. Useful for picking a model when cost matters more than capability.

Prompt

Expected output tokens Calls per month

How to use the Prompt Token Cost Calculator

Paste your prompt, set the expected output token count, and pick a monthly call volume. The tool counts tokens with each provider's tokenizer (or a calibrated heuristic where the tokenizer isn't available client-side), then multiplies by current per-million-token pricing. The result is a sorted table — cheapest first.

How to interpret the comparison

Per-token price isn't the only factor. The tokenizer matters too — Gemini and GPT-5/4o (o200k_base) typically produce 5-15% fewer tokens for the same English text than Claude or GPT-3.5 (cl100k_base). For non-Latin scripts the difference can be 2-3x. So the cheapest-per-token-listed model isn't always the cheapest-per-call for your prompt.

Cached pricing applies only to input tokens that match a stable prefix you've sent before. If your prompt is mostly a stable system message with a small per-request suffix, the cached rate dominates and your effective cost drops 90% on supported providers. The comparison shows both columns so you can see the gap.

Output token cost is included in the totals. Output tokens are 2-5x more expensive than input on every provider, so prompt design that constrains output length ("answer in one sentence") is often the biggest single cost lever.

Frequently asked questions

How is the cheapest model chosen?

The tool tokenizes your prompt, multiplies by each model input and output price, and ranks the results, so you see the lowest-cost option for your specific text.

Does it include cached pricing?

Yes. It shows uncached and cached cost side by side, which matters when you reuse a long shared prefix across many calls.

Is output cost included?

It estimates from the prompt you paste; output cost depends on how much the model generates, so assume an output length to compare models fairly.

Embed this tool on your site

Free to embed, no attribution required (but appreciated). Paste this where you want the tool to appear:

<iframe src="https://codeswap.net/llm/prompt-token-cost/?embed=1" width="100%" height="520" loading="lazy" style="border:1px solid #e5e7eb;border-radius:8px" title="Prompt Token Cost Calculator"></iframe>
<p style="font-size:13px">Tool by <a href="https://codeswap.net/llm/prompt-token-cost/">Prompt Token Cost Calculator — Codeswap</a></p>