Google LLMs
Maker of the Gemini family and operator of Google AI Studio / Vertex AI.
Google ships Gemini through two surfaces: the AI Studio API for direct use and Vertex AI for managed deployment. Gemini 1.5 Pro pioneered the 1M-token context window in commercial use; Gemini 2.x extended this further. The pricing model is tiered: smaller context windows are cheaper per token, with a step up after 128K input tokens.
Founded: 1998 · HQ: Mountain View, USA · Docs: ai.google.dev ↗
All Google models
| Model | Family | Context | Input $/M | Output $/M | Released | Status |
|---|---|---|---|---|---|---|
| Gemini 2.5 Flash | Gemini 2 | 1.0M | $0.30 | $2.50 | 2025-04-09 | active |
| Gemini 2.5 Pro | Gemini 2 | 2.0M | $1.25 | $10.00 | 2025-03-25 | active |
| Gemini 1.5 Pro | Gemini 1 | 2.0M | $1.25 | $5.00 | 2024-02-15 | active |
Comparisons with other providers
The most-searched comparisons involving Google models:
- Gemini 2.5 Flash vs GPT-5 Nano
- Gemini 2.5 Flash vs GPT-4o Mini
- Gemini 2.5 Pro vs GPT-5 Nano
- Gemini 2.5 Pro vs GPT-4o Mini
- Gemini 1.5 Pro vs GPT-5 Nano
- Gemini 1.5 Pro vs GPT-4o Mini
Working with the Google API
Documentation lives at https://ai.google.dev/gemini-api/docs. Before paying for any call, count input tokens with the appropriate counter:
- OpenAI Token Counter — for GPT-* and o-series models
- Claude Token Counter — for Anthropic Claude models
- Gemini Token Counter — for Google Gemini models