Gemini API pricing

Last reviewed May 28, 2026 · SoftwareEstimator.com

Google’s Gemini 3.1 Pro API costs $2 per million input tokens and $12 per million output for prompts up to 200K tokens, rising to $4 / $18 above 200K (cached reads $0.20). Gemini 2.5 Flash is far cheaper at $0.30 / $2.50, and Flash-Lite is the budget tier at $0.10 / $0.40. All support a 50% discount for async batch jobs. The quirk to watch is Gemini’s tiered context pricing — long agentic sessions cross the 200K threshold and double the input rate. Even so, the Flash tiers are among the cheapest viable models for high-volume coding work.

Run your own estimate →

2026 Gemini API rates

Per 1M input / output tokens, with cached-read rates:

→ Gemini 3.1 Pro (≤200K) — $2 / $12 (cache $0.20)
→ Gemini 3.1 Pro (>200K) — $4 / $18 (cache $0.40)
→ Gemini 2.5 Flash — $0.30 / $2.50 (cache $0.03)
→ Gemini 2.5 Flash-Lite — $0.10 / $0.40 (cache $0.01)
→ Async batch — 50% off

The 200K-token price jump

Gemini 3.1 Pro doubles its input rate once a prompt exceeds 200K tokens. Agentic coding sessions accumulate context fast, so a long build can quietly tip into the higher tier — factor that in if you’re pricing a large project on Gemini Pro. The Flash tiers don’t have this jump and stay cheap throughout.

Frequently asked questions

How much does the Gemini API cost?

Gemini 3.1 Pro is $2/$12 per million (up to 200K tokens), $4/$18 above that. Gemini 2.5 Flash is $0.30/$2.50; Flash-Lite is $0.10/$0.40.

What is the cheapest Gemini model?

Gemini 2.5 Flash-Lite at $0.10 input / $0.40 output per million — among the cheapest mainstream models available.

Why does Gemini pricing change over 200K tokens?

Gemini 3.1 Pro uses tiered context pricing: prompts above 200K tokens are billed at the higher $4/$18 rate. Long agentic sessions can cross this threshold.

Related guides

Figures are industry-composite estimates for planning, not quotes — agentic token spend has 10×+ run-to-run variance. See the full methodology or run an estimate .