Context Window Cost Calculator
Long context costs more. How much does it cost per request when you send 50k, 100k, or 200k tokens? Compare across all major models.
Last updated: March 2026
Your context
System prompt + context (e.g. RAG docs, conversation history). 100k tokens ≈ 75k words.
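To size your own context before comparing prices, the rule of thumb above can be turned into a rough estimator. This is a minimal sketch that assumes about 0.75 words per token, which is the ratio implied by "100k tokens ≈ 75k words". The function name `estimate_tokens` is hypothetical, and real tokenizers vary by model and language, so treat the result as an approximation.

```python
def estimate_tokens(text: str, words_per_token: float = 0.75) -> int:
    """Rough token estimate from a whitespace word count.

    Assumes ~0.75 words per token (100k tokens ~ 75k words).
    Actual counts depend on the model's tokenizer.
    """
    word_count = len(text.split())
    return round(word_count / words_per_token)


if __name__ == "__main__":
    sample = "word " * 75_000  # stand-in for a 75k-word context
    print(estimate_tokens(sample))
```

For billing-grade numbers, a provider's own tokenizer or token-counting endpoint would be the more reliable source.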
Cost per request and per month
Sorted by monthly cost, lowest first. Input-heavy workloads favor models with cheaper input pricing.
Model               Provider    Per request   Per month
Gemini 2.0 Flash    google      $0.010        $30.60
GPT-4o mini         openai      $0.015        $45.90
Claude Haiku 3      anthropic   $0.026        $76.88
GPT-5 mini          openai      $0.026        $78.00
Gemini 2.5 Flash    google      $0.031        $93.75
GPT-4.1 mini        openai      $0.041        $122.40
Claude Haiku 4.5    anthropic   $0.103        $307.50
o4-mini             openai      $0.112        $336.60
Gemini 2.5 Pro      google      $0.130        $390.00
GPT-4.1             openai      $0.204        $612.00
o3                  openai      $0.204        $612.00
Gemini 3.1 Pro      google      $0.206        $618.00
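The per-request and per-month figures can be approximated with simple arithmetic: tokens times price per million tokens, then multiplied by monthly request volume. The sketch below is illustrative only. The page does not state its output-token count, request volume, or per-token prices, so the 500 output tokens, 3,000 requests per month, model names, and prices used here are all assumptions. With an assumed $0.10 input and $0.40 output price per million tokens, the result is consistent with the listed $0.010/request and $30.60/mo row.

```python
def cost_per_request(input_tokens, output_tokens,
                     input_price_per_m, output_price_per_m):
    """USD cost of one request; prices are USD per million tokens."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


def rank_by_monthly_cost(prices, input_tokens, output_tokens,
                         requests_per_month):
    """Return (model, per_request, per_month) rows, cheapest month first."""
    rows = []
    for model, (in_price, out_price) in prices.items():
        per_request = cost_per_request(input_tokens, output_tokens,
                                       in_price, out_price)
        rows.append((model, per_request, per_request * requests_per_month))
    return sorted(rows, key=lambda row: row[2])


# Hypothetical models with assumed (input, output) prices per million tokens.
ASSUMED_PRICES = {
    "model-a": (2.00, 8.00),
    "model-b": (0.10, 0.40),
    "model-c": (0.25, 2.00),
}

if __name__ == "__main__":
    # Assumed workload: 100k input tokens, 500 output tokens, 3,000 requests/mo.
    for model, per_request, per_month in rank_by_monthly_cost(
            ASSUMED_PRICES, 100_000, 500, 3_000):
        print(f"{model}: ${per_request:.3f}/request, ${per_month:.2f}/mo")
```

Because the input side is 200 times larger than the output side in this assumed workload, the input price dominates the ranking, which is why input-heavy workloads favor cheaper input pricing.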
Long context costs add up. Which customers are driving them?
PerUnit breaks down cost by customer and feature — so you can see who uses long context most.