
Context Window Cost Calculator

Long context costs more, because providers bill per input token. How much does each request cost when you send 50k, 100k, or 200k tokens? Compare models from Google, OpenAI, and Anthropic.

Last updated: March 2026

Your context

System prompt + context (e.g. RAG docs, conversation history). 100k tokens ≈ 75k words.
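
To size a context before sending it, the rule of thumb above (100k tokens ≈ 75k words) can be turned into a rough estimator. This is a minimal sketch: the function names are hypothetical, the 0.75 ratio is only the approximation quoted above, and real token counts depend on each model's tokenizer.

```python
# Rule of thumb from this page: 100k tokens ≈ 75k words, i.e. 0.75 words per token.
WORDS_PER_TOKEN = 0.75


def estimate_tokens(word_count: int) -> int:
    """Rough token estimate for a piece of text with the given word count."""
    return round(word_count / WORDS_PER_TOKEN)


def estimate_context_tokens(system_prompt_words: int,
                            document_words: int,
                            history_words: int) -> int:
    """Estimate the total context size: system prompt + RAG docs + conversation history."""
    return estimate_tokens(system_prompt_words + document_words + history_words)


if __name__ == "__main__":
    # Hypothetical workload: short system prompt, large retrieved documents, some history.
    print(estimate_context_tokens(600, 60_000, 14_400))
```

For billing-grade numbers, count tokens with the provider's own tokenizer instead of this approximation.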

Cost per request and per month

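Sorted by monthly cost, lowest first. The monthly figures are consistent with roughly 3,000 requests per month (for example, $612.00 ÷ $0.204 = 3,000). For input-heavy workloads the input-token price dominates the bill, so models with cheaper input pricing come out ahead.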

Model               Provider     Cost per request   Cost per month
Gemini 2.0 Flash    Google       $0.010             $30.60
GPT-4o mini         OpenAI       $0.015             $45.90
Claude Haiku 3      Anthropic    $0.026             $76.88
GPT-5 mini          OpenAI       $0.026             $78.00
Gemini 2.5 Flash    Google       $0.031             $93.75
GPT-4.1 mini        OpenAI       $0.041             $122.40
Claude Haiku 4.5    Anthropic    $0.103             $307.50
o4-mini             OpenAI       $0.112             $336.60
Gemini 2.5 Pro      Google       $0.130             $390.00
GPT-4.1             OpenAI       $0.204             $612.00
o3                  OpenAI       $0.204             $612.00
Gemini 3.1 Pro      Google       $0.206             $618.00
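
The arithmetic behind figures like these can be sketched as follows. This is an illustration under stated assumptions, not the calculator's actual implementation: the names `Pricing`, `cost_per_request`, `monthly_cost`, and `PLACEHOLDER_PRICES` are hypothetical, the per-million-token prices are placeholders rather than quotes from any provider, and the 500 output tokens per request is an assumed value. Only the 3,000 requests per month is implied by the figures above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Prices in USD per one million tokens."""
    input_per_million: float
    output_per_million: float


def cost_per_request(p: Pricing, input_tokens: int, output_tokens: int = 0) -> float:
    """Cost of one request: input and output tokens are billed at separate rates."""
    total = input_tokens * p.input_per_million + output_tokens * p.output_per_million
    return total / 1_000_000


def monthly_cost(p: Pricing, input_tokens: int, output_tokens: int,
                 requests_per_month: int) -> float:
    """Monthly cost is the per-request cost times the number of requests."""
    return cost_per_request(p, input_tokens, output_tokens) * requests_per_month


# Placeholder prices for illustration only; check each provider's current price list.
PLACEHOLDER_PRICES = {
    "model-a": Pricing(input_per_million=0.10, output_per_million=0.40),
    "model-b": Pricing(input_per_million=2.00, output_per_million=8.00),
}

if __name__ == "__main__":
    context_tokens = 100_000   # system prompt + context
    output_tokens = 500        # assumed short answer
    requests = 3_000           # implied by the monthly figures above

    rows = [
        (name,
         cost_per_request(p, context_tokens, output_tokens),
         monthly_cost(p, context_tokens, output_tokens, requests))
        for name, p in PLACEHOLDER_PRICES.items()
    ]
    # Sort by monthly cost, lowest first, as in the table.
    for name, per_request, per_month in sorted(rows, key=lambda r: r[2]):
        print(f"{name:10s} ${per_request:.3f}/request  ${per_month:,.2f}/mo")
```

With 100k input tokens and only a few hundred output tokens, the input term accounts for nearly all of the per-request cost, which is why input pricing drives the ranking for this kind of workload.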

Long context costs add up. Which customers are driving them?

PerUnit breaks down cost by customer and feature — so you can see who uses long context most.
