Use-Case Cost Comparison
Which model is cheapest for your use case? Select a workload type and volume to compare costs across OpenAI, Anthropic, and Google.
Last updated: March 2026
Your use case
1,000 in / 500 out tokens per request
Cheapest models for this use case
Sorted by monthly cost. Quality may vary — test for your workload.
Gemini 2.0 Flash
$9.00/mo
$0.000300/request
GPT-4o mini
openai
$13.50/mo
$0.000450/request
Claude Haiku 3
anthropic
$26.25/mo
$0.000875/request
GPT-4.1 mini
openai
$36.00/mo
$0.0012/request
GPT-5 mini
openai
$37.50/mo
$0.0013/request
Gemini 2.5 Flash
$46.50/mo
$0.0015/request
o4-mini
openai
$99.00/mo
$0.0033/request
Claude Haiku 4.5
anthropic
$105.00/mo
$0.0035/request
GPT-4.1
openai
$180.00/mo
$0.0060/request
o3
openai
$180.00/mo
$0.0060/request
Gemini 2.5 Pro
$187.50/mo
$0.0063/request
GPT-4o
openai
$225.00/mo
$0.0075/request
Different features, different costs.
PerUnit shows cost per feature — so you can use the right model for each and see the impact.
Get early access to PerUnit