← Free AI Tools

AI Model Cost Comparison — OpenAI, Anthropic & Google

Compare the cost of every major OpenAI, Anthropic, and Google Gemini model side by side. Enter your token usage to see exactly what each model would cost at your scale.

Prices sourced from OpenAI, Anthropic, and Google official pricing pages · Last updated: March 2026

Estimate costs at your usage volume

Filter:

ModelInput / 1MOutput / 1MEst. monthly cost
GoogleGemini 2.0 Flash

Affordable production-ready Gemini

$0.10$0.40
$7.50
OpenAIGPT-4o mini

Affordable GPT-4o variant

$0.15$0.60
$11.25
AnthropicClaude Haiku 3

Highly affordable classic

$0.25$1.25
$22.50
OpenAIGPT-4.1 mini

Lightweight GPT-4.1

$0.40$1.60
$30.00
OpenAIGPT-5 mini

Fast, cost-efficient GPT-5

$0.25$2.00
$33.75
GoogleGemini 2.5 Flash

Fast and cost-efficient Gemini

$0.30$2.50
$42.00
OpenAIo4-mini

Fast reasoning model

$1.10$4.40
$82.50
AnthropicClaude Haiku 4.5

Fast and cost-efficient

$1.00$5.00
$90.00
OpenAIGPT-4.1

Latest GPT-4 series

$2.00$8.00
$150.00
OpenAIo3

OpenAI reasoning model

$2.00$8.00
$150.00
GoogleGemini 2.5 Pro

Highly capable Gemini model

$1.25$10.00
$168.75
OpenAIGPT-4o

Multimodal GPT-4 model

$2.50$10.00
$187.50
GoogleGemini 3.1 ProPopular

Google's latest flagship model

$2.00$12.00
$210.00
OpenAIGPT-5.4Popular

OpenAI flagship model

$2.50$15.00
$262.50
AnthropicClaude Sonnet 4.6

Balanced intelligence and speed

$3.00$15.00
$270.00
AnthropicClaude Sonnet 4

Reliable Sonnet series

$3.00$15.00
$270.00
AnthropicClaude Opus 4.6Popular

Most intelligent Claude model

$5.00$25.00
$450.00
AnthropicClaude Opus 4.1

High-capability legacy Opus

$15.00$75.00
$1,350.00

Sorted by estimated monthly cost (cheapest first) · Prices are for standard tier without caching or batch discounts · 1,000 req/day · 500 input + 500 output tokens/req

Full pricing reference

Raw prices per 1M tokens for quick reference, regardless of volume.

OpenAI

ModelInputOutput
GPT-5.4$2.5/M$15/M
GPT-5 mini$0.25/M$2/M
GPT-4.1$2/M$8/M
GPT-4.1 mini$0.4/M$1.6/M
GPT-4o$2.5/M$10/M
GPT-4o mini$0.15/M$0.6/M
o3$2/M$8/M
o4-mini$1.1/M$4.4/M

Google

ModelInputOutput
Gemini 3.1 Pro$2/M$12/M
Gemini 2.5 Pro$1.25/M$10/M
Gemini 2.5 Flash$0.3/M$2.5/M
Gemini 2.0 Flash$0.1/M$0.4/M

Anthropic

ModelInputOutput
Claude Opus 4.6$5/M$25/M
Claude Sonnet 4.6$3/M$15/M
Claude Haiku 4.5$1/M$5/M
Claude Opus 4.1$15/M$75/M
Claude Sonnet 4$3/M$15/M
Claude Haiku 3$0.25/M$1.25/M

Using OpenAI, Anthropic, or Google? PerUnit tracks costs across all providers in one view.

See your total AI spend by customer, feature, and pricing tier — regardless of which provider served each request.

Get early access to PerUnit

Frequently asked questions

Is OpenAI cheaper than Anthropic?
It depends on the models you compare and your usage pattern. At the flagship tier, GPT-4o ($2.50/M input, $10/M output) and Claude Sonnet 4.6 ($3.00/M input, $15/M output) are broadly comparable, with GPT-4o being slightly cheaper on output tokens. At the economy tier, GPT-4o mini ($0.15/M input) is cheaper than Claude Haiku 4.5 ($1.00/M input). Use this tool with your actual token volumes to get a meaningful comparison.
Which AI model gives the best performance per dollar?
For most production workloads, GPT-4o mini and Claude Haiku 4.5 offer the best performance-to-cost ratio for tasks like summarisation, classification, and extraction. Reserve flagship models like GPT-4o or Claude Sonnet for tasks that genuinely require higher reasoning capability. Routing requests to the right model based on complexity can reduce your AI bill by 40–70%.
What is the difference between input and output tokens?
Input tokens are the text you send to the model (your prompt plus any context). Output tokens are the text the model generates in response. Output tokens are always priced higher than input tokens — typically 3–6x more expensive. If your use case generates long responses, output costs will dominate your bill.
Does OpenAI offer volume discounts?
OpenAI does not publish standard volume discount tiers, but enterprise customers can negotiate pricing. OpenAI's Batch API offers a 50% discount for asynchronous workloads with up to 24-hour turnaround. Prompt caching (available on GPT-4o and other models) can reduce repeated input token costs by up to 50% when the same context appears across requests.
Does Anthropic offer volume discounts?
Anthropic offers prompt caching, which reduces costs significantly when you repeatedly send the same system prompt or context. Enterprise pricing is available for large-volume customers. Check Anthropic's pricing page at anthropic.com/pricing for the latest rates.
How does Google Gemini pricing compare to OpenAI and Anthropic?
Google Gemini 2.5 Flash ($0.30/M input, $2.50/M output) sits between GPT-4o mini and GPT-4.1 mini on price. Gemini 2.5 Pro ($1.25/M input, $10/M output) competes with GPT-4o and Claude Sonnet 4.6 at a lower input price. A standout Gemini advantage is the 1M token context window available across all tiers, vs 128K for OpenAI and 200K for Anthropic.
Can I use OpenAI, Anthropic, and Google in the same product?
Yes, many teams use multiple providers — routing different tasks to whichever model handles it best and most cheaply. For example, Gemini Flash for high-volume summarisation, GPT-4o for multimodal tasks, and Claude Sonnet for long-context analysis. The challenge is that your total AI spend is split across providers, making cost attribution harder. PerUnit aggregates costs from all providers into a single view.