Question 1

Is OpenAI cheaper than Anthropic?

Accepted Answer

It depends on the models you compare and your usage pattern. At the flagship tier, GPT-4o ($2.50/M input, $10/M output) and Claude Sonnet 4.6 ($3.00/M input, $15/M output) are broadly comparable. At the economy tier, GPT-4o mini ($0.15/M input) is cheaper than Claude Haiku 4.5 ($1.00/M input).

Question 2

Which AI model gives the best performance per dollar?

Accepted Answer

For most production workloads, GPT-4o mini and Claude Haiku 4.5 offer the best performance-to-cost ratio for tasks like summarisation, classification, and extraction. Routing requests to the right model based on complexity can reduce your AI bill by 40–70%.

Question 3

What is the difference between input and output tokens?

Accepted Answer

Input tokens are the text you send to the model (your prompt plus any context). Output tokens are the text the model generates in response. Output tokens are always priced higher than input tokens — typically 3–6x more expensive.

Question 4

How does Google Gemini pricing compare to OpenAI and Anthropic?

Accepted Answer

Gemini 2.5 Flash ($0.30/M input, $2.50/M output) sits between GPT-4o mini and GPT-4.1 mini on price. Gemini 2.5 Pro ($1.25/M input, $10/M output) competes with GPT-4o at a lower input price. A standout Gemini advantage is its 1M token context window across all tiers, vs 128K for most OpenAI models and 200K for Claude.

Question 5

Can I use OpenAI, Anthropic, and Google in the same product?

Accepted Answer

Yes, many teams use multiple providers — routing different tasks to whichever model handles it best. For example, Gemini Flash for high-volume summarisation, GPT-4o for multimodal tasks, and Claude Sonnet for long-context analysis. The challenge is that your total AI spend is split across providers. PerUnit aggregates costs from all three providers into a single view.

Question 6

Does OpenAI offer volume discounts?

Accepted Answer

OpenAI's Batch API offers a 50% discount for asynchronous workloads with up to 24-hour turnaround. Prompt caching can reduce repeated input token costs by up to 50%. Enterprise customers can negotiate pricing directly.

Model	Input / 1M	Output / 1M	Est. monthly cost
GoogleGemini 2.0 Flash Google's budget option — cheapest Gemini model for simple, high-volume tasks	$0.10	$0.40	$7.50
OpenAIGPT-4o mini Fastest and cheapest GPT-4o variant — good for high-volume, latency-sensitive tasks	$0.15	$0.60	$11.25
OpenAIGPT-5.4 nano Cheapest GPT-5.4-class model — built for simple, high-volume tasks	$0.20	$1.25	$21.75
AnthropicClaude Haiku 3 Cheapest Claude model — great value for simple tasks that don't need the latest generation	$0.25	$1.25	$22.50
OpenAIGPT-4.1 mini Lightweight GPT-4.1 — a solid default for most production workloads	$0.40	$1.60	$30.00
GoogleGemini 2.5 Flash Fast and cost-efficient — the go-to Gemini model for production at scale	$0.30	$2.50	$42.00
OpenAIGPT-5.4 mini Strongest mini model yet — strong on coding, computer use, and subagents	$0.75	$4.50	$78.75
OpenAIo4-mini Compact reasoning model — cheaper than o3, still much stronger than GPT-4o on logic tasks	$1.10	$4.40	$82.50
AnthropicClaude Haiku 4.5 Fastest and most cost-efficient Claude model — ideal for high-volume, latency-sensitive tasks	$1.00	$5.00	$90.00
OpenAIGPT-4.1 Previous OpenAI flagship — still strong on coding, reasoning, and instruction following	$2.00	$8.00	$150.00
OpenAIo3 OpenAI's reasoning model — slower and pricier, but significantly better at multi-step logic	$2.00	$8.00	$150.00
GoogleGemini 2.5 Pro Previous Gemini flagship — 1M token context, widely deployed in production	$1.25	$10.00	$168.75
OpenAIGPT-4o Multimodal model with vision — text, images, and audio in one API	$2.50	$10.00	$187.50
GoogleGemini 3.1 ProPopular Google's latest flagship — 2M token context, strong on reasoning, code, and multimodal tasks	$2.00	$12.00	$210.00
OpenAIGPT-5.4Popular OpenAI's most capable model for professional work — strongest reasoning and coding	$2.50	$15.00	$262.50
AnthropicClaude Sonnet 4.6 Optimal balance of intelligence, cost, and speed — Anthropic's go-to for most production workloads	$3.00	$15.00	$270.00
AnthropicClaude Sonnet 4 Previous Sonnet generation — still widely used in production	$3.00	$15.00	$270.00
AnthropicClaude Opus 4.6Popular Anthropic's most intelligent model — best for complex agents, coding, and reasoning tasks	$5.00	$25.00	$450.00
AnthropicClaude Opus 4.1 Legacy high-capability Opus model — available for workloads already built around it	$15.00	$75.00	$1,350.00

Model	Input	Output
GPT-5.4	$2.5/M	$15/M
GPT-5.4 mini	$0.75/M	$4.5/M
GPT-5.4 nano	$0.2/M	$1.25/M
GPT-4.1	$2/M	$8/M
GPT-4.1 mini	$0.4/M	$1.6/M
GPT-4o	$2.5/M	$10/M
GPT-4o mini	$0.15/M	$0.6/M
o3	$2/M	$8/M
o4-mini	$1.1/M	$4.4/M

Model	Input	Output
Gemini 3.1 Pro	$2/M	$12/M
Gemini 2.5 Pro	$1.25/M	$10/M
Gemini 2.5 Flash	$0.3/M	$2.5/M
Gemini 2.0 Flash	$0.1/M	$0.4/M

Model	Input	Output
Claude Opus 4.6	$5/M	$25/M
Claude Sonnet 4.6	$3/M	$15/M
Claude Haiku 4.5	$1/M	$5/M
Claude Opus 4.1	$15/M	$75/M
Claude Sonnet 4	$3/M	$15/M
Claude Haiku 3	$0.25/M	$1.25/M

AI Model Cost Comparison — OpenAI, Anthropic & Google

Full pricing reference

Frequently asked questions

AI Model Cost Comparison — OpenAI, Anthropic & Google

Full pricing reference

Frequently asked questions

Get a monthly email when AI model pricing changes