AI Model Cost Comparison — OpenAI, Anthropic & Google
Compare the cost of every major OpenAI, Anthropic, and Google Gemini model side by side. Enter your token usage to see exactly what each model would cost at your scale.
Prices sourced from OpenAI, Anthropic, and Google official pricing pages · Last updated: March 2026
Estimate costs at your usage volume
Filter:
| Model | Input / 1M | Output / 1M | Est. monthly cost |
|---|---|---|---|
GoogleGemini 2.0 Flash Affordable production-ready Gemini | $0.10 | $0.40 | |
OpenAIGPT-4o mini Affordable GPT-4o variant | $0.15 | $0.60 | |
AnthropicClaude Haiku 3 Highly affordable classic | $0.25 | $1.25 | |
OpenAIGPT-4.1 mini Lightweight GPT-4.1 | $0.40 | $1.60 | |
OpenAIGPT-5 mini Fast, cost-efficient GPT-5 | $0.25 | $2.00 | |
GoogleGemini 2.5 Flash Fast and cost-efficient Gemini | $0.30 | $2.50 | |
OpenAIo4-mini Fast reasoning model | $1.10 | $4.40 | |
AnthropicClaude Haiku 4.5 Fast and cost-efficient | $1.00 | $5.00 | |
OpenAIGPT-4.1 Latest GPT-4 series | $2.00 | $8.00 | |
OpenAIo3 OpenAI reasoning model | $2.00 | $8.00 | |
GoogleGemini 2.5 Pro Highly capable Gemini model | $1.25 | $10.00 | |
OpenAIGPT-4o Multimodal GPT-4 model | $2.50 | $10.00 | |
GoogleGemini 3.1 ProPopular Google's latest flagship model | $2.00 | $12.00 | |
OpenAIGPT-5.4Popular OpenAI flagship model | $2.50 | $15.00 | |
AnthropicClaude Sonnet 4.6 Balanced intelligence and speed | $3.00 | $15.00 | |
AnthropicClaude Sonnet 4 Reliable Sonnet series | $3.00 | $15.00 | |
AnthropicClaude Opus 4.6Popular Most intelligent Claude model | $5.00 | $25.00 | |
AnthropicClaude Opus 4.1 High-capability legacy Opus | $15.00 | $75.00 |
Sorted by estimated monthly cost (cheapest first) · Prices are for standard tier without caching or batch discounts · 1,000 req/day · 500 input + 500 output tokens/req
Full pricing reference
Raw prices per 1M tokens for quick reference, regardless of volume.
OpenAI
| Model | Input | Output |
|---|---|---|
| GPT-5.4 | $2.5/M | $15/M |
| GPT-5 mini | $0.25/M | $2/M |
| GPT-4.1 | $2/M | $8/M |
| GPT-4.1 mini | $0.4/M | $1.6/M |
| GPT-4o | $2.5/M | $10/M |
| GPT-4o mini | $0.15/M | $0.6/M |
| o3 | $2/M | $8/M |
| o4-mini | $1.1/M | $4.4/M |
| Model | Input | Output |
|---|---|---|
| Gemini 3.1 Pro | $2/M | $12/M |
| Gemini 2.5 Pro | $1.25/M | $10/M |
| Gemini 2.5 Flash | $0.3/M | $2.5/M |
| Gemini 2.0 Flash | $0.1/M | $0.4/M |
Anthropic
| Model | Input | Output |
|---|---|---|
| Claude Opus 4.6 | $5/M | $25/M |
| Claude Sonnet 4.6 | $3/M | $15/M |
| Claude Haiku 4.5 | $1/M | $5/M |
| Claude Opus 4.1 | $15/M | $75/M |
| Claude Sonnet 4 | $3/M | $15/M |
| Claude Haiku 3 | $0.25/M | $1.25/M |
Using OpenAI, Anthropic, or Google? PerUnit tracks costs across all providers in one view.
See your total AI spend by customer, feature, and pricing tier — regardless of which provider served each request.
Get early access to PerUnitFrequently asked questions
- Is OpenAI cheaper than Anthropic?
- It depends on the models you compare and your usage pattern. At the flagship tier, GPT-4o ($2.50/M input, $10/M output) and Claude Sonnet 4.6 ($3.00/M input, $15/M output) are broadly comparable, with GPT-4o being slightly cheaper on output tokens. At the economy tier, GPT-4o mini ($0.15/M input) is cheaper than Claude Haiku 4.5 ($1.00/M input). Use this tool with your actual token volumes to get a meaningful comparison.
- Which AI model gives the best performance per dollar?
- For most production workloads, GPT-4o mini and Claude Haiku 4.5 offer the best performance-to-cost ratio for tasks like summarisation, classification, and extraction. Reserve flagship models like GPT-4o or Claude Sonnet for tasks that genuinely require higher reasoning capability. Routing requests to the right model based on complexity can reduce your AI bill by 40–70%.
- What is the difference between input and output tokens?
- Input tokens are the text you send to the model (your prompt plus any context). Output tokens are the text the model generates in response. Output tokens are always priced higher than input tokens — typically 3–6x more expensive. If your use case generates long responses, output costs will dominate your bill.
- Does OpenAI offer volume discounts?
- OpenAI does not publish standard volume discount tiers, but enterprise customers can negotiate pricing. OpenAI's Batch API offers a 50% discount for asynchronous workloads with up to 24-hour turnaround. Prompt caching (available on GPT-4o and other models) can reduce repeated input token costs by up to 50% when the same context appears across requests.
- Does Anthropic offer volume discounts?
- Anthropic offers prompt caching, which reduces costs significantly when you repeatedly send the same system prompt or context. Enterprise pricing is available for large-volume customers. Check Anthropic's pricing page at anthropic.com/pricing for the latest rates.
- How does Google Gemini pricing compare to OpenAI and Anthropic?
- Google Gemini 2.5 Flash ($0.30/M input, $2.50/M output) sits between GPT-4o mini and GPT-4.1 mini on price. Gemini 2.5 Pro ($1.25/M input, $10/M output) competes with GPT-4o and Claude Sonnet 4.6 at a lower input price. A standout Gemini advantage is the 1M token context window available across all tiers, vs 128K for OpenAI and 200K for Anthropic.
- Can I use OpenAI, Anthropic, and Google in the same product?
- Yes, many teams use multiple providers — routing different tasks to whichever model handles it best and most cheaply. For example, Gemini Flash for high-volume summarisation, GPT-4o for multimodal tasks, and Claude Sonnet for long-context analysis. The challenge is that your total AI spend is split across providers, making cost attribution harder. PerUnit aggregates costs from all providers into a single view.