Blog

Practical guides on AI cost attribution, unit economics, and reducing your OpenAI and Anthropic API spend.

April 19, 2026

How to Track OpenAI API Costs by Customer (and What Changes When You Add Claude)

Your OpenAI dashboard shows total spend, not who's driving it. Here's what actually works for cost attribution by customer, what wastes engineering time, and what changes once you've also got Anthropic and Google in the mix.

Read article

April 19, 2026

GPT-4o vs GPT-4.1: When the Cheaper Model Is Actually the Same Model

GPT-4o costs $2.50/1M input. GPT-4.1 costs $2.00/1M — about 20% less for comparable text quality. When that gap matters, when GPT-4o is still the right call, and the bigger lever most teams miss.

Read article

April 19, 2026

When GPT-4o Mini Is Good Enough (And When It Quietly Isn't)

GPT-4o mini is 10–13× cheaper than GPT-4.1. For some workloads the quality gap is invisible. For others, one bad output costs more than a year of token savings. Here's how we figured out which was which.

Read article

April 19, 2026

Claude vs GPT vs Gemini for Long Context: Which Is Cheapest in 2026?

Long-context workloads are input-heavy, and input pricing varies wildly between providers. Real numbers for OpenAI, Anthropic, and Google at 80K tokens — and the trade-off that decides which one wins for you.

Read article

April 19, 2026

OpenAI Batch API: When Half-Price Is Worth the 24-Hour Wait

OpenAI's Batch API gives a flat 50% discount on jobs that can wait up to 24 hours for results. We saved $1,800/month on three workloads. Here's how to figure out which of yours qualify, and which definitely don't.

Read article

March 14, 2026

AI Cost Per Customer: The Question Our Board Asked (And We Couldn't Answer)

Total AI spend doesn't tell you what each customer costs to serve. Why AI cost per customer and unit economics matter for SaaS, and how to get cost attribution for pricing and board conversations.

Read article