How I Cut Our LLM API Bills by 73% With Prompt Caching: A Production Engineer's Guide (2026)
Last quarter, our Anthropic console showed $612 in API costs across our six AI products. After a focused prompt-caching refactor, that figure dropped to $167, a 73% cut without changing models. Here is exactly what worked, what didn't, and which mistakes cost real money.