// 00Tag · cost optimization

#cost optimization

2 posts

01/06/2026

LLM token cost: how to measure and optimize it

LLM token cost is growing faster than the planned AI budget. How to measure usage, where hidden costs lurk, and which optimization patterns actually work in production.

01/06/2026

Prompt caching in LLMs: how a cheaper static prefix cuts bills

LLM prompt caching in 2026: what is a static prefix cache, how it differs from semantic cache, and how to structure your prompt to hit the cache.

← all posts