LLM prompt caching in 2026: what a static prefix cache is, how it differs from a semantic cache, and how to structure your prompt to hit the cache.
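As a rough illustration of the prompt-structure point, here is a minimal sketch that assumes a prefix cache can only reuse the part of a request that is identical from the very first position onward. The names `STATIC_PREFIX`, `cache_friendly`, `cache_hostile`, and `shared_prefix_len` are hypothetical, and the comparison is done on characters for simplicity, whereas real providers typically match on tokens and apply their own minimum lengths and rules.

```python
# Hypothetical static content that is identical across requests.
STATIC_PREFIX = (
    "You are a support assistant. Answer using the policy below.\n"
    "Policy: refunds are accepted within 30 days.\n"
)


def cache_friendly(question: str) -> str:
    """Static content first, per-request content last."""
    return STATIC_PREFIX + "Question: " + question


def cache_hostile(question: str) -> str:
    """Per-request content first, so the prompts diverge almost immediately."""
    return "Question: " + question + "\n" + STATIC_PREFIX


def shared_prefix_len(a: str, b: str) -> int:
    """Length of the longest common prefix of two prompts (character-level stand-in)."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


q1 = "How do I get a refund?"
q2 = "Where is my order?"

# Reusable span when the static block leads the prompt.
friendly = shared_prefix_len(cache_friendly(q1), cache_friendly(q2))
# Reusable span when the changing question leads the prompt.
hostile = shared_prefix_len(cache_hostile(q1), cache_hostile(q2))

print(friendly, hostile)
```

With the static block first, the two prompts agree for the whole of `STATIC_PREFIX` and beyond. With the question first, they diverge as soon as the questions differ, so under this assumption there is almost nothing for a prefix cache to reuse.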