The Rise of the Internal Cache Platform

How Uber and Mercado Libre scaled distributed caches to billions of ops/sec, and the connection storms, ownership failures, and Valkey decisions that shaped their platforms.

Cache Rebalancing Was Broken. Here’s How Valkey 9.0 Fixed It

split screen depicting a before and after

Few things make SREs more nervous than rebalancing a cache cluster. You know the feeling. You add a node, trigger a rebalance, and suddenly latency graphs start jumping. It’s a familiar risk of the job, especially when your cache sits between your users and your database. A small configuration mistake here can suddenly unleash a storm of GET requests on your primary data store. I admit, I never really understood the concept of a slot or why it was needed. But after listening to a recent episode of the Cache It podcast on the new atomic slot migration feature in Valkey 9.0, I finally decided to dig in. The deeper […]

The 5 Metrics that Predict Cache Outages

Learn the 5 critical metrics that predict cache performance issues before they impact users. From P999 latency to cache miss rates – learn what actually matters in production.