Architecting services that scale to millions of users.
Spread traffic across servers so that no single one becomes a hotspot.
Keep hot data close to cut latency.
Split one big database into many smaller ones.
Add or remove nodes without reshuffling everything.
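One common way to get this property is consistent hashing, sketched below. The text does not name a technique, so this is an assumption. The `HashRing` class, the `node-a` style names, and the choice of 100 virtual nodes per server are hypothetical and chosen only for illustration.

```python
import bisect
import hashlib


class HashRing:
    """Minimal consistent-hash ring with virtual nodes (illustrative sketch)."""

    def __init__(self, nodes=(), vnodes=100):
        self.vnodes = vnodes  # virtual points per node; 100 is an arbitrary choice
        self._keys = []       # sorted hash positions on the ring
        self._owner = {}      # hash position -> node name
        for node in nodes:
            self.add(node)

    @staticmethod
    def _hash(value):
        # MD5 is used only to spread points evenly, not for security.
        return int.from_bytes(hashlib.md5(value.encode()).digest()[:8], "big")

    def add(self, node):
        for i in range(self.vnodes):
            h = self._hash(f"{node}#{i}")
            if h in self._owner:  # extremely unlikely collision: keep first owner
                continue
            self._owner[h] = node
            bisect.insort(self._keys, h)

    def remove(self, node):
        for i in range(self.vnodes):
            h = self._hash(f"{node}#{i}")
            if self._owner.get(h) == node:
                del self._owner[h]
                self._keys.remove(h)

    def lookup(self, key):
        if not self._keys:
            raise LookupError("ring is empty")
        # First ring position clockwise from the key's hash, wrapping around.
        idx = bisect.bisect_right(self._keys, self._hash(key)) % len(self._keys)
        return self._owner[self._keys[idx]]


ring = HashRing(["node-a", "node-b", "node-c"])
keys = [f"user:{i}" for i in range(1000)]
before = {k: ring.lookup(k) for k in keys}
ring.add("node-d")
after = {k: ring.lookup(k) for k in keys}
moved = [k for k in keys if before[k] != after[k]]
```

In this sketch, every key in `moved` now belongs to `node-d`; keys owned by the other three nodes stay where they were, which is the "no reshuffling" behavior described above. A plain `hash(key) % node_count` scheme would instead remap most keys whenever the node count changes.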
Decouple producers from consumers with a buffer.
Cap request rates to protect your service.
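A token bucket is one common way to cap request rates; the text does not specify an algorithm, so treat the following as a minimal sketch under that assumption. The `TokenBucket` class, its `rate` and `capacity` parameters, and the injectable `clock` are hypothetical names introduced for illustration.

```python
import time


class TokenBucket:
    """Allow bursts up to `capacity`, refilling at `rate` tokens per second."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate            # tokens added per second
        self.capacity = capacity    # maximum burst size
        self._clock = clock         # injectable so the behavior can be tested
        self._tokens = float(capacity)
        self._last = clock()

    def allow(self, cost=1.0):
        """Return True and spend `cost` tokens if available, else False."""
        now = self._clock()
        elapsed = now - self._last
        self._last = now
        # Refill for the elapsed time, never exceeding capacity.
        self._tokens = min(self.capacity, self._tokens + elapsed * self.rate)
        if self._tokens >= cost:
            self._tokens -= cost
            return True
        return False


# Usage with a fake clock so the example is deterministic.
fake_now = [0.0]
bucket = TokenBucket(rate=2, capacity=3, clock=lambda: fake_now[0])
burst = [bucket.allow() for _ in range(4)]  # three allowed, fourth rejected
fake_now[0] = 0.5                           # half a second later: one token refilled
after_wait = [bucket.allow(), bucket.allow()]
```

A caller would typically reject or delay a request when `allow()` returns `False`. This sketch is single-threaded; a shared bucket in a concurrent server would need a lock or an atomic store around the refill-and-spend step.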