The Tail at Scale: Just as fault-tolerant computing aims to create a reliable whole out of less-reliable parts, large online services need to create a predictably responsive whole out of less-predictable parts; we refer to such systems as “latency tail-tolerant,” or simply “tail-tolerant.” The techniques: Hedged requests, Tied requests, Micro-partition, Selectively increase replication factors, Put slow machines on probation, Consider ‘good enough’ responses, Use canary requests.