Effective Straggler Mitigation: Attack of the Clones

Small jobs, that are typically run for interactive data analyses in datacenters, continue to be plagued by disproportionately long-running tasks called stragglers. In the production clusters at Facebook and Microsoft Bing, even after applying state-of-the-art straggler mitigation techniques, these latency-sensitive jobs have stragglers that are on average 8 times slower than the median task in that job. Such stragglers increase the average job duration by

