A Better Model for Job Redundancy: Decoupling Server Slowdown and Job Size


Abstract:

Recent computer systems research has proposed using redundant requests to reduce latency. The idea is to replicate a request so that it joins the queue at multiple servers. The request is considered complete as soon as any one of its copies completes. Redundancy reduces response time by overcoming server-side variability: a server might be temporarily slow due to factors such as background load, network interrupts, and garbage collection. In the past few years, queueing theorists have begun to study redundancy, first via approximations, and, more recently, via exact analysis. Unfortunately, for analytical tractability, most existing theoretical analysis has assumed an Independent Runtimes (IR) model, wherein the replicas of a job each experience independent runtimes (service times) at different servers. The IR model is unrealistic and has led to theoretical results that can be at odds with computer systems implementation results. This paper introduces a much more realistic model of redundancy. Our model decouples the inherent job size (X) from the server-side slowdown (S), where we track both S and X for each job. Analysis within the S&X model is, of course, much more difficult. Nevertheless, we design a dispatching policy, Redundant-to-Idle-Queue, which is both analytically tractable within the S&X model and has provably excellent performance.
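To make the contrast between the two models concrete, the following Python sketch compares the mean time until a job's first replica finishes under an IR-style model and under an S&X-style model. It is an illustration only, not the paper's analysis: it ignores queueing entirely, and the product form (replica runtime = S times X), the exponential distributions, the number of replicas `d`, and all function names are assumptions introduced here.

```python
import random


def sx_runtime(job_size, slowdowns):
    """First-replica completion time in an S&X-style model.

    Assumption: a replica's runtime is slowdown * job_size, so every
    replica shares the same inherent size X and only S varies by server.
    """
    return job_size * min(slowdowns)


def ir_runtime(replica_runtimes):
    """First-replica completion time when replica runtimes are independent."""
    return min(replica_runtimes)


def simulate(num_jobs=20_000, d=4, seed=0):
    """Return (mean S&X runtime, mean IR runtime) for jobs replicated d times.

    Assumed distributions (hypothetical, chosen for simplicity):
    X ~ Exp(1) and S = 1 + Exp(1), so a server never speeds a job up.
    """
    rng = random.Random(seed)
    sx_total = 0.0
    ir_total = 0.0
    for _ in range(num_jobs):
        # S&X: one inherent size per job, one independent slowdown per server.
        x = rng.expovariate(1.0)
        slowdowns = [1.0 + rng.expovariate(1.0) for _ in range(d)]
        sx_total += sx_runtime(x, slowdowns)

        # IR: each replica redraws its whole runtime independently, with the
        # same per-replica distribution as above.
        independent = [
            rng.expovariate(1.0) * (1.0 + rng.expovariate(1.0))
            for _ in range(d)
        ]
        ir_total += ir_runtime(independent)
    return sx_total / num_jobs, ir_total / num_jobs


if __name__ == "__main__":
    sx_mean, ir_mean = simulate()
    print(f"mean runtime, S&X-style model: {sx_mean:.3f}")
    print(f"mean runtime, IR-style model:  {ir_mean:.3f}")
```

Under these assumptions a single replica has the same runtime distribution in both models, yet the IR-style mean comes out smaller, because replication under IR can also "replace" a large job with a smaller one. In the S&X-style model, replication can only help a job escape a slow server.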
Published in: IEEE/ACM Transactions on Networking (Volume: 25, Issue: 6, December 2017)
Page(s): 3353 - 3367
Date of Publication: 22 September 2017
