Why segregating short jobs from long jobs under high variability is not always a win

This paper investigates the performance of task assignment policies for server farms as the variability of job sizes (service demands) approaches infinity. The Size-Interval-Task-Assignment policy (SITA), which separates short jobs from long jobs, has long been viewed as the panacea for high-variability job-size distributions. A very recent paper [16] showed that this common wisdom is flawed: for certain common job-size distributions, including many modal, hyperexponential, and Pareto distributions, SITA can actually be inferior to the much simpler greedy policy Least-Work-Left (LWL). This finding raises the question of whether isolating short jobs from long ones is inherently bad, or whether it is only SITA's strict isolation of short jobs that sometimes leads to poor performance. To answer this question, we consider a much more flexible policy, which we call “Cycle-Stealing” (CS). CS is very similar to LWL in that short jobs can go to any queue, but it still isolates short jobs from long jobs by reserving one server for short jobs. Although CS shares many properties with LWL, including high utilization of both servers, we prove, surprisingly, that for high-variability job sizes CS performs poorly whenever SITA performs poorly. This result suggests that isolating short jobs from long jobs under high-variability workloads is sometimes simply not the right thing to do.
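
To make the three dispatch rules concrete, the following is a minimal sketch of how they might be written for a two-server farm with first-come-first-served queues. It is not the paper's model or analysis. The function names, the size `cutoff` separating short from long jobs, and the toy job trace are all hypothetical. In particular, the CS rule shown here (a short job uses the long-job server only when that server is idle) is an assumption; the abstract states only that short jobs may go to any queue and that one server is reserved for them.

```python
from typing import Callable, List, Sequence, Tuple

Job = Tuple[float, float]  # (arrival_time, size); hypothetical trace format
Policy = Callable[[Sequence[float], float, float], int]


def sita(work: Sequence[float], size: float, cutoff: float) -> int:
    """Strict size intervals: short jobs to server 0, long jobs to server 1."""
    return 0 if size <= cutoff else 1


def lwl(work: Sequence[float], size: float, cutoff: float) -> int:
    """Least-Work-Left: pick the server with less unfinished work."""
    return 0 if work[0] <= work[1] else 1


def cycle_stealing(work: Sequence[float], size: float, cutoff: float) -> int:
    """Assumed CS rule: server 0 is reserved for short jobs; a short job
    'steals' server 1 only when server 1 is idle. Long jobs always use server 1."""
    if size > cutoff:
        return 1
    return 1 if work[1] == 0.0 else 0


def mean_response_time(jobs: List[Job], policy: Policy, cutoff: float) -> float:
    """Dispatch each job on arrival and return the mean response time."""
    work = [0.0, 0.0]  # unfinished work at each server
    clock = 0.0
    total = 0.0
    for arrival, size in jobs:
        # Drain work completed since the previous arrival.
        elapsed = arrival - clock
        work = [max(0.0, w - elapsed) for w in work]
        clock = arrival
        target = policy(work, size, cutoff)
        work[target] += size
        total += work[target]  # waiting time plus own service time
    return total / len(jobs)


if __name__ == "__main__":
    # Toy trace: three short jobs and one long job, all arriving at time 0.
    trace = [(0.0, 1.0), (0.0, 1.0), (0.0, 1.0), (0.0, 10.0)]
    for name, rule in [("SITA", sita), ("LWL", lwl), ("CS", cycle_stealing)]:
        print(name, mean_response_time(trace, rule, cutoff=5.0))
```

A four-job trace says nothing about the asymptotic, high-variability regime the paper studies; the sketch only shows how the three policies differ in where they send a job.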

[1] Mor Harchol-Balter, et al. On Choosing a Task Assignment Policy for a Distributed Server System, 1998, J. Parallel Distributed Comput.

[2] John A. Buzacott, et al. Stochastic models of manufacturing systems, 1993.

[3] Azer Bestavros, et al. Self-similarity in World Wide Web traffic: evidence and possible causes, 1996, SIGMETRICS '96.

[4] Timothy J. O'Donnell, et al. Analysis of the early workload on the Cornell Theory Center IBM SP2, 1996, SIGMETRICS '96.

[5] Zahir Tari, et al. Task assignment with work-conserving migration, 2006, Parallel Comput.

[6] Mor Harchol-Balter, et al. Task assignment with unknown duration, 2000, Proceedings 20th IEEE International Conference on Distributed Computing Systems.

[7] Mark S. Squillante, et al. Analysis of task assignment with cycle stealing under central queue, 2003, 23rd International Conference on Distributed Computing Systems, 2003. Proceedings.

[8] Paul Barford, et al. Generating representative Web workloads for network and server performance evaluation, 1998, SIGMETRICS '98/PERFORMANCE '98.

[9] Nigel Thomas, et al. Comparing job allocation schemes where service demand is unknown, 2008, J. Comput. Syst. Sci.

[10] Anees Shaikh, et al. Load-sensitive routing of long-lived IP flows, 1999, SIGCOMM '99.

[11] Philip S. Yu, et al. The state of the art in locally distributed Web-server systems, 2002, CSUR.

[12] Kazumasa Oida, et al. Characteristics of deterministic optimal routing for a simple traffic control problem, 1999, 1999 IEEE International Performance, Computing and Communications Conference (Cat. No.99CH36305).

[13] Zahir Tari, et al. A least flow-time first load sharing approach for distributed server farm, 2005, J. Parallel Distributed Comput.

[14] Bacel Maddah, et al. Allocation of Service Time in a Multiserver System, 2006, Manag. Sci.

[15] Zahir Tari, et al. Task assignment strategy for overloaded systems, 2003, Proceedings of the Eighth IEEE Symposium on Computers and Communications. ISCC 2003.

[16] Mark S. Squillante, et al. Cycle stealing under immediate dispatch task assignment, 2003, SPAA '03.

[17] Vishal Misra, et al. Optimal state-free, size-aware dispatching for heterogeneous M/G/-type systems, 2005, Perform. Evaluation.

[18] Mor Harchol-Balter, et al. Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness, 2004, Cluster Computing.

[19] Alan Scheller-Wolf, et al. Surprising results on task assignment in server farms with high-variability workloads, 2009, SIGMETRICS '09.

[20] Hagit Sarfati, et al. Analysis of size interval task assignment policies, 2008, PERV.

[21] Gianfranco Ciardo, et al. EQUILOAD: a load balancing policy for clustered web servers, 2001, Perform. Evaluation.