Analysis of SITA policies

We analyze the performance of Size Interval Task Assignment (SITA) policies, for multi-host assignment in a non-preemptive environment. Assuming Poisson arrivals, we provide general bounds on the average waiting time independent of the job size distribution. We establish a general duality theory for the performance of SITA policies. We provide a detailed analysis of the performance of SITA systems when the job size distribution is Bounded Pareto and the range of job sizes tends to infinity. In particular, we determine asymptotically optimal cutoff values and provide asymptotic formulas for average waiting time and slowdown. We compare the results with the Least Work Remaining policy and compute which policy is asymptotically better for any given set of parameters. In the case of inhomogeneous hosts, we determine their optimal ordering.

[1]  Philip S. Yu,et al.  The state of the art in locally distributed Web-server systems , 2002, CSUR.

[2]  Qi Zhang,et al.  Load Balancing for Performance Differentiation in Dual-Priority Clustered Servers , 2006, Third International Conference on the Quantitative Evaluation of Systems - (QEST'06).

[3]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[4]  Carey L. Williamson,et al.  A comparative analysis of web and peer-to-peer traffic , 2008, WWW.

[5]  Hagit Sarfati,et al.  Analysis of size interval task assignment policies , 2008, PERV.

[6]  Michael Mitzenmacher,et al.  Dynamic Models for File Sizes and Double Pareto Distributions , 2004, Internet Math..

[7]  S. Wittevrongel,et al.  Queueing Systems , 2019, Introduction to Stochastic Processes and Simulation.

[8]  Alan Scheller-Wolf,et al.  Delay moments for FIFO GI/GI/s queues , 1997, Queueing Syst. Theory Appl..

[9]  Alan Scheller-Wolf Necessary and Sufficient Conditions for Delay Moments in FIFO Multiserver Queues with an Application Comparing s Slow Servers with One Fast One , 2003, Oper. Res..

[10]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[11]  Mark S. Squillante,et al.  Cycle stealing under immediate dispatch task assignment , 2003, SPAA '03.

[12]  Vishal Misra,et al.  Optimal state-free, size-aware dispatching for heterogeneous M/G/-type systems , 2005, Perform. Evaluation.

[13]  Ward Whitt,et al.  The impact of a heavy-tailed service-time distribution upon the M/GI/s waiting-time distribution , 2000, Queueing Syst. Theory Appl..

[14]  Shigeru Saito,et al.  A packet-size aware adaptive routing algorithm for parallel transmission server systems , 2004, J. Parallel Distributed Comput..

[15]  Gianfranco Ciardo,et al.  Analytic modeling of load balancing policies for tasks with heavy-tailed distributions , 2000, WOSP '00.

[16]  Gianfranco Ciardo,et al.  EQUILOAD: a load balancing policy for clustered web servers , 2001, Perform. Evaluation.

[17]  Mor Harchol-Balter,et al.  Evaluation of Task Assignment Policies for Supercomputing Servers: The Case for Load Unbalancing and Fairness , 2004, Cluster Computing.

[18]  Alan Scheller-Wolf,et al.  Further delay moment results for FIFO multiserver queues , 1999, Queueing Syst. Theory Appl..

[19]  Wei Sun,et al.  ADAPTLOAD: effective balancing in clustered web servers under transient load conditions , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[20]  Mor Harchol-Balter Task assignment with unknown duration , 2002, JACM.

[21]  Leonard Kleinrock,et al.  Theory, Volume 1, Queueing Systems , 1975 .

[22]  Mor Harchol-Balter,et al.  On Choosing a Task Assignment Policy for a Distributed Server System , 1998, J. Parallel Distributed Comput..

[23]  Alan Scheller-Wolf,et al.  New bounds for expected delay in FIFO M/G/c queues , 1997, Queueing Syst. Theory Appl..

[24]  Qi Zhang,et al.  Performance-Guided Load (Un)balancing under Autocorrelated Flows , 2008, IEEE Transactions on Parallel and Distributed Systems.

[25]  M. Crovella,et al.  Heavy-tailed probability distributions in the World Wide Web , 1998 .

[26]  Wei Sun,et al.  Workload-aware load balancing for clustered Web servers , 2005, IEEE Transactions on Parallel and Distributed Systems.

[27]  Alan Scheller-Wolf,et al.  Surprising results on task assignment in server farms with high-variability workloads , 2009, SIGMETRICS '09.

[28]  Alan Scheller-Wolf,et al.  Structural interpretation and derivation of necessary and sufficient conditions for delay moments in FIFO multiserver queues , 2006, Queueing Syst. Theory Appl..

[29]  H. Liu,et al.  Conference on Measurement and modeling of computer systems , 2001 .

[30]  David L. Peterson,et al.  Fractal Patterns In DASD I/O Traffic , 1996, Int. CMG Conference.

[31]  Martin Arlitt,et al.  Web Workload Characterization: Ten Years Later , 2005 .

[32]  Jianliang Xu,et al.  Web content delivery , 2005 .

[33]  Mark A. McComb A Practical Guide to Heavy Tails , 2000, Technometrics.

[34]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .