Handling load with less stress

We study how the average performance of a system degrades as the load nears its peak capacity. We restrict our attention to the performance measures of average sojourn time and the large deviation rates of buffer overflow probabilities. We first show that for certain queueing systems, the average sojourn time of requests depends much more weakly on the load ρ than the commonly observed 1/(1−ρ) dependence for most queueing policies. For example, we show that for an M/G/1 system under the preemptive Shortest Job First (pSJF) policy, the average sojourn time varies as log (1/(1−ρ)) with load for a certain class of distributions.We observe that such results hold even for more restricted policies. We give some examples of non-preemptive policies and policies that do not use the knowledge of job sizes while scheduling, where the dependence of average sojourn time on load is significantly better than 1/(1−ρ). Similar results hold even for very simple non-preemptive threshold based policies that partition all the jobs into two job classes based on a fixed threshold and do FIFO within each class. Finally we study the large deviations rate of the queue length under a simple dedicated partition-based policy.

[1]  Ward Whitt,et al.  Comparison methods for queues and other stochastic models , 1986 .

[2]  Mor Harchol-Balter Task assignment with unknown duration , 2002, JACM.

[3]  Vishal Misra,et al.  Mixed scheduling disciplines for network flows , 2003, PERV.

[4]  J. George Shanthikumar,et al.  Scheduling Multiclass Single Server Queueing Systems to Stochastically Maximize the Number of Successful Departures , 1989, Probability in the Engineering and Informational Sciences.

[5]  Nikhil Bansal On the average sojourn time under M/M/1/SRPT , 2005, Oper. Res. Lett..

[6]  Linus Schrage,et al.  Letter to the Editor - A Proof of the Optimality of the Shortest Remaining Processing Time Discipline , 1968, Oper. Res..

[7]  S. Wittevrongel,et al.  Queueing Systems , 2019, Introduction to Stochastic Processes and Simulation.

[8]  Sarah Williams,et al.  Computer applications , 1988 .

[9]  Thomas M. O'Donovan Technical Note - Distribution of Attained and Residual Service in General Queuing Systems , 1974, Oper. Res..

[10]  Adam Wierman,et al.  Nearly insensitive bounds on SMART scheduling , 2005, SIGMETRICS '05.

[11]  William L. Maxwell,et al.  Theory of scheduling , 1967 .

[12]  J. Tsitsiklis,et al.  On the large deviations behavior of acyclic networks of $G/G/1$ queues , 1998 .

[13]  Ronald W. Wolff,et al.  Stochastic Modeling and the Theory of Queues , 1989 .

[14]  Nikhil Bansal On the average sojourn time under M/M/1/SRPT , 2003, PERV.

[15]  R. Stephenson A and V , 1962, The British journal of ophthalmology.

[16]  N. L. Lawrie,et al.  Comparison Methods for Queues and Other Stochastic Models , 1984 .