The Supermarket Model with Known and Predicted Service Times

The supermarket model refers to a system with a large number of queues, where arriving customers choose $d$ queues at random and join the queue with the fewest customers. The supermarket model demonstrates the power of even small amounts of choice, as compared to simply joining a queue chosen uniformly at random, for load balancing systems. In this work we perform simulation-based studies to consider variations where service times for a customer are predicted, as might be done in modern settings using machine learning techniques or related mechanisms. Our primary takeaway is that using even seemingly weak predictions of service times can yield significant benefits over blind First In First Out queueing in this context. However, some care must be taken when using predicted service time information to both choose a queue and order elements for service within a queue; while in many cases using the information for both choosing and ordering is beneficial, in many of our simulation settings we find that simply using the number of jobs to choose a queue is better when using predicted service times to order jobs in a queue. Although this study is simulation based, our study leaves many natural theoretical open questions for future work.

[1]  Eli Upfal,et al.  Probability and Computing: Randomized Algorithms and Probabilistic Analysis , 2005 .

[2]  Michael Mitzenmacher,et al.  The Asymptotics of Selecting the Shortest of Two, Improved , 1999 .

[3]  Benny Van Houdt,et al.  Performance Analysis of Workload Dependent Load Balancing Policies , 2019, Abstracts of the 2019 SIGMETRICS/Performance Joint International Conference on Measurement and Modeling of Computer Systems.

[4]  Benny Van Houdt,et al.  On the Power-of-d-choices with Least Loaded Server Selection , 2018, SIGMETRICS.

[5]  Adam Wierman,et al.  Classifying scheduling policies with respect to unfairness in an M/GI/1 , 2003, SIGMETRICS '03.

[6]  Berthold Vöcking,et al.  How asymmetry helps load balancing , 1999, JACM.

[7]  Eli Upfal,et al.  Balanced Allocations , 1999, SIAM J. Comput..

[8]  Michael Mitzenmacher,et al.  How Useful Is Old Information? , 2000, IEEE Trans. Parallel Distributed Syst..

[9]  Mor Harchol-Balter,et al.  SOAP Bubbles: Robust Scheduling Under Adversarial Noise , 2018, 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[10]  R.M. Karp, M. Luby, F. Meyer auf der Hei Efficient PRAM simulation on a distributed memory machine , 1996 .

[11]  Michael Mitzenmacher,et al.  The Power of Two Choices in Randomized Load Balancing , 2001, IEEE Trans. Parallel Distributed Syst..

[12]  Sergei Vassilvitskii,et al.  Competitive caching with machine learned advice , 2018, ICML.

[13]  Benny Van Houdt,et al.  On the Power-of-d-choices with Least Loaded Server Selection , 2018, Proc. ACM Meas. Anal. Comput. Syst..

[14]  R. L. Dobrushin,et al.  Queueing system with selection of the shortest of two queues: an assymptotic approach , 1996 .

[15]  Michael Mitzenmacher,et al.  Studying Balanced Allocations with Differential Equations , 1999, Combinatorics, Probability and Computing.

[16]  Mor Harchol-Balter,et al.  Performance Modeling and Design of Computer Systems: Queueing Theory in Action , 2013 .

[17]  Angela H. Jiang,et al.  JamaisVu: Robust Scheduling with Auto-Estimated Job Runtimes , 2016 .

[18]  Michael Mitzenmacher,et al.  Scheduling with Predictions and the Price of Misprediction , 2019, ITCS.

[19]  Yi Lu,et al.  Asymptotic independence of queues under randomized load balancing , 2012, Queueing Syst. Theory Appl..

[20]  Adam Wierman,et al.  Fairness and scheduling in single server queues , 2011 .

[21]  Raymond A. Marie,et al.  Calculating equilibrium probabilities for λ(n)/Ck/1/N queues , 1980, Performance.

[22]  Pietro Michiardi,et al.  PSBS: Practical Size-Based Scheduling , 2014, IEEE Transactions on Computers.

[23]  Sem C. Borst,et al.  Queues with Workload-Dependent Arrival and Service Rates , 2004, Queueing Syst. Theory Appl..

[24]  Piotr Indyk,et al.  Learning-Based Frequency Estimation Algorithms , 2018, ICLR.

[25]  Reza Aghajani,et al.  Mean-field Dynamics of Load-Balancing Networks with General Service Distributions , 2015, 1512.05056.

[26]  Mor Harchol-Balter,et al.  Analysis of SRPT scheduling: investigating unfairness , 2001, SIGMETRICS '01.

[27]  Gregory R. Ganger,et al.  On the diversity of cluster workloads and its impact on research results , 2018, USENIX Annual Technical Conference.

[28]  Mor Harchol-Balter Task assignment with unknown duration , 2002, JACM.

[29]  Michael Mitzenmacher,et al.  A Model for Learned Bloom Filters and Optimizing by Sandwiching , 2018, NeurIPS.

[30]  Adam Wierman,et al.  Scheduling despite inexact job-size information , 2008, SIGMETRICS '08.

[31]  Yi Lu,et al.  Randomized load balancing with general service time distributions , 2010, SIGMETRICS '10.

[32]  Friedhelm Meyer auf der Heide,et al.  Efficient PRAM simulation on a distributed memory machine , 1992, STOC '92.

[33]  Matteo Dell'Amico,et al.  Scheduling With Inexact Job Sizes: The Merits of Shortest Processing Time First , 2019, ArXiv.

[34]  Google,et al.  Improving Online Algorithms via ML Predictions , 2024, NeurIPS.

[35]  Alan Scheller-Wolf,et al.  SOAP: One Clean Analysis of All Age-Based Scheduling Policies , 2017, PERV.

[36]  Tim Kraska,et al.  The Case for Learned Index Structures , 2018 .