Analysis of job assignment with batch arrivals among heterogeneous servers

We revisit the problem of job assignment to multiple heterogeneous servers in parallel. The system under consideration, however, has a few unique features. Specifically, repair jobs arrive to the queueing system in batches according to a Poisson process. In addition, servers are heterogeneous and the service time distributions of the individual servers are general. The objective is to optimally assign each job within a batch arrival to minimize the long-run average number of jobs in the entire system. We focus on the class of static assignment policies where jobs are routed to servers upon arrival according to pre-determined probabilities. We solve the model analytically and derive the structural properties of the optimal static assignment. We show that when the traffic is below a certain threshold, it is better to not assign any jobs to slower servers. As traffic increases (either due to an increase in job arrival rate or batch size), more slower servers will be utilized. We give an explicit formula for computing the threshold. Finally we compare and evaluate the performance of the static assignment policy to two dynamic policies, specifically the shortest expected completion policy and the shortest queue policy.

[1]  Rein D. Nobel,et al.  Optimal control of a queueing system with heterogeneous servers and setup costs , 2000, IEEE Trans. Autom. Control..

[2]  M. J. M. Posner,et al.  Algorithmic and approximation analyses of the shorter queue model , 1987 .

[3]  Ger Koole,et al.  On the Assignment of Customers to Parallel Queues , 1992, Probability in the Engineering and Informational Sciences.

[4]  Randolph D. Nelson,et al.  An approximation to the response time for shortest queue routing , 1989, SIGMETRICS '89.

[5]  Frank A. Haight,et al.  TWO QUEUES IN PARALLEL , 1958 .

[6]  P. Patrick Wang,et al.  Workload distribution of discrete‐time parallel queues with two servers , 2000 .

[7]  Ivo J. B. F. Adan,et al.  Analysis of the symmetric shortest queue problem , 1990 .

[8]  Flavio Bonomi,et al.  On Job Assignment for a Parallel System of Processor Sharing Queues , 1990, IEEE Trans. Computers.

[9]  S. Stidham,et al.  Individual versus Social Optimization in the Allocation of Customers to Alternative Servers , 1983 .

[10]  Randolph D. Nelson,et al.  An Approximation for the Mean Response Time for Shortest Queue Routing with General Inerarrival and Service Times , 1993, Perform. Evaluation.

[11]  Michael Pinedo,et al.  A note on queues with Bernoulli routing , 1990, 29th IEEE Conference on Decision and Control.

[12]  Tapani Lehtonen,et al.  On the optimality of the shortest line discipline , 1984 .

[13]  C. Knessl A new heavy traffic limit for the asymmetric shortest queue problem , 1999, European Journal of Applied Mathematics.

[14]  J. George Shanthikumar,et al.  Optimal Dynamic Assignment of Customers to Heterogeneous Servers in Parallel , 1992, Oper. Res..

[15]  Tim Roughgarden,et al.  The price of anarchy in an exponential multi-server , 2007, Oper. Res. Lett..

[16]  Refael Hassin,et al.  To Queue or Not to Queue: Equilibrium Behavior in Queueing Systems , 2002 .

[17]  Xueyan Tang,et al.  Optimizing static job scheduling in a network of heterogeneous computers , 2000, Proceedings 2000 International Conference on Parallel Processing.

[18]  Winfried K. Grassmann,et al.  The shortest queue model with jockeying , 1990 .

[19]  Eitan Altman,et al.  Discrete-Event Control of Stochastic Networks - Multimodularity and Regularity , 2004, Lecture notes in mathematics.

[20]  J. George Shanthikumar,et al.  Design of manufacturing systems using queueing models , 1992, Queueing Syst. Theory Appl..

[21]  Tao Wu,et al.  A Comparative Analysis of Server Selection in Content Replication Networks , 2008, IEEE/ACM Transactions on Networking.

[22]  W. Whitt,et al.  Analysis of join-the-shortest-queue routing for web server farms , 2007, Perform. Evaluation.

[23]  R. Weber On the optimal assignment of customers to parallel servers , 1978, Journal of Applied Probability.

[24]  Sem C. Borst Optimal probabilistic allocation of customer types to servers , 1995, SIGMETRICS '95/PERFORMANCE '95.

[25]  P. R. Kumar,et al.  Optimal control of a queueing system with two heterogeneous servers , 1984 .

[26]  A. M. K. Tarabia,et al.  Analysis of two queues in parallel with jockeying and restricted capacities , 2008 .

[27]  Vladimir Rykov,et al.  Optimal Control of Queueing Systems with Heterogeneous Servers , 2004, Queueing Syst. Theory Appl..

[28]  H. McKean,et al.  Two queues in parallel , 1977 .

[29]  Robert B. Cooper Queues with Ordered Servers that Work at Different Rates: An Exact Analysis of a Model Solved Approximately by Others , 1987, Perform. Evaluation.

[30]  Mor Armony,et al.  Dynamic Routing in Large-Scale Service Systems with Heterogeneous Servers , 2005, Queueing Syst. Theory Appl..

[31]  Ronald W. Wolff,et al.  Stochastic Modeling and the Theory of Queues , 1989 .

[32]  Onno J. Boxma,et al.  Optimization of Static Traffic Allocation Policies , 1994, Theor. Comput. Sci..

[33]  Vijendra P. Singh Two-Server Markovian Queues with Balking: Heterogeneous vs. Homogeneous Servers , 1970, Oper. Res..

[34]  Harold Gumbel Waiting Lines with Heterogeneous Servers , 1960 .

[35]  Mor Harchol-Balter,et al.  On Choosing a Task Assignment Policy for a Distributed Server System , 1998, J. Parallel Distributed Comput..

[36]  Ernest Koenigsberg,et al.  On Jockeying in Queues , 1966 .

[37]  V. V. Rykov Monotone Control of Queueing Systems with Heterogeneous Servers , 2001, Queueing Syst. Theory Appl..

[38]  Scott Shenker,et al.  The Optimal Control of Heterogeneous Queueing Systems: A Paradigm for Load-Sharing and Routing , 1989, IEEE Trans. Computers.

[39]  Shlomo Halfin The shortest queue problem , 1985 .

[40]  Weiguo Fan,et al.  Web server load balancing: A queueing analysis , 2008, Eur. J. Oper. Res..

[41]  Kazumasa Oida,et al.  Characteristics of Deterministic Optimal Routing for Two Heterogeneous Parallel Servers , 2001, Int. J. Found. Comput. Sci..

[42]  J. Kingman Two Similar Queues in Parallel , 1961 .

[43]  Philip J. Fleming,et al.  HEAVY TRAFFIC APPROXIMATIONS FOR A SYSTEM OF INFINITE SERVERS WITH LOAD BALANCING , 1999, Probability in the Engineering and Informational Sciences.

[44]  J. Cohen Analysis of the asymmetrical shortest two-server queueing model , 1995 .

[45]  Ward Whitt,et al.  Deciding Which Queue to Join: Some Counterexamples , 1986, Oper. Res..

[46]  Charles Knessl,et al.  On the infinite server shortest queue problem: Non-symmetric case , 2006, Queueing Syst. Theory Appl..

[47]  Wk Grassmann,et al.  Transient and steady state results for two parallel queues , 1980 .

[48]  Gabriel R. Bitran,et al.  Production Planning of Style Goods with High Setup Costs and Forecast Revisions , 1986, Oper. Res..

[49]  Ger Koole,et al.  On the Pathwise Optimal Bernoulli Routing Policy for Homogeneous Parallel Servers , 1996, Math. Oper. Res..