Stochastic bounds in Fork–Join queueing systems under full and partial mapping

In a Fork–Join (FJ) queueing system, an upstream fork station splits incoming jobs into N tasks to be further processed by N parallel servers, each with its own queue; the response time of one job is determined, at a downstream join station, by the maximum of the corresponding tasks’ response times. This queueing system is useful to the modeling of multi-service systems subject to synchronization constraints, such as MapReduce clusters or multipath routing. Despite their apparent simplicity, FJ systems are hard to analyze. This paper provides the first computable stochastic bounds on the waiting and response time distributions in FJ systems under full (bijective) and partial (injective) mapping of tasks to servers. We consider four practical scenarios by combining (1a) renewal and (1b) non-renewal arrivals, and (2a) non-blocking and (2b) blocking servers. In the case of non-blocking servers, we prove that delays scale as $$\mathcal {O}(\log N)$$O(logN), a law which is known for first moments under renewal input only. In the case of blocking servers, we prove that the same factor of $$\log N$$logN dictates the stability region of the system. Simulation results indicate that our bounds are tight, especially at high utilizations, in all four scenarios. A remarkable insight gained from our results is that, at moderate to high utilizations, multipath routing “makes sense” from a queueing perspective for two paths only, i.e., response times drop the most when $$N=2$$N=2; the technical explanation is that the resequencing (delay) price starts to quickly dominate the tempting gain due to multipath transmissions.

[1]  Mark Handley,et al.  Improving datacenter performance and robustness with multipath TCP , 2011, SIGCOMM 2011.

[2]  N. Duffield,et al.  Exponential upper bounds via martingales for multiplexers with Markovian arrivals , 1994 .

[3]  Rajeev Gandhi,et al.  An Analysis of Traces from a Production MapReduce Cluster , 2010, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing.

[4]  Asser N. Tantawi,et al.  Approximate Analysis of Fork/Join Synchronization in Parallel Queues , 1988, IEEE Trans. Computers.

[5]  Tom White,et al.  Hadoop: The Definitive Guide , 2009 .

[6]  Felix Poloczek,et al.  Scheduling analysis with martingales , 2014, Perform. Evaluation.

[7]  Fabio Kon,et al.  A comprehensive view of Hadoop research - A systematic literature review , 2014, J. Netw. Comput. Appl..

[8]  Kevin Wilkinson,et al.  Analytical Performance Models for MapReduce Workloads , 2012, International Journal of Parallel Programming.

[9]  Elizabeth Varki Mean value technique for closed fork-join networks , 1999, SIGMETRICS '99.

[10]  Albert G. Greenberg,et al.  The nature of data center traffic: measurements & analysis , 2009, IMC '09.

[11]  Randy H. Katz,et al.  Improving MapReduce Performance in Heterogeneous Environments , 2008, OSDI.

[12]  Yijie Han,et al.  Resequencing Delays Under Multipath Routing - Asymptotics in a Simple Queueing Model , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[13]  Weikuan Yu,et al.  Non-work-conserving effects in MapReduce: diffusion limit and criticality , 2014, SIGMETRICS '14.

[14]  Felix Poloczek,et al.  Sharp per-flow delay bounds for bursty arrivals: The case of FIFO, SP, and EDF scheduling , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[15]  David Tse,et al.  On the Large Deviations of Resequencing Queue Size: 2-M/M/1 Case , 2008, IEEE Trans. Inf. Theory.

[16]  Onno Boxma,et al.  Queueing-theoretic solution methods for models of parallel and distributed systems , 1994 .

[17]  L. Flatto,et al.  Two parallel queues created by arrivals with two demands. II , 1984 .

[18]  Shivnath Babu,et al.  Towards automatic optimization of MapReduce programs , 2010, SoCC '10.

[19]  Yanpei Chen,et al.  Interactive Analytical Processing in Big Data Systems: A Cross-Industry Study of MapReduce Workloads , 2012, Proc. VLDB Endow..

[20]  Rob Pike,et al.  Interpreting the data: Parallel analysis with Sawzall , 2005, Sci. Program..

[21]  Erol Gelenbe,et al.  An End-to-End Approach to the Resequencing Problem , 1984, JACM.

[22]  George Kesidis,et al.  Network calculus for parallel processing , 2015, PERV.

[23]  Brigitte Plateau,et al.  Queueing Analysis of a Reordering Issue , 1982, IEEE Transactions on Software Engineering.

[24]  Michel Mandjes,et al.  Mean sojourn times in two-queue fork-join systems: bounds and approximations , 2012, OR Spectr..

[25]  Ward Whitt,et al.  Exponential Approximations for Tail Probabilities in Queues, I: Waiting Times , 1995, Oper. Res..

[26]  Emina Soljanin,et al.  On the Delay-Storage Trade-Off in Content Download from Coded Distributed Storage Systems , 2013, IEEE Journal on Selected Areas in Communications.

[27]  Yong Liu,et al.  Stochastic Network Calculus , 2008 .

[28]  J. Kingman Inequalities in the Theory of Queues , 1970 .

[29]  Damon Wischik,et al.  Big queues , 2004, Lecture notes in mathematics.

[30]  Cheng-Shang Chang,et al.  Performance guarantees in communication networks , 2000, Eur. Trans. Telecommun..

[31]  Armand M. Makowski,et al.  Interpolation Approximations for Symmetric Fork-Join Queues , 1994, Perform. Evaluation.

[32]  Simonetta Balsamo,et al.  Bound Performance Models of Heterogeneous Parallel Processing Systems , 1998, IEEE Trans. Parallel Distributed Syst..

[33]  Richard F. Serfozo,et al.  Sojourn times in G/M/1 fork‐join networks , 2008 .

[34]  Emina Soljanin,et al.  Coding for fast content download , 2012, 2012 50th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[35]  R. J. Gibbens,et al.  Traac Characterisation and Eeective Bandwidths for Broadband Network Traces , 1996 .

[36]  Xiaoqiao Meng,et al.  Delay tails in MapReduce scheduling , 2012, SIGMETRICS '12.

[37]  Nick G. Duffield,et al.  Exponential bounds for queues with Markovian arrivals , 1994, Queueing Syst. Theory Appl..

[38]  A. Rényi On the theory of order statistics , 1953 .

[39]  William J. Knottenbelt,et al.  Response Time Approximations in Fork-Join Queues , 2007 .

[40]  P. Billingsley,et al.  Probability and Measure , 1980 .

[41]  Guodong Pang,et al.  Gaussian Limits for a Fork-Join Network with Nonexchangeable Synchronization in Heavy Traffic , 2016, Math. Oper. Res..

[42]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[43]  F. Baccelli,et al.  The fork-join queue and related systems with synchronization constraints: stochastic ordering and computable bounds , 1989, Advances in Applied Probability.