Bayesian inference for queueing networks and modeling of internet services

Modern Internet services, such as those at Google, Yahoo!, and Amazon, handle billions of requests per day on clusters of thousands of computers. Because these services operate under strict performance requirements, a statistical understanding of their performance is of great practical interest. Such services are modeled by networks of queues, where each queue models one of the computers in the system. A key challenge is that the data are incomplete, because recording detailed information about every request to a heavily used system can require unacceptable overhead. In this paper we develop a Bayesian perspective on queueing models in which the arrival and departure times that are not observed are treated as latent variables. Underlying this viewpoint is the observation that a queueing model defines a deterministic transformation between the data and a set of independent variables called the service times. With this viewpoint in hand, we sample from the posterior distribution over missing data and model parameters using Markov chain Monte Carlo. We evaluate our framework on data from a benchmark Web application. We also present a simple technique for selection among nested queueing models. We are unaware of any previous work that considers inference in networks of queues in the presence of missing data.

[1]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[2]  N. U. Prabhu,et al.  Queueing systems: Theory and Applications , 1986 .

[3]  Paul Fearnhead,et al.  Filtering recursions for calculating likelihoods for queues based on inter-departure time data , 2004, Stat. Comput..

[4]  Michael P. Wiper,et al.  Bayesian analysis of M/Er/1 and M/H_k/1 queues , 1998, Queueing Syst. Theory Appl..

[5]  M. J. Bayarri,et al.  Prior Assessments for Prediction in Queues , 1994 .

[6]  Thomas Lengauer,et al.  ROCR: visualizing classifier performance in R , 2005, Bioinform..

[7]  Hsing Luh,et al.  Derivation of the N-step interdeparture time distribution in GI/G/1 queueing systems , 1999, Eur. J. Oper. Res..

[8]  U. Narayan Bhat,et al.  Statistical analysis of queueing systems , 1998, Queueing Syst. Theory Appl..

[9]  Gregory R. Ganger,et al.  Ironmodel: robust performance models in the wild , 2008, SIGMETRICS '08.

[10]  P. Damlen,et al.  Gibbs sampling for Bayesian non‐conjugate and hierarchical models by using auxiliary variables , 1999 .

[11]  Robert Nowak,et al.  Network Tomography: Recent Developments , 2004 .

[12]  Lee K. Jones,et al.  Inferring Balking Behavior From Transactional Data , 1999, Oper. Res..

[13]  Radford M. Neal Slice Sampling , 2003, The Annals of Statistics.

[14]  Robert Nowak,et al.  Internet tomography , 2002, IEEE Signal Process. Mag..

[15]  Luiz André Barroso,et al.  The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines , 2009, The Datacenter as a Computer: An Introduction to the Design of Warehouse-Scale Machines.

[16]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[17]  David E. Culler,et al.  An architecture for highly concurrent, well-conditioned internet services , 2002 .

[18]  Ishwar V. Basawa,et al.  Introduction : Frequentist , Bayes and empirical Bayes approaches , 2005 .

[19]  Edward D. Lazowska,et al.  Quantitative system performance - computer system analysis using queueing network models , 1983, Int. CMG Conference.

[20]  M. J. Bayarri,et al.  Bayesian prediction inM/M/1 queues , 1994, Queueing Syst. Theory Appl..

[21]  Michael I. Jordan Graphical Models , 2003 .

[22]  K. Heggland,et al.  Estimating functions in indirect inference , 2004 .

[23]  M. J. Bayarri,et al.  Dealing with Uncertainties in Queues and Networks of Queues: a Bayesian Approach , 2022 .

[24]  A. Fox,et al.  Cloudstone : Multi-Platform , Multi-Language Benchmark and Measurement Tools for Web 2 . 0 , 2008 .