Workload service requirements analysis: a queueing network optimization approach

The answers to many important performance related questions with multiclass queueing models depends upon having estimates for the service times of different classes of jobs. We present a general approach to infer the per-class service times at different servers in an environment where only server throughput, utilization and per-class response time measurements are available. The per-class service times are solutions to an optimization problem with queueing-theoretic formulas in the objective and constraints. We further study the impact of the variance of service times on the variance of response times. A few case studies are presented to demonstrate the power of our approach.

[1]  Dimitris Bertsimas,et al.  Deducing queueing from transactional data: the queue inference engine, revisited , 1990, 29th IEEE Conference on Decision and Control.

[2]  Wushow Chou,et al.  Queueing Systems, Volume II: Computer Applications - Leonard Kleinrock , 1977, IEEE Transactions on Communications.

[3]  Mark S. Squillante,et al.  Internet traffic: periodicity, tail behavior, and performance implications , 2000 .

[4]  Zhen Liu,et al.  Web traffic modeling and performance comparison between HTTP1.0 and HTTP1.1 , 2000 .

[5]  Richard C. Larson The queue inference engine: deducing queue statistics from transactional data , 1990 .

[6]  Ronald W. Wolff,et al.  Stochastic Modeling and the Theory of Queues , 1989 .

[7]  Robert B. Cooper,et al.  Queueing systems, volume II: computer applications : By Leonard Kleinrock. Wiley-Interscience, New York, 1976, xx + 549 pp. , 1977 .

[8]  Mark S. Squillante,et al.  Analysis and characterization of large‐scale Web server access patterns and performance , 1999, World Wide Web.

[9]  Vinod Sharma,et al.  Estimating Traffic Parameters in Queueing Systems with Local Information , 1998, Perform. Evaluation.

[10]  Donald F. Towsley,et al.  Inferring network characteristics via moment-based estimators , 2001, Proceedings IEEE INFOCOM 2001. Conference on Computer Communications. Twentieth Annual Joint Conference of the IEEE Computer and Communications Society (Cat. No.01CH37213).

[11]  Mark S. Squillante,et al.  Web traffic modeling and Web server performance analysis , 1999, PERV.

[12]  Leonard Kleinrock,et al.  Queueing Systems: Volume I-Theory , 1975 .

[13]  Guerney D. H. Hunt,et al.  Network Dispatcher: A Connection Router for Scalable Internet Services , 1998, Comput. Networks.

[14]  Edward G. Coffman,et al.  Waiting Time Distributions for Processor-Sharing Systems , 1970, JACM.