Decay function model for resource configuration and adaptive allocation on Internet servers

Server-side resource configuration and allocation for QoS guarantee is a challenge in performance-critical Internet applications. To overcome the difficulties caused by the high-variability and burstiness of Internet traffic, this paper presents a decay function model of request scheduling algorithms for the resource configuration and allocation problem. Under the decay function model, request scheduling is modelled as a transfer-function based filter system that has an input process of requests and an output process of server load. Unlike conventional queueing network models that rely on mean-value analysis for input renewal or Markovian processes, this decay function model works for general time-series based or measurement based processes and hence facilitates the study of statistical correlations between the request traffic, server load, and QoS of requests. Based on the model, we apply filter design theories in signal processing in the optimality analysis of various scheduling algorithms. We reveal a relationship between the server capacity, scheduling policy, service deadline, and other request properties in a formalism and present the optimality condition with respect to the second moments of request properties for an important class of fixed-time scheduling policies. Simulation results verify the relationship and show that optimal fixed-time scheduling can effectively reduce the server workload variance and guarantee service deadlines with high robustness on the Internet.

[1]  Lui Sha,et al.  Feedback control with queueing-theoretic prediction for relative delay guarantees in web servers , 2003, The 9th IEEE Real-Time and Embedded Technology and Applications Symposium, 2003. Proceedings..

[2]  Daniel A. Menascé,et al.  Scaling for E-Business: Technologies, Models, Performance, and Capacity Planning , 2000 .

[3]  Oliver Rose,et al.  Statistical properties of MPEG video traffic and their impact on traffic modeling in ATM systems , 1995, Proceedings of 20th Conference on Local Computer Networks.

[4]  Sándor Molnár,et al.  Queueing performance estimation for general multifractal traffic , 2003, Int. J. Commun. Syst..

[5]  Walter Willinger,et al.  Experimental queueing analysis with long-range dependent packet traffic , 1996, TNET.

[6]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[7]  Ward Whitt,et al.  Extending the effective bandwidth concept to networks with priority classes , 1998 .

[8]  Richard G. Baraniuk,et al.  Multiscale queuing analysis of long-range-dependent network traffic , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[9]  Azer Bestavros,et al.  Temporal Locality in Web Request Streams , 2000 .

[10]  Dimitris G. Manolakis,et al.  Statistical and Adaptive Signal Processing , 2000 .

[11]  Mark S. Squillante,et al.  Time-function scheduling: a general approach to controllable resource , 1995, SOSP.

[12]  Ilkka Norros,et al.  A storage model with self-similar input , 1994, Queueing Syst. Theory Appl..

[13]  Hamid Ahmadi,et al.  Equivalent Capacity and Its Application to Bandwidth Allocation in High-Speed Networks , 1991, IEEE J. Sel. Areas Commun..

[14]  Dick H. J. Epema,et al.  Decay-usage scheduling in multiprocessors , 1998, TOCS.

[15]  Azer Bestavros,et al.  Self-similarity in World Wide Web traffic: evidence and possible causes , 1996, SIGMETRICS '96.

[16]  Frank P. Kelly,et al.  Effective bandwidths at multi-class queues , 1991, Queueing Syst. Theory Appl..

[17]  W. Feller,et al.  An Introduction to Probability Theory and Its Applications, Vol. II , 1972, The Mathematical Gazette.

[18]  David Finkel Brief review: Computer Networks & Systems: Queueing Theory and Performance Evaluation by Thomas Robertazzi (Springer-Verlag, 1990) , 1991, PERV.

[19]  Nsf Ncr,et al.  A Generalized Processor Sharing Approach to Flow Control in Integrated Services Networks: The Single Node Case* , 1991 .

[20]  Jean-Yves Le Boudec,et al.  Network Calculus , 2001, Lecture Notes in Computer Science.

[21]  Abhay Parekh,et al.  A generalized processor sharing approach to flow control in integrated services networks: the single-node case , 1993, TNET.

[22]  Nicola Blefari-Melazzi,et al.  Steady-state analysis of the MMPP/G/1/K queue , 1993, IEEE Trans. Commun..

[23]  Mark S. Squillante,et al.  Web traffic modeling and Web server performance analysis , 1999, PERV.

[24]  Jean C. Walrand,et al.  Effective bandwidths for multiclass Markov fluids and other ATM sources , 1993, TNET.

[25]  Sang Hyuk Son,et al.  A feedback control approach for guaranteeing relative delays in Web servers , 2001, Proceedings Seventh IEEE Real-Time Technology and Applications Symposium.

[26]  Cathy H. Xia,et al.  Queueing systems with long-range dependent input process and subexponential service times , 2003, SIGMETRICS '03.

[27]  Shudong Jin,et al.  Temporal Locality in Web Request Streams Sources, Characteristics, and Caching Implications* An Extended Abstract , 1999, SIGMETRICS 2000.

[28]  Wilton R. Abbott,et al.  Network Calculus , 1970 .

[29]  Carey L. Williamson,et al.  Internet Web servers: workload characterization and performance implications , 1997, TNET.

[30]  Azer Bestavros,et al.  Temporal locality in Web request streams (poster session) (extended abstract): sources, characteristics, and caching implications , 2000, SIGMETRICS '00.

[31]  W. Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[32]  Lui Sha,et al.  Queueing model based network server performance control , 2002, 23rd IEEE Real-Time Systems Symposium, 2002. RTSS 2002..

[33]  Vern Paxson,et al.  Fast, approximate synthesis of fractional Gaussian noise for generating self-similar network traffic , 1997, CCRV.

[34]  Marwan Krunz,et al.  The Correlation Structure for a Class of Scene/Based Video Models and Its Impact on the Dimensioning of Video Buffers , 2000, IEEE Trans. Multim..

[35]  Louis P. Slothouber,et al.  A Model of Web Server Performance , 1996 .

[36]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[37]  Vern Paxson,et al.  Fast approximation of self-similar network traffic , 1995, SIGCOMM 1995.

[38]  San-qi Li,et al.  On the convergence of traffic measurement and queueing analysis: a statistical-matching and queueing (SMAQ) tool , 1997, TNET.

[39]  Keith W. Ross,et al.  A framework for guaranteeing statistical QoS , 2002, TNET.

[40]  Daniel P. Siewiorek,et al.  Practical solutions for QoS-based resource allocation problems , 1998, Proceedings 19th IEEE Real-Time Systems Symposium (Cat. No.98CB36279).