Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach

The Internet is undergoing substantial changes from a communication and browsing infrastructure to a medium for conducting business and marketing a myriad of services. The World Wide Web provides a uniform and widely-accepted application interface used by these services to reach multitudes of clients. These changes place the Web server at the center of a gradually emerging e-service infrastructure with increasing requirements for service quality and reliability guarantees in an unpredictable and highly-dynamic environment. This paper describes performance control of a Web server using classical feedback control theory. We use feedback control theory to achieve overload protection, performance guarantees, and service differentiation in the presence of load unpredictability. We show that feedback control theory offers a promising analytic foundation for providing service differentiation and performance guarantees. We demonstrate how a general Web server may be modeled for purposes of performance control, present the equivalents of sensors and actuators, formulate a simple feedback loop, describe how it can leverage on real-time scheduling and feedback-control theories to achieve per-class response-time and throughput guarantees, and evaluate the efficacy of the scheme on an experimental testbed using the most popular Web server, Apache. Experimental results indicate that control-theoretic techniques offer a sound way of achieving desired performance in performance-critical Internet applications. Our QoS (Quality-of-Service) management solutions can be implemented either in middleware that is transparent to the server, or as a library called by server code.

[1]  James W. Layland,et al.  Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment , 1989, JACM.

[2]  F. G. Greg Shinskey,et al.  Process Control Systems: Application, Design and Tuning , 1990 .

[3]  Alan Burns,et al.  Deadline Monotonic Scheduling Theory , 1992 .

[4]  A. Burns,et al.  Deadline monotonic scheduling theory and application , 1993 .

[5]  Clifford W. Mercer Operating system support for multimedia applications , 1994, MULTIMEDIA '94.

[6]  Scott Shenker,et al.  Integrated Services in the Internet Architecture : an Overview Status of this Memo , 1994 .

[7]  Andrew T. Campbell,et al.  A quality of service architecture , 1994, CCRV.

[8]  Stefan Savage,et al.  Processor capacity reserves: operating system support for multimedia applications , 1994, 1994 Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[9]  Aurel A. Lazar,et al.  A Binding Architecture for Multimedia Networks , 1994, J. Parallel Distributed Comput..

[10]  Daniel Mossé,et al.  V-NET: a framework for a versatile network architecture to support real-time communication performance guarantees , 1995, Proceedings of INFOCOM'95.

[11]  Klara Nahrstedt,et al.  The QOS Broker , 1995, IEEE Multim..

[12]  Aurel A. Lazar,et al.  A Binding Architecture for Multimedia Networks , 1995, J. Parallel Distributed Comput..

[13]  Steven McCanne,et al.  An application level video gateway , 1995, MULTIMEDIA '95.

[14]  Panos K. Chrysanthis,et al.  Framework for experimenting with QoS for multimedia services , 1996, Electronic Imaging.

[15]  Eric A. Brewer,et al.  Adapting to network and client variability via on-demand dynamic distillation , 1996, ASPLOS VII.

[16]  Klara Nahrstedt,et al.  Design, Implementation, and Experiences of the OMEGA End-Point Architecture , 1996, IEEE J. Sel. Areas Commun..

[17]  Harrick M. Vin,et al.  A hierarchial CPU scheduler for multimedia operating systems , 1996, OSDI '96.

[18]  Klara Nahrstedt,et al.  An end-to-end QoS model and management architecture , 1997 .

[19]  Thomas F. Lawrence,et al.  Modeling applications for adaptive QoS-based resource management , 1997, Proceedings 1997 High-Assurance Engineering Workshop.

[20]  Kang G. Shin,et al.  QoS negotiation in real-time systems and its application to automated flight control , 1997, Proceedings Third IEEE Real-Time Technology and Applications Symposium.

[21]  Michael B. Jones,et al.  CPU reservations and time constraints: efficient, predictable scheduling of independent activities , 1997, SOSP.

[22]  Lakshman Krishnamurthy AQUA: an adaptive quality of service architecture for distributed multimedia applications , 1997 .

[23]  M. Humphrey,et al.  The DQM Architecture: Middleware for Application-centered QoS Resource Management , 1997 .

[24]  Scott A. Brandt,et al.  A dynamic quality of service middleware agent for mediating application resource usage , 1998, Proceedings 19th IEEE Real-Time Systems Symposium (Cat. No.98CB36279).

[25]  Zheng Wang,et al.  An Architecture for Differentiated Services , 1998, RFC.

[26]  Andrew T. Campbell,et al.  A survey of QoS architectures , 1998, Multimedia Systems.

[27]  David L. Black,et al.  An Architecture for Differentiated Service , 1998 .

[28]  Steven McCanne,et al.  An active service framework and its application to real-time multimedia transcoding , 1998, SIGCOMM '98.

[29]  Kang G. Shin,et al.  End-host architecture for QoS-adaptive communication , 1998, Proceedings. Fourth IEEE Real-Time Technology and Applications Symposium (Cat. No.98TB100245).

[30]  John P. Lehoczky,et al.  Practical Solutions for QoS-Based Resource Allocation , 1998, RTSS 1998.

[31]  David Mosberger,et al.  httperf—a tool for measuring web server performance , 1998, PERV.

[32]  Michael D. Smith,et al.  Using Path Profiles to Predict HTTP Requests , 1998, Comput. Networks.

[33]  Lars C. Wolf,et al.  HeiRAT - Quality-of-service management for distributed multimedia systems , 1998, Multimedia Systems.

[34]  Daniel P. Siewiorek,et al.  Practical solutions for QoS-based resource allocation problems , 1998, Proceedings 19th IEEE Real-Time Systems Symposium (Cat. No.98CB36279).

[35]  Karsten Schwan,et al.  FARA-a framework for adaptive resource allocation in complex real-time systems , 1998, Proceedings. Fourth IEEE Real-Time Technology and Applications Symposium (Cat. No.98TB100245).

[36]  Karsten Schwan,et al.  FARACost: an adaptation cost model aware of pending constraints , 1999, Proceedings 20th IEEE Real-Time Systems Symposium (Cat. No.99CB37054).

[37]  Kang G. Shin,et al.  QoS adaptation in real-time systems , 1999 .

[38]  Tarek F. Abdelzaher,et al.  Web Content Adaptation to Improve Server Overload Behavior , 1999, Comput. Networks.

[39]  Nina Bhatti,et al.  Web server support for tiered services , 1999, IEEE Netw..

[40]  Mahadev Satyanarayanan,et al.  Experience with adaptive mobile applications in Odyssey , 1999, Mob. Networks Appl..

[41]  Peter Druschel,et al.  Resource containers: a new facility for resource management in server systems , 1999, OSDI '99.

[42]  Calton Pu,et al.  A feedback-driven proportion allocator for real-rate scheduling , 1999, OSDI '99.

[43]  T.F. Abdelzaher,et al.  Web server QoS management by adaptive content delivery , 1999, 1999 Seventh International Workshop on Quality of Service. IWQoS'99. (Cat. No.98EX354).

[44]  Kang G. Shin,et al.  QoS provisioning with qContracts in web and multimedia servers , 1999, Proceedings 20th IEEE Real-Time Systems Symposium (Cat. No.99CB37054).

[45]  Sang Hyuk Son,et al.  Design and evaluation of a feedback control EDF scheduling algorithm , 1999, Proceedings 20th IEEE Real-Time Systems Symposium (Cat. No.99CB37054).

[46]  Allan Kuchinsky,et al.  Quality is in the eye of the beholder: meeting users' requirements for Internet quality of service , 2000, CHI.

[47]  Kang G. Shin,et al.  QoS negotiation in real-time systems and its application to automated flight control , 1997, Proceedings Third IEEE Real-Time Technology and Applications Symposium.

[48]  Tarek F. Abdelzaher,et al.  An automated profiling subsystem for QoS-aware services , 2000, Proceedings Sixth IEEE Real-Time Technology and Applications Symposium. RTAS 2000.

[49]  Edward W. Knightly,et al.  Multi-class latency-bounded Web services , 2000, 2000 Eighth International Workshop on Quality of Service. IWQoS 2000 (Cat. No.00EX400).

[50]  Amin Vahdat,et al.  Application-level differentiated multimedia Web services using quality aware transcoding , 2000, IEEE Journal on Selected Areas in Communications.

[51]  Gautam G. Rao,et al.  Application Level Differentiated Services for Web Servers , 2000 .

[52]  Prasant Mohapatra,et al.  Scalable internet servers: issues and challenges , 2000, PERV.

[53]  Allan Kuchinsky,et al.  Integrating user-perceived quality into Web server design , 2000, Comput. Networks.

[54]  Sang Hyuk Son,et al.  A feedback control approach for guaranteeing relative delays in Web servers , 2001, Proceedings Seventh IEEE Real-Time Technology and Applications Symposium.

[55]  Tarek F. Abdelzaher,et al.  Differentiated caching services; a control-theoretical approach , 2001, Proceedings 21st International Conference on Distributed Computing Systems.

[56]  Joseph L. Hellerstein,et al.  Feedback control of a Lotus Notes server: modeling and control design , 2001, Proceedings of the 2001 American Control Conference. (Cat. No.01CH37148).

[57]  Harrick M. Vin,et al.  A hierarchial CPU scheduler for multimedia operating systems , 1996, OSDI '96.

[58]  Chenyang Lu,et al.  Schedulability analysis and utilization bounds for highly scalable real-time services , 2001, Proceedings Seventh IEEE Real-Time Technology and Applications Symposium.

[59]  John S. Heidemann,et al.  Application‐level differentiated services for Web servers , 1999, World Wide Web.