A Survey on Performance Management for Internet Applications

Internet applications have become indispensable for many business and personal processes, turning the performance of these applications into a key issue. For this reason, recent research has comprehensively explored mechanisms for managing the performance of these applications, with special focus on dealing with overload situations and providing QoS guarantees to clients. This paper makes a survey on the different proposals in the literature for managing Internet applications’ performance. We present a complete taxonomy that characterizes and classifies these proposals into several categories including request scheduling, admission control, service differentiation, dynamic resource management, service degradation, control theoretic approaches, works using queuing models, observation-based approaches that use runtime measurements, and overall approaches combining several mechanisms. For each work, we provide a brief description in order to provide the reader with a global understanding of the research progress in this area.

[1]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[2]  Thiemo Voigt,et al.  Kernel-based control of persistent web server connections , 2001, PERV.

[3]  Jordi Torres,et al.  Designing an overload control strategy for secure e-commerce applications , 2007, Comput. Networks.

[4]  Prashant J. Shenoy,et al.  Cataclysm: Scalable overload policing for internet applications , 2008, J. Netw. Comput. Appl..

[5]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[6]  Erich M. Nahum,et al.  A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[7]  Anand Sivasubramaniam,et al.  Xen and co.: communication-aware CPU scheduling for consolidated xen-based hosting platforms , 2007, VEE '07.

[8]  Prashant J. Shenoy,et al.  Sharc: managing CPU and network bandwidth in shared clusters , 2004, IEEE Transactions on Parallel and Distributed Systems.

[9]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[10]  Jordi Torres,et al.  Dynamic CPU provisioning for self-managed secure web applications in SMP hosting platforms , 2008, Comput. Networks.

[11]  Xiaoyun Zhu,et al.  An adaptive optimal controller for non-intrusive performance differentiation in computing services , 2005, 2005 International Conference on Control and Automation.

[12]  Prashant J. Shenoy,et al.  Resource overbooking and application profiling in a shared Internet hosting platform , 2009, TOIT.

[13]  Prashant J. Shenoy,et al.  Dynamic resource allocation for shared data centers using online measurements , 2003, IWQoS'03.

[14]  Tao Yang,et al.  Request-Aware Scheduling for Busy Internet Services , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[15]  Asser N. Tantawi,et al.  Dynamic placement for clustered web applications , 2006, WWW '06.

[16]  David E. Irwin,et al.  Dynamic virtual clusters in a grid site manager , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[17]  Vijay Karamcheti,et al.  Improving Performance of Internet Services Through Reward-Driven Request Prioritization , 2006, 200614th IEEE International Workshop on Quality of Service.

[18]  Virgílio A. F. Almeida,et al.  Business-oriented resource management policies for e-commerce servers , 2000, Perform. Evaluation.

[19]  Willy Zwaenepoel,et al.  Cluster reserves: a mechanism for resource management in cluster-based network servers , 2000, SIGMETRICS '00.

[20]  Nina Bhatti,et al.  Web server support for tiered services , 1999, IEEE Netw..

[21]  Jordi Torres,et al.  Differentiated Quality of Service for e-Commerce Applications through Connection Scheduling based on System-Level Thread Priorities , 2007, 15th EUROMICRO International Conference on Parallel, Distributed and Network-Based Processing (PDP'07).

[22]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[23]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[24]  Edward W. Knightly,et al.  Ensuring Latency Targets in Multiclass Web Servers , 2003, IEEE Trans. Parallel Distributed Syst..

[25]  Thiemo Voigt,et al.  Adaptive resource-based Web server admission control , 2002, Proceedings ISCC 2002 Seventh International Symposium on Computers and Communications.

[26]  Prasant Mohapatra,et al.  ACES: An efficient admission control scheme for QoS-aware web servers , 2003, Comput. Commun..

[27]  Dirk Beyer,et al.  Self-Adaptive SLA-Driven Capacity Management for Internet Services , 2006, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006.

[28]  S. Ranjan,et al.  QoS-driven server migration for Internet data centers , 2002, IEEE 2002 Tenth IEEE International Workshop on Quality of Service (Cat. No.02EX564).

[29]  Mor Harchol-Balter,et al.  Size-based scheduling to improve web performance , 2003, TOCS.

[30]  Prashant J. Shenoy,et al.  Agile dynamic provisioning of multi-tier Internet applications , 2008, TAAS.

[31]  Noel De Palma,et al.  Autonomic Management of Clustered Applications , 2006, 2006 IEEE International Conference on Cluster Computing.

[32]  Jing Xu,et al.  On the Use of Fuzzy Modeling in Virtualized Data Center Management , 2007, Fourth International Conference on Autonomic Computing (ICAC'07).

[33]  Krishna Kant,et al.  Overload Control Mechanisms for Web Servers , 2001 .

[34]  Prasant Mohapatra,et al.  Overload control in QoS-aware web servers , 2003, Comput. Networks.

[35]  Mor Harchol-Balter,et al.  Web servers under overload: How scheduling can help , 2006, TOIT.

[36]  Akshat Verma,et al.  On admission control for profit maximization of networked service providers , 2003, WWW '03.

[37]  Tarek F. Abdelzaher,et al.  Web Content Adaptation to Improve Server Overload Behavior , 1999, Comput. Networks.

[38]  Xiaoyun Zhu,et al.  Adaptive entitlement control of resource containers on shared servers , 2005, 2005 9th IFIP/IEEE International Symposium on Integrated Network Management, 2005. IM 2005..