Analytic modeling of multitier Internet applications

Since many Internet applications employ a multitier architecture, in this article, we focus on the problem of analytically modeling the behavior of such applications. We present a model based on a network of queues where the queues represent different tiers of the application. Our model is sufficiently general to capture (i) the behavior of tiers with significantly different performance characteristics and (ii) application idiosyncrasies such as session-based workloads, tier replication, load imbalances across replicas, and caching at intermediate tiers. We validate our model using real multitier applications running on a Linux server cluster. Our experiments indicate that our model faithfully captures the performance of these applications for a number of workloads and configurations. Furthermore, our model successfully handles a comprehensive range of resource utilization---from 0 to near saturation for the CPU---for two separate tiers. For a variety of scenarios, including those with caching at one of the application tiers, the average response times predicted by our model were within the 95% confidence intervals of the observed average response times. Our experiments also demonstrate the utility of the model for dynamic capacity provisioning, performance prediction, bottleneck identification, and session policing. In one scenario, where the request arrival rate increased from less than 1500 to nearly 4200 requests/minute, a dynamic provisioning technique employing our model was able to maintain response time targets by increasing the capacity of two of the tiers by factors of 2 and 3.5, respectively.

[1]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[2]  Louis P. Slothouber,et al.  A Model of Web Server Performance , 1996 .

[3]  Sneha Kumar Kasera,et al.  Robust multiclass signaling overload control , 2005, 13TH IEEE International Conference on Network Protocols (ICNP'05).

[4]  Prashant J. Shenoy,et al.  Dynamic Provisioning of Multi-tier Internet Applications , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[5]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[6]  Jeffrey S. Chase,et al.  Balance of Power: Energy Management for Server Clusters , 2001 .

[7]  J. Rolia,et al.  Adaptive Internet Data Centers , 2000 .

[8]  Anand Sivasubramaniam,et al.  Managing server energy and operational costs in hosting centers , 2005, SIGMETRICS '05.

[9]  Jeffrey S. Chase,et al.  Energy management for server clusters , 2001, Proceedings Eighth Workshop on Hot Topics in Operating Systems.

[10]  Zongwei Luo,et al.  Layered queueing models for enterprise JavaBean applications , 2001, Proceedings Fifth IEEE International Enterprise Distributed Object Computing Conference.

[11]  Prashant J. Shenoy,et al.  Dynamic resource allocation for shared data centers using online measurements , 2003, IWQoS'03.

[12]  Akshat Verma,et al.  On admission control for profit maximization of networked service providers , 2003, WWW '03.

[13]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[14]  Abhishek Chandra,et al.  Quantifying the Benefits of Resource Multiplexing in On-Demand Data Centers , 2003 .

[15]  Erich M. Nahum,et al.  Yaksha: a self-tuning controller for managing the performance of 3-tiered Web sites , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[16]  Tarek F. Abdelzaher,et al.  Web Content Adaptation to Improve Server Overload Behavior , 1999, Comput. Networks.

[17]  Prashant J. Shenoy,et al.  Agile dynamic provisioning of multi-tier Internet applications , 2008, TAAS.

[18]  Prashant J. Shenoy,et al.  Resource overbooking and application profiling in shared hosting platforms , 2002, OSDI '02.

[19]  Stephen S. Lavenberg,et al.  Mean-Value Analysis of Closed Multichain Queuing Networks , 1980, JACM.

[20]  Amin Vahdat,et al.  Managing energy and server resources in hosting centers , 2001, SOSP.

[21]  Xiaoyun Zhu,et al.  Optimal resource assignment in Internet data centers , 2001, MASCOTS 2001, Proceedings Ninth International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems.

[22]  C. Murray Woodside,et al.  General Bypass Architecture for High-Performance Distributed Applications , 1995, Data Communications and their Performance.

[23]  S. Ranjan,et al.  QoS-driven server migration for Internet data centers , 2002, IEEE 2002 Tenth IEEE International Workshop on Quality of Service (Cat. No.02EX564).

[24]  Chris I. Dalton,et al.  SoftUDC: a software-based data center for utility computing , 2004, Computer.

[25]  Cheng-Zhong Xu,et al.  Decay function model for resource configuration and adaptive allocation on Internet servers , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[26]  Joseph L. Hellerstein,et al.  An approach to predictive detection for service management , 1999, Integrated Network Management VI. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on Integrated Network Management. (Cat. No.99EX302).

[27]  Martin Arlitt,et al.  Workload Characterization of the 1998 World Cup Web Site , 1999 .

[28]  Edward W. Knightly,et al.  Multi-class latency-bounded Web services , 2000, 2000 Eighth International Workshop on Quality of Service. IWQoS 2000 (Cat. No.00EX400).

[29]  Prashant J. Shenoy,et al.  Brief announcement: Cataclysm: handling extreme overloads in internet services , 2004, PODC '04.

[30]  Asser N. Tantawi,et al.  Performance management for cluster-based web services , 2005, IEEE Journal on Selected Areas in Communications.

[31]  Benny Rochwerger,et al.  Oceano-SLA based management of a computing utility , 2001, 2001 IEEE/IFIP International Symposium on Integrated Network Management Proceedings. Integrated Network Management VII. Integrated Management Strategies for the New Millennium (Cat. No.01EX470).

[32]  Jeffrey S. Chase,et al.  Correlating Instrumentation Data to System States: A Building Block for Automated Diagnosis and Control , 2004, OSDI.

[33]  Jerome A. Rolia,et al.  The Method of Layers , 1995, IEEE Trans. Software Eng..

[34]  Prashant J. Shenoy,et al.  Dynamic resource allocation for shared data centers using online measurements , 2003, IWQoS'03.

[35]  Eric A. Brewer,et al.  Cluster-based scalable network services , 1997, SOSP.

[36]  Jing Xu,et al.  Performance modeling and prediction of enterprise JavaBeans with layered queuing network templates , 2006, ACM SIGSOFT Softw. Eng. Notes.

[37]  Erich M. Nahum,et al.  A method for transparent admission control and request scheduling in e-commerce web sites , 2004, WWW '04.

[38]  Edward D. Lazowska,et al.  Quantitative system performance - computer system analysis using queueing network models , 1983, Int. CMG Conference.

[39]  Dan Rubenstein,et al.  Provisioning servers in the application tier for e-commerce systems , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[40]  Prashant Shenoy,et al.  Cataclysm: Handling Extreme Overloads in Internet Services , 2004, PODC 2004.

[41]  Samuel Kounev,et al.  Performance Modeling and Evaluation of Large-Scale J2EE Applications , 2003, Int. CMG Conference.

[42]  Wei Jin,et al.  USENIX Association Proceedings of USITS ’ 03 : 4 th USENIX Symposium on Internet Technologies and Systems , 2003 .

[43]  Mor Harchol-Balter,et al.  Web servers under overload: How scheduling can help , 2006, TOIT.

[44]  Virgílio A. F. Almeida,et al.  Performance by Design - Computer Capacity Planning By Example , 2004 .

[45]  Anand Sivasubramaniam,et al.  Pricing-based strategies for autonomic control of web servers for time-varying request arrivals , 2004, Eng. Appl. Artif. Intell..

[46]  Yixin Diao,et al.  Dynamic Surge Protection: An Approach to Handling Unexpected Workload Surges with Resource Actions that Have Lead Times , 2003, DSOM.

[47]  Daniel A. Menascé Web Server Software Architectures , 2003, IEEE Internet Comput..

[48]  Jing Xu,et al.  Performance modeling and prediction of enterprise JavaBeans with layered queuing network templates , 2006 .

[49]  Sugih Jamin,et al.  A measurement-based admission-controlled Web server , 2000, Proceedings IEEE INFOCOM 2000. Conference on Computer Communications. Nineteenth Annual Joint Conference of the IEEE Computer and Communications Societies (Cat. No.00CH37064).

[50]  Krishna Kant,et al.  Overload Control Mechanisms for Web Servers , 2001 .

[51]  Anand Sivasubramaniam,et al.  Predicting Web Cache Behavior using Stochastic State-Space Models , 2008, PDPTA.

[52]  Edward D. Lazowska,et al.  Quantitative System Performance , 1985, Int. CMG Conference.

[53]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[54]  Douglas M. Freimuth,et al.  Kernel Mechanisms for Service Differentiation in Overloaded Web Servers , 2001, USENIX Annual Technical Conference, General Track.

[55]  Mor Harchol-Balter,et al.  Web servers under overload: How scheduling can help , 2006, TOIT.

[56]  David E. Culler,et al.  USENIX Association Proceedings of USITS ’ 03 : 4 th USENIX Symposium on Internet Technologies and Systems , 2003 .

[57]  Prashant Shenoy,et al.  Dynamic resource management in internet hosting platforms , 2005 .

[58]  C. Murray Woodside,et al.  Performance analysis of distributed server systems , 2000 .

[59]  Asser N. Tantawi,et al.  Performance management for cluster based Web services , 2003 .

[60]  Christopher Stewart,et al.  Exploiting nonstationarity for performance prediction , 2007, EuroSys '07.