Identifying Root Causes of Web Performance Degradation Using Changepoint Analysis

The large scale of the Internet has offered unique economic opportunities, that in turn introduce overwhelming challenges for development and operations to provide reliable and fast services in order to meet the high demands on the performance of online services. In this paper, we investigate how performance engineers can identify three different classes of externally-visible performance problems (global delays, partial delays, periodic delays) from concrete traces. We develop a simulation model based on a taxonomy of root causes in server performance degradation. Within an experimental setup, we obtain results through synthetic monitoring of a target Web service, and observe changes in Web performance over time through exploratory visual analysis and changepoint detection. Finally, we interpret our findings and discuss various challenges and pitfalls.

[1]  Leszek Borzemski,et al.  USING DATA MINING ALGORITHMS IN WEB PERFORMANCE PREDICTION , 2009, Cybern. Syst..

[2]  Leszek Borzemski,et al.  Knowledge Discovery about Web Performance with Geostatistical Turning Bands Method , 2011, KES.

[3]  João Paulo Magalhães,et al.  Anomaly Detection Techniques for Web-Based Applications: An Experimental Study , 2012, 2012 IEEE 11th International Symposium on Network Computing and Applications.

[4]  Ratul Mahajan,et al.  A provider-side view of web search response time , 2013, SIGCOMM.

[5]  Leszek Borzemski,et al.  THE EXPERIMENTAL DESIGN FOR DATA MINING TO DISCOVER WEB PERFORMANCE ISSUES IN A WIDE AREA NETWORK , 2010, Cybern. Syst..

[6]  Marcos K. Aguilera,et al.  Performance debugging for distributed systems of black boxes , 2003, SOSP '03.

[7]  Leszek Borzemski,et al.  Time Series Forecasting of Web Performance Data Monitored by MWING Multiagent Distributed System , 2010, ICCCI.

[8]  Tadeusz M. Szuba,et al.  Computational Collective Intelligence , 2001, Lecture Notes in Computer Science.

[9]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[10]  Monika Richter Scheduling And Load Balancing In Parallel And Distributed Systems , 2016 .

[11]  Zhen Liu,et al.  Traffic model and performance evaluation of Web servers , 2001, Perform. Evaluation.

[12]  Allan Kuchinsky,et al.  Quality is in the eye of the beholder: meeting users' requirements for Internet quality of service , 2000, CHI.

[13]  Takuji Nishimura,et al.  Mersenne twister: a 623-dimensionally equidistributed uniform pseudo-random number generator , 1998, TOMC.

[14]  Evgenia Smirni,et al.  Automated anomaly detection and performance modeling of enterprise applications , 2009, TOCS.

[15]  Schahram Dustdar,et al.  Data-driven and automated prediction of service level agreement violations in service compositions , 2013, Distributed and Parallel Databases.

[16]  Leszek Borzemski,et al.  Spatio-temporal Web Performance Forecasting with Sequential Gaussian Simulation Method , 2012, CN.

[17]  Lakhmi C. Jain,et al.  Knowledge-Based Intelligent Information and Engineering Systems , 2004, Lecture Notes in Computer Science.

[18]  Douglas C. Montgomery,et al.  Using common random numbers in simulation experiments — an approach to statistical analysis , 1976 .

[19]  Ahmed E. Hassan,et al.  Automated detection of performance regressions using statistical process control techniques , 2012, ICPE '12.

[20]  Eric A. Brewer,et al.  Pinpoint: problem determination in large, dynamic Internet services , 2002, Proceedings International Conference on Dependable Systems and Networks.

[21]  Peter Dalgaard,et al.  R Development Core Team (2010): R: A language and environment for statistical computing , 2010 .

[22]  Leszek Borzemski,et al.  Knowledge Engineering Relating to Spatial Web Performance Forecasting with Sequential Gaussian Simulation Method , 2012, KES.

[23]  P. Jaccard THE DISTRIBUTION OF THE FLORA IN THE ALPINE ZONE.1 , 1912 .

[24]  Jakob Nielsen,et al.  Speed Up Your Site: Web Site Optimization , 2003 .

[25]  Armando Fox,et al.  Capturing, indexing, clustering, and retrieving system history , 2005, SOSP '05.