Prediction of performance anomalies in web-applications based-on software aging scenarios

The topic of this paper is about prediction of performance anomalies caused by software aging. We have developed a framework for detection of performance anomalies that is targeted to web and component-based applications. In this study, we selected some amount of historical data previously collected and we conducted a correlation analysis with this data. The resulting dataset was then submitted to some Machine-Learning (ML) classification algorithms. The best algorithms were selected according to the accuracy and precision. In a second step, we induced some synthetic aging scenarios (memory leaks and CPU contention) in the application and we tried to do estimation of the system parameters by using time-series analysis. With the estimated values we conducted a classification with the three previous ML algorithms. From the initial results we observed that combining the estimation of parameters supported by time-series models with ML classification techniques provides some good results on the prediction of performance anomalies. We also observed that there is no single ML algorithm that can be applied effectively to predict the response time for all the web-transactions.

[1]  Eibe Frank,et al.  Logistic Model Trees , 2003, Machine Learning.

[2]  P. Lachenbruch Statistical Power Analysis for the Behavioral Sciences (2nd ed.) , 1989 .

[3]  Kishor S. Trivedi,et al.  An approach for estimation of software aging in a Web server , 2002, Proceedings International Symposium on Empirical Software Engineering.

[4]  J. Rodgers,et al.  Thirteen ways to look at the correlation coefficient , 1988 .

[5]  Eric A. Brewer,et al.  Pinpoint: problem determination in large, dynamic Internet services , 2002, Proceedings International Conference on Dependable Systems and Networks.

[6]  Kishor S. Trivedi,et al.  A methodology for detection and estimation of software aging , 1998, Proceedings Ninth International Symposium on Software Reliability Engineering (Cat. No.98TB100257).

[7]  Jacob Cohen Statistical Power Analysis for the Behavioral Sciences , 1969, The SAGE Encyclopedia of Research Design.

[8]  Jordi Torres,et al.  Adaptive on-line software aging prediction based on machine learning , 2010, 2010 IEEE/IFIP International Conference on Dependable Systems & Networks (DSN).

[9]  Evgenia Smirni,et al.  Anomaly? application change? or workload change? towards automated detection of application performance anomaly and change , 2008, 2008 IEEE International Conference on Dependable Systems and Networks With FTCS and DCC (DSN).

[10]  Yennun Huang,et al.  Software rejuvenation: analysis, module and applications , 1995, Twenty-Fifth International Symposium on Fault-Tolerant Computing. Digest of Papers.

[11]  Luís Moura Silva,et al.  Using machine learning for non-intrusive modeling and prediction of software aging , 2008, NOMS 2008 - 2008 IEEE Network Operations and Management Symposium.

[12]  Kishor S. Trivedi,et al.  A measurement-based model for estimation of resource exhaustion in operational software systems , 1999, Proceedings 10th International Symposium on Software Reliability Engineering (Cat. No.PR00443).

[13]  George Candea,et al.  Automatic failure-path inference: a generic introspection technique for Internet applications , 2003, Proceedings the Third IEEE Workshop on Internet Applications. WIAPP 2003.

[14]  João Paulo Magalhães,et al.  Detection of Performance Anomalies in Web-Based Applications , 2010, 2010 Ninth IEEE International Symposium on Network Computing and Applications.

[15]  P. Young,et al.  Time series analysis, forecasting and control , 1972, IEEE Transactions on Automatic Control.

[16]  Ramnivas Laddad,et al.  Aspectj in Action: Practical Aspect-Oriented Programming , 2003 .

[17]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[18]  金田 重郎,et al.  C4.5: Programs for Machine Learning (書評) , 1995 .