A KCCA-based Approach for Performance Modeling of Web Application in Clouds

Cloud computing which services in a pay-as-you-go manner to the users transparently, is a newly execution environment. However, the complex and dynamic relationships between application workload and performance and resource consumptions make it difficult to manage the resources in the cloud. Performance modeling provides the ability to predict the performance of application and resource consumptions. In this paper, we describe a dimensionality reduction algorithm called Kernel Canonical Correlation Analysis (KCCA) to predict the performance and resource consumptions of web application in the cloud computing environment. This approach provides much more expressiveness in capturing similarity and its correlations can then be used to quantify performance similarity. With KCCA, we build a single model for multiple performance metrics and reduce the dimensionality of both the workload and performance. Then we detail the effectiveness of the approach using example data gathered from production environment. The performance evaluation results demonstrate that our methodology could effectively predict the performance of web applications.

[1]  Asser N. Tantawi,et al.  Performance management for cluster based Web services , 2003 .

[2]  Prashant Shenoy,et al.  Cataclysm: Handling Extreme Overloads in Internet Services , 2004, PODC 2004.

[3]  Asser N. Tantawi,et al.  An analytical model for multi-tier internet services and its applications , 2005, SIGMETRICS '05.

[4]  Christos Faloutsos,et al.  Storage device performance prediction with CART models , 2004, The IEEE Computer Society's 12th Annual International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunications Systems, 2004. (MASCOTS 2004). Proceedings..

[5]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2004 .

[6]  Qi Zhang,et al.  R-Capriccio: A Capacity Planning and Anomaly Detection Tool for Enterprise Services with Live Workloads , 2007, Middleware.

[7]  Erich M. Nahum,et al.  Yaksha: a self-tuning controller for managing the performance of 3-tiered Web sites , 2004, Twelfth IEEE International Workshop on Quality of Service, 2004. IWQOS 2004..

[8]  Armando Fox,et al.  Capturing, indexing, clustering, and retrieving system history , 2005, SOSP '05.

[9]  Yinong Chen,et al.  Virtualization-based autonomic resource management for multi-tier Web applications in shared data center , 2008, J. Syst. Softw..

[10]  John Shawe-Taylor,et al.  Canonical Correlation Analysis: An Overview with Application to Learning Methods , 2004, Neural Computation.

[11]  S. Islam,et al.  Nonlinear dynamics of hourly ozone concentrations. nonparametric short term prediction , 1998 .

[12]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[13]  Michael I. Jordan,et al.  Statistical Machine Learning Makes Automatic Control Practical for Internet Datacenters , 2009, HotCloud.

[14]  Prashant J. Shenoy,et al.  Brief announcement: Cataclysm: handling extreme overloads in internet services , 2004, PODC '04.