Performance model driven QoS guarantees and optimization in clouds

This paper presents a method for optimizing clouds by using performance models throughout the development, deployment, and operation of the applications that run in them. We describe the architecture of the cloud, the services it offers to support optimization, and the methodology developers follow to enable runtime optimization. We present an optimization algorithm that accommodates different goals, different scopes and timescales of optimization actions, and different control algorithms. Here, the optimization maximizes profit in the cloud, subject to QoS and SLA constraints, across a wide variety of workloads.
