Exploring Alternative Approaches to Implement an Elasticity Policy

An elasticity policy governs how and when resources (e.g., application server instances at the PaaS layer) are added to and/or removed from a cloud environment. The elasticity policy can be implemented as a conventional control loop or as a set of heuristic rules. In the control-theoretic approach, complex constructs such as tracking filters, estimators, regulators, and controllers are utilized. In the heuristic, rule-based approach, various alerts(e.g., events) are defined on instance metrics (e.g., CPU utilization), which are then aggregated at a global scale in order to make provisioning decisions for a given application tier. This work provides an overview of our experiences designing and working with both approaches to construct an auto scaler for simple applications. We enumerate different criteria such as design complexity, ease of comprehension, and maintenance upon which we form an informal comparison between the different methods. We conclude with a brief discussion of how these approaches can be used in the governance of resources to better meet a high-level goal over time.

[1]  Dorina C. Petriu Approximate Mean Value Analysis of Client-Server Systems with Multi-class Requests , 1994, SIGMETRICS.

[2]  Marin Litoiu A performance engineering method for web applications , 2010, 2010 12th IEEE International Symposium on Web Systems Evolution (WSE).

[3]  Kang G. Shin,et al.  Optimal Dynamic Control of Resources in a Distributed System , 1989, IEEE Transactions on Software Engineering.

[4]  Didier Donsez,et al.  Autonomic management of J2EE edge servers , 2005, MGC '05.

[5]  Marin Litoiu,et al.  Hierarchical Model-based Autonomic Control of Software Systems , 2005 .

[6]  Jeffrey O. Kephart,et al.  An artificial intelligence perspective on autonomic computing policies , 2004, Proceedings. Fifth IEEE International Workshop on Policies for Distributed Systems and Networks, 2004. POLICY 2004..

[7]  Chenyang Lu,et al.  Modeling and performance control of Internet servers , 2000, Proceedings of the 39th IEEE Conference on Decision and Control (Cat. No.00CH37187).

[8]  K. Shin,et al.  Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach , 2002, IEEE Trans. Parallel Distributed Syst..

[9]  Chenyang Lu,et al.  Feedback performance control in software services , 2003 .

[10]  Rafael Moreno-Vozmediano,et al.  Elastic management of cluster-based services in the cloud , 2009, ACDC '09.

[11]  J. L. Roux An Introduction to the Kalman Filter , 2003 .

[12]  Dorina C. Petriu,et al.  Approximate Mean Value Analysis based on Markov Chain Aggregation by Composition , 2004 .

[13]  G. Mastronardi,et al.  A soft computing approach to the intelligent control , 2006, 2006 4th IEEE International Conference on Industrial Informatics.

[14]  Jing Xu,et al.  Performance modeling and prediction of enterprise JavaBeans with layered queuing network templates , 2006, ACM SIGSOFT Softw. Eng. Notes.

[15]  J. Hayes,et al.  Self-optimization in computer systems via on-line control: application to power management , 2004, International Conference on Autonomic Computing, 2004. Proceedings..

[16]  Nagarajan Kandasamy,et al.  A control-based framework for self-managing distributed computing systems , 2004, WOSS '04.

[17]  Eyke Hüllermeier,et al.  On the representation of fuzzy rules in terms of crisp rules , 2003, Inf. Sci..

[18]  Nagarajan Kandasamy,et al.  Enabling Self-Managing Applications using Model-based Online Control Strategies , 2006, 2006 IEEE International Conference on Autonomic Computing.

[19]  Steven Hand,et al.  Self-adaptive and self-configured CPU resource provisioning for virtualized servers using Kalman filters , 2009, ICAC '09.

[20]  Ying Zhang,et al.  Integrating Resource Consumption and Allocation for Infrastructure Resources on-Demand , 2010, 2010 IEEE 3rd International Conference on Cloud Computing.

[21]  Jerome A. Rolia,et al.  The Method of Layers , 1995, IEEE Trans. Software Eng..

[22]  Sridhar Ramesh,et al.  A multi-layer client-server queueing network model with synchronous and asynchronous messages , 1998, WOSP '98.

[23]  Prashant Pandey,et al.  Cloud computing , 2010, ICWET.

[24]  Nagarajan Kandasamy,et al.  Self-optimization in computer systems via on-line control: application to power management , 2004 .

[25]  Hanan Lutfiyya,et al.  A Policy-Based Framework for Managing Data Centers , 2006, 2006 IEEE/IFIP Network Operations and Management Symposium NOMS 2006.

[26]  Rajkumar Buyya,et al.  Article in Press Future Generation Computer Systems ( ) – Future Generation Computer Systems Cloud Computing and Emerging It Platforms: Vision, Hype, and Reality for Delivering Computing as the 5th Utility , 2022 .

[27]  Kevin Skadron,et al.  Power-aware QoS management in Web servers , 2003, RTSS 2003. 24th IEEE Real-Time Systems Symposium, 2003.

[28]  Zongwei Luo,et al.  Layered queueing models for enterprise JavaBean applications , 2001, Proceedings Fifth IEEE International Enterprise Distributed Object Computing Conference.

[29]  Lui Sha,et al.  Feedback control with queueing-theoretic prediction for relative delay guarantees in web servers , 2003, The 9th IEEE Real-Time and Embedded Technology and Applications Symposium, 2003. Proceedings..

[30]  Randy H. Katz,et al.  Above the Clouds: A Berkeley View of Cloud Computing , 2009 .

[31]  Marin Litoiu,et al.  A performance analysis method for autonomic computing systems , 2007, TAAS.

[32]  Mohamed Adel Serhani,et al.  A queuing model for service selection of multi-classes QoS-aware Web services , 2005, Third European Conference on Web Services (ECOWS'05).

[33]  Joseph L. Hellerstein,et al.  Using Control Theory to Achieve Service Level Objectives In Performance Management , 2002, Real-Time Systems.

[34]  Ajay Mohindra,et al.  Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment , 2009, 2009 IEEE International Conference on e-Business Engineering.

[35]  Martin Arlitt,et al.  A workload characterization study of the 1998 World Cup Web site , 2000, IEEE Netw..

[36]  Kevin Skadron,et al.  Control-theoretic dynamic frequency and voltage scaling for multimedia workloads , 2002, CASES '02.

[37]  Muli Ben-Yehuda,et al.  The Reservoir model and architecture for open federated cloud computing , 2009, IBM J. Res. Dev..

[38]  Hanan Lutfiyya,et al.  Policies, grids and autonomic computing , 2005, ACM SIGSOFT Softw. Eng. Notes.

[39]  Filipe Marques,et al.  SLA Design from a Business Perspective , 2005, DSOM.

[40]  Hanan Lutfiyya,et al.  Strategy-Trees: A Feedback Based Approach to Policy Management , 2008, MACE.

[41]  Marin Litoiu,et al.  Tracking time-varying parameters in software systems with extended Kalman filters , 2015, CASCON.

[42]  Paris Flegkas,et al.  A methodological approach toward the refinement problem in policy-based management systems , 2006 .

[43]  Chenyang Lu,et al.  Proceedings of the Fast 2002 Conference on File and Storage Technologies Aqueduct: Online Data Migration with Performance Guarantees , 2022 .

[44]  Yixin Diao,et al.  Feedback Control of Computing Systems , 2004 .