Flexible Distributed Capacity Allocation and Load Redirect Algorithms for Cloud Systems

In Cloud computing systems, resource management is one of the main issues. Indeed, in any time instant resources have to be allocated to handle effectively workload fluctuations, while providing Quality of Service (QoS) guarantees to the end users. In such systems, workload prediction-based autonomic computing techniques have been developed. In this paper we propose capacity allocation techniques able to coordinate multiple distributed resource controllers working in geographically distributed cloud sites. Furthermore, capacity allocation solutions are integrated with a load redirection mechanism which forwards incoming requests between different domains. The overall goal is to minimize the costs of the allocated virtual machine instances, while guaranteeing QoS constraints expressed as a threshold on the average response time. We compare multiple heuristics which integrate workload prediction and distributed non-linear optimization techniques. Experimental results show how our solutions significantly improve other heuristics proposed in the literature (5-35% on average), without introducing significant QoS violations.

[1]  M. L. Shone,et al.  Exponential Smoothing with an Adaptive Response Rate , 1967 .

[2]  Johannes Ledolter,et al.  Statistical methods for forecasting , 1983 .

[3]  Everette S. Gardner,et al.  Exponential smoothing: The state of the art , 1985 .

[4]  Dimitri P. Bertsekas,et al.  Nonlinear Programming , 1997 .

[5]  M. Crovella,et al.  Heavy-tailed probability distributions in the World Wide Web , 1998 .

[6]  Mikko H. Lipasti,et al.  An architectural evaluation of Java TPC-W , 2001, Proceedings HPCA Seventh International Symposium on High-Performance Computer Architecture.

[7]  Jerome A. Rolia,et al.  Characterizing the scalability of a large web-based shopping system , 2001, ACM Trans. Internet Techn..

[8]  Balachander Krishnamurthy,et al.  Flash crowds and denial of service attacks: characterization and implications for CDNs and web sites , 2002, WWW.

[9]  Michael A. Saunders,et al.  SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization , 2002, SIAM J. Optim..

[10]  Ludmila Cherkasova,et al.  Session-Based Admission Control: A Mechanism for Peak Load Management of Commercial Web Sites , 2002, IEEE Trans. Computers.

[11]  Pascal Felber,et al.  Proactive hot spot avoidance for Web server dependability , 2004, Proceedings of the 23rd IEEE International Symposium on Reliable Distributed Systems, 2004..

[12]  Daniel A. Menascé,et al.  Resource Allocation for Autonomic Data Centers using Analytic Performance Models , 2005, Second International Conference on Autonomic Computing (ICAC'05).

[13]  Gunter Bolch,et al.  Queueing Networks and Markov Chains , 2005 .

[14]  G. Pierre,et al.  Predictability of Web-server traffic congestion , 2005, 10th International Workshop on Web Content Caching and Distribution (WCW'05).

[15]  Daniel Pérez Palomar,et al.  A tutorial on decomposition methods for network utility maximization , 2006, IEEE Journal on Selected Areas in Communications.

[16]  Asser N. Tantawi,et al.  Analytic modeling of multitier Internet applications , 2007, TWEB.

[17]  Cathy H. Xia,et al.  Load shedding and distributed resource control of stream processing networks , 2007, Perform. Evaluation.

[18]  Asser N. Tantawi,et al.  CPU demand for web serving: Measurement analysis and dynamic estimation , 2008, Perform. Evaluation.

[19]  Xiaoyun Zhu,et al.  1000 islands: an integrated approach to resource management for virtualized data centers , 2009, Cluster Computing.

[20]  Michele Colajanni,et al.  Autonomic Request Management Algorithms for Geographically Distributed Internet-Based Systems , 2008, 2008 Second IEEE International Conference on Self-Adaptive and Self-Organizing Systems.

[21]  Marios D. Dikaiakos,et al.  Cloud Computing: Distributed Internet Computing for IT and Scientific Research , 2009, IEEE Internet Computing.

[22]  Hakan Erdogmus,et al.  Cloud Computing: Does Nirvana Hide behind the Nebula? , 2009, IEEE Softw..

[23]  Michele Colajanni,et al.  On the Selection of Models for Runtime Prediction of System Resources , 2010 .

[24]  Gerhard Meixner,et al.  TwoSpot: A Cloud Platform for Scaling Out Web Applications Dynamically , 2010, ServiceWave.

[25]  Barbara Panicucci,et al.  Energy-Aware Autonomic Resource Allocation in Multitier Virtualized Environments , 2012, IEEE Transactions on Services Computing.