Decentralized cloud datacenter reconsolidation through emergent and topology-aware behavior

Consolidation of multiple applications on a single Physical Machine (PM) within acloud data center can increase utilization, minimize energy consumption, and reduceoperational costs. However, these benefits comes at the cost of increasing the complex-ity of the scheduling problem.In this paper, we present a topology-aware resource management framework. Aspart of this framework, we introduce a Reconsolidating PlaceMent scheduler (RPM)that provides and maintains durable allocations with low maintenance costs for datacenters with dynamic workloads. We focus on workloads featuring both short-livedbatch jobs and latency-sensitive services such as interactive web applications. Thescheduler assigns resources to Virtual Machines (VMs) and maintains packing effi-ciency while taking into account migration costs, topological constraints, and the riskof resource contention, as well as the variability of the background load and its com-plementarity to the new VM.We evaluate the model by simulating a data center with over 65000 PMs, structuredas a three-level multi-rooted tree topology. We investigate trade-offs between factorsthat affect the durability and operational cost of maintaining a near-optimal packing.The results show that the proposed scheduler can scale to the number of PMs in thesimulation and maintain efficient utilization with low migration costs.

[1]  Martin Bichler,et al.  A Mathematical Programming Approach for Server Consolidation Problems in Virtualized Data Centers , 2010, IEEE Transactions on Services Computing.

[2]  Jordi Torres Viñals,et al.  An integer linear programming representation for data-center power-aware management , 2010 .

[3]  Erik Elmroth,et al.  Divide the Task, Multiply the Outcome: Cooperative VM Consolidation , 2014, 2014 IEEE 6th International Conference on Cloud Computing Technology and Science.

[4]  Ramesh K. Sitaraman,et al.  The power of two random choices: a survey of tech-niques and results , 2001 .

[5]  Rajkumar Buyya,et al.  Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing , 2012, Future Gener. Comput. Syst..

[6]  Brenda S. Baker,et al.  A New Proof for the First-Fit Decreasing Bin-Packing Algorithm , 1985, J. Algorithms.

[7]  Rolf Stadler,et al.  Allocating Compute and Network Resources Under Management Objectives in Large-Scale Clouds , 2013, Journal of Network and Systems Management.

[8]  Patrick Wendell,et al.  Sparrow: distributed, low latency scheduling , 2013, SOSP.

[9]  Haifeng Chen,et al.  Network-aware coordination of virtual machine migrations in enterprise data centers and clouds , 2013, 2013 IFIP/IEEE International Symposium on Integrated Network Management (IM 2013).

[10]  Akshat Verma,et al.  pMapper: Power and Migration Cost Aware Application Placement in Virtualized Systems , 2008, Middleware.

[11]  César A. F. De Rose,et al.  Server consolidation with migration control for virtualized data centers , 2011, Future Gener. Comput. Syst..

[12]  Michael Abd-El-Malek,et al.  Omega: flexible, scalable schedulers for large compute clusters , 2013, EuroSys '13.

[13]  Johan Tordsson,et al.  Virtual Machine Placement for Predictable and Time-Constrained Peak Loads , 2011, GECON.

[14]  Rina Panigrahy,et al.  Validating Heuristics for Virtual Machines Consolidation , 2011 .

[15]  Calton Pu,et al.  Mistral: Dynamically Managing Power, Performance, and Adaptation Cost in Cloud Infrastructures , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems.

[16]  Petter Svärd,et al.  Principles and Performance Characteristics of Algorithms for Live VM Migration , 2015, OPSR.

[17]  Joseph Naor,et al.  Topology-Aware VM Migration in Bandwidth Oversubscribed Datacenter Networks , 2012, ICALP.

[18]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[19]  P. Cochat,et al.  Et al , 2008, Archives de pediatrie : organe officiel de la Societe francaise de pediatrie.

[20]  Erik Elmroth,et al.  Autonomic Resource Allocation for Cloud Data Centers: A Peer to Peer Approach , 2014, 2014 International Conference on Cloud and Autonomic Computing.

[21]  Arun Venkataramani,et al.  Sandpiper: Black-box and gray-box resource management for virtual machines , 2009, Comput. Networks.

[22]  Petter Svärd,et al.  Continuous Datacenter Consolidation , 2015, 2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom).

[23]  Albert G. Greenberg,et al.  The cost of a cloud: research problems in data center networks , 2008, CCRV.

[24]  Vijay Mann,et al.  Remedy: Network-Aware Steady State VM Management for Data Centers , 2012, Networking.

[25]  Márk Jelasity,et al.  PeerSim: A scalable P2P simulator , 2009, 2009 IEEE Ninth International Conference on Peer-to-Peer Computing.

[26]  Djamal Zeghlache,et al.  Energy Efficient VM Scheduling for Cloud Data Centers: Exact Allocation and Migration Algorithms , 2013, 2013 13th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing.

[27]  Johan Tordsson,et al.  Cloudy with a Chance of Load Spikes: Admission Control with Fuzzy Risk Assessments , 2013, 2013 IEEE/ACM 6th International Conference on Utility and Cloud Computing.

[28]  Akshat Verma,et al.  Virtual machine consolidation in the wild , 2014, Middleware.

[29]  Abhishek Verma,et al.  Evaluating job packing in warehouse-scale computing , 2014, 2014 IEEE International Conference on Cluster Computing (CLUSTER).

[30]  Randy H. Katz,et al.  Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[31]  Fabio Panzieri,et al.  Server consolidation in Clouds through gossiping , 2011, 2011 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks.

[32]  Luiz André Barroso,et al.  The tail at scale , 2013, CACM.

[33]  Lucio Grandinetti,et al.  A multi-dimensional job scheduling , 2016, Future Gener. Comput. Syst..

[34]  Márk Jelasity,et al.  Large-Scale Newscast Computing on the Internet , 2002 .

[35]  Balaji Viswanathan,et al.  CloudMap: Workload-aware placement in private heterogeneous clouds , 2012, 2012 IEEE Network Operations and Management Symposium.

[36]  Christine Morin,et al.  Snooze: A Scalable and Autonomic Virtual Machine Management Framework for Private Clouds , 2012, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012).

[37]  Moustafa Ghanem,et al.  Improving Resource Utilisation in the Cloud Environment Using Multivariate Probabilistic Models , 2012, 2012 IEEE Fifth International Conference on Cloud Computing.