Cost-Efficient Consolidating Service for Aliyun’s Cloud-Scale Computing

Server consolidation is critical for energy efficiency of cloud-scale computing. In production environments like Aliyun, which is one of the largest public cloud platforms in the world, server consolidation has several challenges. First, the widespread use of local storage remarkably increases the migration cost (time). Second, the resource utilization of service instances varies over time, which may result in migration oscillation. Third, server consolidation must follow practical constraints. E.g., instances can only be migrated within a maintenance window, and both the resource utilization and the number of instances on a server are bounded. This paper designs and implements C4, a Cost-Efficient Consolidating Service for Aliyun’s Cloud-Scale Computing, which Aliyun uses to consolidate servers. We analyze user pattern, resource utilization, and migration cost in Aliyun, showing that traditional utilization-based consolidation approaches cannot meet the needs in production environments, especially for the local-storage-based computing. This motivates us to propose the migration cost model, by which to select servers with the minimum migration time to release. We use the Worst-Fit heuristic to migrate instances to balance the load. Evaluation shows that C4 achieves cost-efficient, load-balanced, and oscillation-free consolidating service. We describe experience with over one year of C4 production deployment, lessons learned, and areas for future work.

[1]  Meng Wang,et al.  Consolidating virtual machines with dynamic bandwidth demand in data centers , 2011, 2011 Proceedings IEEE INFOCOM.

[2]  Rina Panigrahy,et al.  Heuristics for Vector Bin Packing , 2011 .

[3]  Jingfei Jiang,et al.  Efficient Resources Provisioning Based on Load Forecasting in Cloud , 2014, TheScientificWorldJournal.

[4]  Ning Ding,et al.  The only constant is change: incorporating time-varying network reservations in data centers , 2012, SIGCOMM.

[5]  Umesh Deshpande,et al.  Traffic-Sensitive Live Migration of Virtual Machines , 2015, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[6]  Randy H. Katz,et al.  Heterogeneity and dynamicity of clouds at scale: Google trace analysis , 2012, SoCC '12.

[7]  Kartik Gopalan,et al.  Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning , 2009, VEE '09.

[8]  Carlo Curino,et al.  Relational Cloud: a Database Service for the cloud , 2011, CIDR.

[9]  Rajkumar Buyya,et al.  Cost of Virtual Machine Live Migration in Clouds: A Performance Evaluation , 2009, CloudCom.

[10]  Prashant J. Shenoy,et al.  "Cut me some slack": latency-aware live migration for databases , 2012, EDBT '12.

[11]  Haiying Shen,et al.  RIAL: Resource Intensity Aware Load balancing in clouds , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[12]  J. B. G. Frenk,et al.  On the multidimensional vector bin packing , 1990, Acta Cybern..

[13]  Yasuhiro Fujiwara,et al.  Madeus: Database Live Migration Middleware under Heavy Workloads for Cloud Environment , 2015, SIGMOD Conference.

[14]  Rajkumar Buyya,et al.  Optimal online deterministic algorithms and adaptive heuristics for energy and performance efficient dynamic consolidation of virtual machines in Cloud data centers , 2012, Concurr. Comput. Pract. Exp..

[15]  Werner Vogels,et al.  Beyond Server Consolidation , 2008, ACM Queue.

[16]  Divyakant Agrawal,et al.  Zephyr: live migration in shared nothing databases for elastic cloud platforms , 2011, SIGMOD '11.

[17]  Carlo Curino,et al.  Workload-aware database monitoring and consolidation , 2011, SIGMOD '11.

[18]  Divyakant Agrawal,et al.  Albatross: Lightweight Elasticity in Shared Storage Databases for the Cloud using Live Data Migration , 2011, Proc. VLDB Endow..

[19]  Fabio Panzieri,et al.  Server consolidation in Clouds through gossiping , 2011, 2011 IEEE International Symposium on a World of Wireless, Mobile and Multimedia Networks.

[20]  Haiying Shen,et al.  Consolidating complementary VMs with spatial/temporal-awareness in cloud datacenters , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[21]  米海波 Storage-aware Server Consolidation for Cloud Services Utilizing Local Storage , 2015 .

[22]  Sangyoon Oh,et al.  Sercon: Server Consolidation Algorithm using Live Migration of Virtual Machines for Green Computing , 2011 .

[23]  Suman Nath,et al.  Energy-Aware Server Provisioning and Load Dispatching for Connection-Intensive Internet Services , 2008, NSDI.

[24]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[25]  Martin Bichler,et al.  A Mathematical Programming Approach for Server Consolidation Problems in Virtualized Data Centers , 2010, IEEE Transactions on Services Computing.

[26]  Chuang Lin,et al.  Delay guaranteed live migration of Virtual Machines , 2014, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[27]  Rajkumar Buyya,et al.  Energy-aware resource allocation heuristics for efficient management of data centers for Cloud computing , 2012, Future Gener. Comput. Syst..

[28]  Jeffrey O. Kephart,et al.  Runtime Demand Estimation for effective dynamic resource management , 2010, 2010 IEEE Network Operations and Management Symposium - NOMS 2010.

[29]  Jingfei Jiang,et al.  G2LC: Resources Autoscaling for Real Time Bioinformatics Applications in IaaS , 2015, Comput. Math. Methods Medicine.

[30]  Pangfeng Liu,et al.  Server Consolidation Algorithms with Bounded Migration Cost and Performance Guarantees in Cloud Computing , 2011, 2011 Fourth IEEE International Conference on Utility and Cloud Computing.

[31]  Paolo Toth,et al.  Lower bounds and algorithms for the 2-dimensional vector packing problem , 2001, Discret. Appl. Math..