Communication-Aware Container Placement and Reassignment in Large-Scale Internet Data Centers

Containerization has been used in many applications for isolation purposes due to its lightweight, scalable, and highly portable properties. However, to apply containerization in large-scale Internet data centers faces a big challenge. Services in data centers are always instantiated as a group of containers, which often generate heavy communication workloads and therefore resulting in inefficient communications and downgraded service performance. Although assigning the containers of the same service to the same server can reduce the communication overhead, this may cause heavily imbalanced resource utilization since containers of the same service are usually intensive to the same resource. To reduce communication cost as well as balance the resource utilization in large-scale data centers, we further explore the container distribution issues in a real industrial environment and find that such conflict lies in two phases—container placement and container reassignment. The objective of this paper is to address the container distribution problem in these two phases. For the container placement problem, we propose an efficient communication aware worst fit decreasing algorithm to place a set of new containers into data centers. For the container reassignment problem, we propose a two-stage algorithm called Sweep&Search to optimize a given initial distribution of containers by migrating containers among servers. We implement the proposed algorithms in Baidu’s data centers and conduct extensive evaluations. Compared with the state-of-the-art strategies, the evaluation results show that our algorithms perform better up to 70% and increase the overall service throughput up to 90% simultaneously.

[1]  Tarik Taleb,et al.  Towards Edge Slicing: VNF Placement Algorithms for a Dynamic & Realistic Edge Cloud Environment , 2017, GLOBECOM 2017 - 2017 IEEE Global Communications Conference.

[2]  Hong Xu,et al.  Multi-resource Load Balancing for Virtual Network Functions , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[3]  Zhu Han,et al.  Signal Processing and Networking for Big Data Applications , 2017 .

[4]  Holger Karl,et al.  Specifying and placing chains of virtual network functions , 2014, 2014 IEEE 3rd International Conference on Cloud Networking (CloudNet).

[5]  Jing Zhu,et al.  Bandwidth guaranteed virtual network function placement and scaling in datacenter networks , 2015, 2015 IEEE 34th International Performance Computing and Communications Conference (IPCCC).

[6]  Qun Li,et al.  Efficient service handoff across edge servers via docker container migration , 2017, SEC.

[7]  Randy H. Katz,et al.  Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center , 2011, NSDI.

[8]  Jiankun Hu,et al.  Cloud-Based Approximate Constrained Shortest Distance Queries Over Encrypted Graphs With Privacy Protection , 2018, IEEE Transactions on Information Forensics and Security.

[9]  Jie Wu,et al.  Let's stay together: Towards traffic aware virtual machine placement in data centers , 2012, IEEE INFOCOM 2014 - IEEE Conference on Computer Communications.

[10]  Raouf Boutaba,et al.  Elastic virtual network function placement , 2015, 2015 IEEE 4th International Conference on Cloud Networking (CloudNet).

[11]  Ke Xu,et al.  PieBridge: A Cross-DR scale Large Data Transmission Scheduling System , 2016, SIGCOMM.

[12]  Song Guo,et al.  A General Communication Cost Optimization Framework for Big Data Stream Processing in Geo-Distributed Data Centers , 2016, IEEE Transactions on Computers.

[13]  Mohammed Samaka,et al.  Optimal virtual network function placement in multi-cloud service function chaining architecture , 2017, Comput. Commun..

[14]  Fred W. Glover,et al.  A very large-scale neighborhood search algorithm for the multi-resource generalized assignment problem , 2004, Discret. Optim..

[15]  Rajkumar Buyya,et al.  A Framework and Algorithm for Energy Efficient Container Consolidation in Cloud Data Centers , 2015, 2015 IEEE International Conference on Data Science and Data Intensive Systems.

[16]  Ramakrishnan Rajamony,et al.  An updated performance comparison of virtual machines and Linux containers , 2015, 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[17]  Ke Xu,et al.  BDS: a centralized near-optimal overlay network for inter-datacenter data replication , 2018, EuroSys.

[18]  Hari Balakrishnan,et al.  Choreo: network-aware task placement for cloud applications , 2013, Internet Measurement Conference.

[19]  Teng Ma,et al.  Optimal VM placement for traffic scalability using Markov chain in cloud data centre networks , 2017 .

[20]  Hasan Pirkul,et al.  Computer and Database Location in Distributed Computer Systems , 1986, IEEE Transactions on Computers.

[21]  Tao Han,et al.  DRAPS: Dynamic and resource-aware placement scheme for docker containers in a heterogeneous cluster , 2017, 2017 IEEE 36th International Performance Computing and Communications Conference (IPCCC).

[22]  Joseph B. Mazzola,et al.  Heuristics for the multi‐resource generalized assignment problem , 2001 .

[23]  Temel Öncan,et al.  A Survey of the Generalized Assignment Problem and Its Applications , 2007, INFOR Inf. Syst. Oper. Res..

[24]  Tong Li,et al.  Modeling, Analysis, and Implementation of Universal Acceleration Platform Across Online Video Sharing Sites , 2018, IEEE Transactions on Services Computing.

[25]  Thibaut Vidal,et al.  An iterated local search heuristic for multi-capacity bin packing and machine reassignment problems , 2013, Expert Syst. Appl..

[26]  S. R,et al.  Data Mining with Big Data , 2017, 2017 11th International Conference on Intelligent Systems and Control (ISCO).

[27]  Henrik I. Christensen,et al.  Approximation and online algorithms for multidimensional bin packing: A survey , 2017, Comput. Sci. Rev..

[28]  Brendan Burns,et al.  Design Patterns for Container-based Distributed Systems , 2016, HotCloud.

[29]  Saikat Guha,et al.  Generalized resource allocation for the cloud , 2012, SoCC '12.

[30]  Zoltán Ádám Mann,et al.  Which is the best algorithm for virtual machine placement optimization? , 2017, Concurr. Comput. Pract. Exp..

[31]  Richard M. Soland,et al.  A branch and bound algorithm for the generalized assignment problem , 1975, Math. Program..

[32]  Laurent Hérault,et al.  Solving a Real World Assignment Problem with a Metaheuristic , 1998, J. Heuristics.

[33]  Ke Xu,et al.  A Communication-Aware Container Re-Distribution Approach for High Performance VNFs , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[34]  Ke Xu,et al.  Towards shorter task completion time in datacenter networks , 2015, 2015 IEEE 34th International Performance Computing and Communications Conference (IPCCC).

[35]  Tarik Taleb,et al.  Efficient virtual evolved packet core deployment across multiple cloud domains , 2018, 2018 IEEE Wireless Communications and Networking Conference (WCNC).

[36]  Ke Xu,et al.  Elastic and Efficient Virtual Network Provisioning for Cloud-Based Multi-tier Applications , 2015, 2015 44th International Conference on Parallel Processing.

[37]  Haris Gavranovic,et al.  An efficient local search with noising strategy for Google Machine Reassignment problem , 2016, Ann. Oper. Res..

[38]  Tarik Taleb,et al.  Service-aware network function placement for efficient traffic handling in carrier cloud , 2014, 2014 IEEE Wireless Communications and Networking Conference (WCNC).

[39]  Christoph Hochreiner,et al.  Elastic Provisioning of Virtual Machines for Container Deployment , 2017, ICPE Companion.

[40]  Tianlong Yu,et al.  FreeFlow: High Performance Container Networking , 2016, HotNets.

[41]  Lucas Chaufournier,et al.  Containers and Virtual Machines at Scale: A Comparative Study , 2016, Middleware.

[42]  Athanasios V. Vasilakos,et al.  GreenDCN: A General Framework for Achieving Energy Efficiency in Data Center Networks , 2013, IEEE Journal on Selected Areas in Communications.

[43]  Sheng Wang,et al.  Joint VM placement and topology optimization for traffic scalability in dynamic datacenter networks , 2015, Comput. Networks.

[44]  Huiqun Yu,et al.  A Novel Resource Scheduling Approach in Container Based Clouds , 2014, 2014 IEEE 17th International Conference on Computational Science and Engineering.

[45]  Bo Dong,et al.  Container-VM-PM Architecture: A Novel Architecture for Docker Container Placement , 2018, CLOUD.

[46]  Peng Li,et al.  A Minimum-Aware Container Live Migration Algorithm in the Cloud Environment , 2017, Int. J. Bus. Data Commun. Netw..

[47]  Wenbin Yao,et al.  A container scheduling strategy based on neighborhood division in micro service , 2018, NOMS 2018 - 2018 IEEE/IFIP Network Operations and Management Symposium.

[48]  Tarik Taleb,et al.  A Queuing Based Dynamic Auto Scaling Algorithm for the LTE EPC Control Plane , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[49]  Alexander J. Smola,et al.  Scaling Distributed Machine Learning with the Parameter Server , 2014, OSDI.

[50]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[51]  Shripad Nadgowda,et al.  Voyager: Complete Container State Migration , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[52]  I. Stoica,et al.  FairCloud: sharing the network in cloud computing , 2011, CCRV.

[53]  Antonello Monti,et al.  Migrating LinuX Containers Using CRIU , 2016, ISC Workshops.

[54]  Tong Li,et al.  Toward Cloud-Based Distributed Interactive Applications: Measurement, Modeling, and Analysis , 2018, IEEE/ACM Transactions on Networking.

[55]  Juan A. Díaz,et al.  A Tabu search heuristic for the generalized assignment problem , 2001, Eur. J. Oper. Res..

[56]  David S. Johnson,et al.  Fast Algorithms for Bin Packing , 1974, J. Comput. Syst. Sci..

[57]  Andreas Kassler,et al.  A Power Efficient and Robust Virtual Network Functions Placement Problem , 2016, 2016 28th International Teletraffic Congress (ITC 28).

[58]  Ke Xu,et al.  Going Fast and Fair: Latency Optimization for Cloud-Based Service Chains , 2018, IEEE Network.

[59]  Teofilo F. Gonzalez,et al.  P-Complete Approximation Problems , 1976, J. ACM.

[60]  Charles Anderson,et al.  Docker , 2015, IEEE Softw..

[61]  Albert G. Greenberg,et al.  Reining in the Outliers in Map-Reduce Clusters using Mantri , 2010, OSDI.

[62]  Hasan Pirkul,et al.  Algorithms for the multi-resource generalized assignment problem , 1991 .

[63]  Konstantin Kogan,et al.  Capacity planning by the dynamic multi-resource generalized assignment problem (DMRGAP) , 1998, Eur. J. Oper. Res..

[64]  Jie Wu,et al.  A Novel Multi-objective Optimization Scheme for Rebalancing Virtual Machine Placement , 2016, 2016 IEEE 9th International Conference on Cloud Computing (CLOUD).

[65]  Kin K. Leung,et al.  Live Service Migration in Mobile Edge Clouds , 2017, IEEE Wireless Communications.

[66]  Chanwit Kaewkasi,et al.  Improvement of container scheduling for Docker using Ant Colony Optimization , 2017, 2017 9th International Conference on Knowledge and Smart Technology (KST).

[67]  Yuqing Qiu,et al.  Evaluating and Improving LXC Container Migration Between Cloudlets Using Multipath TCP , 2016 .

[68]  Vasileios Pappas,et al.  Improving the Scalability of Data Center Networks with Traffic-aware Virtual Machine Placement , 2010, 2010 Proceedings IEEE INFOCOM.

[69]  Masayuki Murata,et al.  Dynamic placement of virtual network functions based on model predictive control , 2016, NOMS 2016 - 2016 IEEE/IFIP Network Operations and Management Symposium.

[70]  Tarik Taleb,et al.  Towards Modeling Cross-Domain Network Slices for 5G , 2018, 2018 IEEE Global Communications Conference (GLOBECOM).

[71]  Rina Panigrahy,et al.  Heuristics for Vector Bin Packing , 2011 .

[72]  Larry L. Peterson,et al.  Container-based operating system virtualization: a scalable, high-performance alternative to hypervisors , 2007, EuroSys '07.

[73]  Jian Guo,et al.  Joint Optimization of Chain Placement and Request Scheduling for Network Function Virtualization , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[74]  Stefano Secci,et al.  Virtual network functions placement and routing optimization , 2015, 2015 IEEE 4th International Conference on Cloud Networking (CloudNet).

[75]  Abraham P. Punnen,et al.  Local search intensified: Very large-scale variable neighborhood search for the multi-resource generalized assignment problem , 2009, Discret. Optim..

[76]  Tarik Taleb,et al.  User mobility-aware Virtual Network Function placement for Virtual 5G Network Infrastructure , 2015, 2015 IEEE International Conference on Communications (ICC).

[77]  Mithuna Thottethodi,et al.  Understanding and mitigating the impact of load imbalance in the memory caching tier , 2013, SoCC.

[78]  Gang Chen,et al.  Energy-Aware Container Consolidation Based on PSO in Cloud Data Centers , 2018, 2018 IEEE Congress on Evolutionary Computation (CEC).

[79]  Jasbir S. Arora,et al.  Survey of multi-objective optimization methods for engineering , 2004 .

[80]  Shanhe Yi,et al.  Efficient Live Migration of Edge Services Leveraging Container Layered Storage , 2019, IEEE Transactions on Mobile Computing.

[81]  Roberto Rojas-Cessa,et al.  Energy-aware scheduling schemes for cloud data centers on Google trace data , 2014, 2014 IEEE Online Conference on Green Communications (OnlineGreenComm).

[82]  Ragunathan Rajkumar,et al.  Resource Allocation in Distributed Mixed-Criticality Cyber-Physical Systems , 2010, 2010 IEEE 30th International Conference on Distributed Computing Systems.

[83]  Yong Wang,et al.  Design, modeling, and analysis of online combinatorial double auction for mobile cloud computing markets , 2018, Int. J. Commun. Syst..