Sledge: Towards Efficient Live Migration of Docker Containers

Modern large-scale cloud platforms require live migration technique on Docker containers with stateful workload to support load balancing, host maintenance, and Quality of Service (QoS) improvement. Efficient and scalable Docker live migration is expected to guarantee the component-integrity (image, runtime, and management context) with negligible downtime. In this paper, we present a highly efficient live migration system called Sledge, which ensures the component-integrity by integrating both images and management context during runtime migration. The key insight is that the layered image can be leveraged to reduce the migration overhead, and appropriately selective migration of management context will effectively improve QoS with negligible downtime. To achieve good scalability, a lightweight container registry mechanism for end-to-end image migration is designed to avoid the redundant layers transmission. In addition, a dynamic context loading scheme is proposed to precisely load the management context into the running daemon, which can significantly reduce downtime. Experiments show that, compared with the state-of-the-art, Sledge reduces 57% of total migration time, 55% of image migration time, and 70% downtime.

[1]  Qun Li,et al.  Efficient service handoff across edge servers via docker container migration , 2017, SEC.

[2]  Umesh Bellur,et al.  On Selecting the Right Optimizations for Virtual Machine Migration , 2016, VEE.

[3]  Hai Jin,et al.  Live migration of virtual machine based on full system trace and replay , 2009, HPDC '09.

[4]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[5]  Hai Jin,et al.  Performance and energy modeling for live migration of virtual machines , 2011, Cluster Computing.

[6]  Shripad Nadgowda,et al.  Voyager: Complete Container State Migration , 2017, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[7]  René Peinl,et al.  Docker Cluster Management for the Cloud - Survey Results and Own Solution , 2016, Journal of Grid Computing.

[8]  Khaled Z. Ibrahim,et al.  Optimized pre-copy live migration for memory intensive applications , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[9]  Danny Jones,et al.  VM Live Migration At Scale , 2018, VEE.

[10]  Kartik Gopalan,et al.  Post-copy based live virtual machine migration using adaptive pre-paging and dynamic self-ballooning , 2009, VEE '09.

[11]  Charles Anderson,et al.  Docker , 2015, IEEE Softw..

[12]  David Bernstein,et al.  Containers and Cloud: From LXC to Docker to Kubernetes , 2014, IEEE Cloud Computing.

[13]  Richard O. Sinnott,et al.  A performance comparison of container-based technologies for the Cloud , 2017, Future Gener. Comput. Syst..

[14]  Andrea C. Arpaci-Dusseau,et al.  Slacker: Fast Distribution with Lazy Docker Containers , 2016, FAST.

[15]  Stefan Lankes,et al.  Implications of Process-Migration in Virtualized Environments , 2016, COSH@HiPEAC.

[16]  Hai Jin,et al.  MECOM: Live migration of virtual machines by adaptively compressing memory pages , 2014, Future Gener. Comput. Syst..

[17]  Antonello Monti,et al.  Migrating LinuX Containers Using CRIU , 2016, ISC Workshops.