A hybrid local storage transfer scheme for live migration of I/O intensive workloads

Live migration of virtual machines (VMs) is a key feature of virtualization that is extensively leveraged in IaaS cloud environments: it is the basic building block of several important features, such as load balancing, proactive fault tolerance, power management, and online maintenance. While most live migration efforts concentrate on how to transfer memory from source to destination during the migration process, comparatively little attention has been devoted to the transfer of storage. This problem is gaining importance: for performance reasons, virtual machines that run large-scale, data-intensive applications tend to rely on local storage, which poses a difficult challenge for live migration because storage must be transferred in addition to memory. This paper proposes an approach that addresses this challenge independently of how memory is migrated. It relies on a hybrid active push / prioritized prefetch strategy, which makes it highly resilient to the rapid changes of disk state exhibited by I/O-intensive workloads. At the same time, it is minimally intrusive, to ensure maximum portability across a wide range of hypervisors. Large-scale experiments that involve multiple simultaneous migrations of both synthetic benchmarks and a real scientific application show improvements of up to 10x faster migration time, 10x less bandwidth consumption, and 8x less performance degradation compared with the state of the art.
