VMFlock: virtual machine co-migration for the cloud

This paper presents VMFlockMS, a migration service optimized for cross-datacenter transfer and instantiation of groups of virtual machine (VM) images that comprise an application-level solution (e.g., a three-tier web application). We dub these groups of related VM images VMFlocks. VMFlockMS employs two main techniques: first, data deduplication within the VMFlock to be migrated and between the VMFlock and the data already present at the destination datacenter, and, second, accelerated instantiation of the application at the target datacenter after transferring only a partial set of data blocks and prioritization of the remaining data based on previously observed access patterns originating from the running VMs. VMFlockMS is designed to be deployed as a set of virtual appliances which make efficient use of the available cloud resources to locally access and deduplicate the images and data in a distributed fashion with minimal requirements imposed on the cloud API to access the VM image repository. VMFlockMS provides an incrementally scalable and high-performance migration service. Our evaluation shows that VMFlockMS can reduce the data volumes to be transferred over the network to as low as 3% of the original VMFlock size, enables the complete transfer of the VM images belonging to a VMFlock over transcontinental link up to 3.5x faster than alternative approaches, and enables booting these VM images with as little as 5% of the compressed VMFlock data available at the destination.

[1]  T. May,et al.  Alpha-particle-induced soft errors in dynamic memories , 1979, IEEE Transactions on Electron Devices.

[2]  David R. Karger,et al.  Consistent hashing and random trees: distributed caching protocols for relieving hot spots on the World Wide Web , 1997, STOC '97.

[3]  Lauren Wood 技術解説 IEEE Internet Computing , 1999 .

[4]  MaziéresDavid,et al.  A low-bandwidth network file system , 2001 .

[5]  David Mazières,et al.  A low-bandwidth network file system , 2001, SOSP.

[6]  Monica S. Lam,et al.  Optimizing the migration of virtual computers , 2002, OPSR.

[7]  Sean Matthew Dorward,et al.  Awarded Best Paper! - Venti: A New Approach to Archival Data Storage , 2002 .

[8]  Sean Quinlan,et al.  Venti: A New Approach to Archival Storage , 2002, FAST.

[9]  Val Henson,et al.  An Analysis of Compare-by-hash , 2003, HotOS.

[10]  Andrew Warfield,et al.  Live migration of virtual machines , 2005, NSDI.

[11]  William E. Allcock,et al.  The Globus Striped GridFTP Framework and Server , 2005, ACM/IEEE SC 2005 Conference (SC'05).

[12]  John Black,et al.  Compare-by-Hash: A Reasoned Analysis , 2006, USENIX Annual Technical Conference, General Track.

[13]  Mark Lillibridge,et al.  Jumbo Store: Providing Efficient Incremental Upload and Versioning for a Utility Rendering Service , 2007, FAST.

[14]  Shankar Pasupathy,et al.  An analysis of latent sector errors in disk drives , 2007, SIGMETRICS '07.

[15]  Marc Vertes,et al.  Fault Tolerance in Multiprocessor Systems Via Application Cloning , 2007, 27th International Conference on Distributed Computing Systems (ICDCS '07).

[16]  Anja Feldmann,et al.  Live wide-area migration of virtual machines including local persistent state , 2007, VEE '07.

[17]  Kai Li,et al.  Avoiding the Disk Bottleneck in the Data Domain Deduplication File System , 2008, FAST.

[18]  Aleksey Pesterev,et al.  Fast, Inexpensive Content-Addressed Storage in Foundation , 2008, USENIX Annual Technical Conference.

[19]  Anthony Liguori,et al.  Experiences with Content Addressable Storage and Virtual Disks , 2008, Workshop on I/O Virtualization.

[20]  Bowen Alpern,et al.  Opening black boxes: using semantic information to combat virtual machine image sprawl , 2008, VEE '08.

[21]  Matei Ripeanu,et al.  stdchk: A Checkpoint Storage System for Desktop Grid Computing , 2007, 2008 The 28th International Conference on Distributed Computing Systems.

[22]  Satoshi Sekiguchi,et al.  A live storage migration mechanism over wan and its performance evaluation , 2009, VTDC '09.

[23]  Soumitra Sarkar,et al.  CARP: Handling Silent Data Errors and Site Failures in an Integrated Program and Storage Replication Mechanism , 2009, 2009 29th IEEE International Conference on Distributed Computing Systems.

[24]  José A. B. Fortes,et al.  Sky Computing , 2009, IEEE Internet Computing.

[25]  Bruce M. Maggs,et al.  Cutting the electric bill for internet-scale systems , 2009, SIGCOMM '09.

[26]  Eyal de Lara,et al.  SnowFlock: rapid virtual machine cloning for cloud computing , 2009, EuroSys '09.

[27]  Satoshi Sekiguchi,et al.  A Live Storage Migration Mechanism over WAN for Relocatable Virtual Machine Services on Clouds , 2009, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid.

[28]  Antonio Puliafito,et al.  How to Enhance Cloud Architectures to Enable Cross-Federation , 2010, IEEE CLOUD.

[29]  Anees Shaikh,et al.  Cross Enterprise Improvements Delivered via a Cloud Platform: A Game Changer for the Consumer Product and Retail Industry , 2010, 2010 IEEE International Conference on Services Computing.

[30]  Ajay Mohindra,et al.  Solution-based deployment of complex application services on a Cloud , 2010, Proceedings of 2010 IEEE International Conference on Service Operations and Logistics, and Informatics.

[31]  Ali Raza Butt,et al.  Reconciling scratch space consumption, exposure, and volatility to achieve timely staging of job input data , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[32]  Hong Jiang,et al.  DEBAR: A scalable high-performance de-duplication storage system for backup and archiving , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[33]  Mark Adler,et al.  zlib compression library , 2012 .