On the use of virtualization technologies to support uninterrupted IT services: A case study with lessons learned from the Great East Japan Earthquake

Virtualized IT infrastructures combined with virtual machine (VM) migration technologies have the potential to support IT services that are resilient to partial physical infrastructure failures caused by extreme events. This paper experimentally evaluates the migration of multiple VMs across long geographical distances, an activity that is required to move virtualized IT systems from a disaster site to a safe location. Taking into account the resource availability parameters observed after the Great East Japan Earthquake, the experimental results show that it is possible to migrate tens of VMs from damaged sites to a very distant stable location if (1) service downtime on the order of minutes is acceptable, (2) VMs can be kept to a small storage footprint, and (3) power and network connectivity remain available for tens of minutes.
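The three conditions above interact through simple arithmetic: the total VM state to be moved, divided by the usable wide-area bandwidth, has to fit inside the window in which power and network are still available. The following is a minimal back-of-the-envelope sketch of that relationship, not the evaluation method used in the paper. The function names, the sequential-copy model, and all numeric values are hypothetical assumptions chosen for illustration; protocol overhead, memory re-copy rounds, and latency effects are ignored.

```python
def migration_time_minutes(num_vms, footprint_gb, bandwidth_mbps):
    """Estimate minutes needed to copy the state of all VMs over one link.

    Assumes (hypothetically) that VM state is copied back to back at a
    constant effective bandwidth, using decimal units (1 GB = 8e9 bits).
    """
    total_bits = num_vms * footprint_gb * 8e9
    seconds = total_bits / (bandwidth_mbps * 1e6)
    return seconds / 60.0


def fits_in_window(num_vms, footprint_gb, bandwidth_mbps, window_minutes):
    """Return True if the estimated copy time fits in the availability window."""
    return migration_time_minutes(num_vms, footprint_gb, bandwidth_mbps) <= window_minutes


if __name__ == "__main__":
    # Illustrative values only; none of these numbers come from the paper.
    for footprint_gb, bandwidth_mbps in [(2, 1000), (20, 100)]:
        minutes = migration_time_minutes(20, footprint_gb, bandwidth_mbps)
        ok = fits_in_window(20, footprint_gb, bandwidth_mbps, window_minutes=30)
        print(f"{footprint_gb} GB per VM at {bandwidth_mbps} Mbit/s: "
              f"{minutes:.1f} min, fits in 30 min window: {ok}")
```

Under this simplified model, the copy time grows linearly with both the number of VMs and the per-VM footprint, which is why a small storage footprint matters as much as the length of the availability window.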
