Early warning disaster-aware service protection in geo-distributed data centers

Abstract We investigate dynamic protection of cloud services supported by geo-distributed DCs (Geographically distributed Data Centers) under disaster scenarios which lead to failures of DC nodes and communication fibers. By assuming that the affected network region can be aware of an upcoming disaster for a certain early warning time before it is finally hit, and the corresponding service requests can be identified before they are disrupted, we utilize early warning time to achieve dynamic service protection by carrying out three methods: request rerouting, request relocation and service relocation. In particular, the first one reroutes requests to their original service providing DC nodes; the second one relocates requests to other service providing DC nodes; and the third one relocates requested services to backup DC nodes to support service requests. We first formulate an Integer Linear Program (ILP) model to maximize the number of disaster-affected requests that can be protected, and then propose a time-efficient heuristic solution. Extensive numerical results show that the proposed strategy can achieve good performance to protect service requests under disasters with early warning times.

[1]  M. Tornatore,et al.  Design of Disaster-Resilient Optical Datacenter Networks , 2012, Journal of Lightwave Technology.

[2]  Takaaki Adachi,et al.  The restoration of telecom power damages by the Great East Japan Earthquake , 2011, 2011 IEEE 33rd International Telecommunications Energy Conference (INTELEC).

[3]  Biswanath Mukherjee,et al.  Disaster-aware datacenter placement and dynamic content management in cloud networks , 2015, IEEE/OSA Journal of Optical Communications and Networking.

[4]  Biswanath Mukherjee,et al.  Disaster-aware service provisioning with manycasting in cloud networks , 2014, Photonic Network Communications.

[5]  Ian Taylor,et al.  SWITCH workbench: A novel approach for the development and deployment of time-critical microservice-based cloud-native applications , 2019, Future Gener. Comput. Syst..

[6]  Biswanath Mukherjee,et al.  Data evacuation from data centers in disaster-affected regions through software-defined satellite networks , 2019, Comput. Networks.

[7]  Biswanath Mukherjee,et al.  Rapid data evacuation for large-scale disasters in optical cloud networks [Invited] , 2015, IEEE/OSA Journal of Optical Communications and Networking.

[8]  Lena Wosinska,et al.  Restoration in optical cloud networks with relocation and services differentiation , 2016, IEEE/OSA Journal of Optical Communications and Networking.

[9]  Biswanath Mukherjee,et al.  A Survey on Resiliency Techniques in Cloud Computing Infrastructures and Applications , 2016, IEEE Communications Surveys & Tutorials.

[10]  Shunroku Yamamoto,et al.  Evaluation of the real‐time earthquake information system in Japan , 2009 .

[11]  Lisandro Zambenedetti Granville,et al.  Data Center Network Virtualization: A Survey , 2013, IEEE Communications Surveys & Tutorials.

[12]  Yongjun Zhang,et al.  Survivable Deployments of Optical Sensor Networks against Multiple Failures and Disasters: A Survey , 2019, Sensors.

[13]  Lei Zhang,et al.  Joint Design on DCN Placement and Survivable Cloud Service Provision over All-Optical Mesh Networks , 2014, IEEE Transactions on Communications.

[14]  Novella Bartolini,et al.  On Critical Service Recovery After Massive Network Failures , 2017, IEEE/ACM Transactions on Networking.

[15]  Kelly T. Morrison,et al.  Rapidly recovering from the catastrophic loss of a major telecommunications office , 2011, IEEE Communications Magazine.

[16]  Xiaohong Jiang,et al.  Heterogeneous data backup against early warning disasters in geo-distributed data center networks , 2018, IEEE/OSA Journal of Optical Communications and Networking.

[17]  Biswanath Mukherjee,et al.  Global versus essential post-disaster re-provisioning in telecom mesh networks , 2015, IEEE/OSA Journal of Optical Communications and Networking.

[18]  Thomas F. La Porta,et al.  Progressive damage assessment and network recovery after massive failures , 2017, IEEE INFOCOM 2017 - IEEE Conference on Computer Communications.

[19]  Cees T. A. M. de Laat,et al.  Dynamic Real-Time Infrastructure Planning and Deployment for Disaster Early Warning Systems , 2018, ICCS.

[20]  Tarik Taleb,et al.  Ε-time early warning data backup in disaster-aware optical inter-connected data center networks , 2017, IEEE/OSA Journal of Optical Communications and Networking.

[21]  George Suciu,et al.  Cloud Computing as Evolution of Distributed Computing – A Case Study for SlapOS Distributed Cloud Computing Platform , 2013 .

[22]  Biswanath Mukherjee,et al.  Minimizing the Risk From Disaster Failures in Optical Backbone Networks , 2014, Journal of Lightwave Technology.

[23]  Hiroshi Saito Spatial Design of Physical Network Robust Against Earthquakes , 2015, Journal of Lightwave Technology.