Creating network resilience against disasters using Service Level Agreements

Building networks that are resilient to natural disasters and other geographically correlated events involves both proactive network design and remediation after the event. Network provisioning is one of the steps used to provide redundancy and diversity in networks that are key elements in resilient network design. Once the disaster has occurred, remediation may include rerouting of the most important services affected by the event. Both components involve understanding the quality of service that may be affected, knowing which services are critical, and having an understanding of the geographic events that may impact the network. Furthermore, if resources are limited following the disaster, how do we select which services to reroute? In this work, we propose to use Service Level Agreements (SLA) to provide much of the information necessary to solve the provisioning and rerouting problems. On the provisioning side, the SLAs will provide the required system response time, availability and survivability of a service. On the remediation side, the SLA can provide the priority of a service to restore and reroute. We present mixed-integer linear programming (MILP) models and heuristics to show how the most important services are provisioned and rerouted to be available in the event of a geographic disaster. Our approaches are found to be more efficient than other priority-based approaches.

[1]  G. Lewis,et al.  Service Level Agreements in Service-Oriented Architecture Environments , 2008 .

[2]  Ralf Steinmetz,et al.  Heuristics for QoS-aware Web Service Composition , 2006, 2006 IEEE International Conference on Web Services (ICWS'06).

[3]  Deep Medhi,et al.  Optimization and performance of restoration schemes for wide-area teletraffic networks , 2005, Journal of Network and Systems Management.

[4]  P. Comitz,et al.  SWIM segment 2 deployment and utilization in NextGen R&D programs , 2012, 2012 Integrated Communications, Navigation and Surveillance Conference.

[5]  Asit Dan,et al.  Web services agreement specification (ws-agreement) , 2004 .

[6]  David Hutchison,et al.  Resilience and survivability in communication networks: Strategies, principles, and survey of disciplines , 2010, Comput. Networks.

[7]  Deep Medhi,et al.  Network Traffic Engineering with Varied Levels of Protection in the Next Generation Internet , 2005 .

[8]  Deep Medhi Diverse routing for survivability in a fiber-based sparse network , 1991, ICC 91 International Conference on Communications Conference Record.

[9]  Galen H. Sasaki,et al.  Quality of protection (QoP): a quantitative unifying paradigm to protection service grades , 2001, OptiComm: Optical Networking and Communications Conference.

[10]  Raouf Boutaba,et al.  SVNE: Survivable Virtual Network Embedding Algorithms for Network Virtualization , 2013, IEEE Transactions on Network and Service Management.

[11]  D. Milojicic,et al.  A mechanism to measure quality-of-service in a federated cloud environment , 2012, FederatedClouds '12.

[12]  Deep Medhi,et al.  A unified approach to network survivability for teletraffic networks: models, algorithms and analysis , 1994, IEEE Trans. Commun..

[13]  Deep Medhi,et al.  A Geographic Multi-Topology Routing approach and its benefits during large-scale geographically correlated failures , 2015, Comput. Networks.

[14]  Elarbi Badidi A framework for brokered Service Level agreements in SOA environments , 2011, 2011 7th International Conference on Next Generation Web Services Practices.

[15]  Michal Pióro,et al.  SNDlib 1.0—Survivable Network Design Library , 2010, Networks.

[16]  James P. G. Sterbenz,et al.  On the fitness of geographic graph generators for modelling physical level topologies , 2013, 2013 5th International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT).

[17]  Deep Medhi,et al.  Routing, flow, and capacity design in communication and computer networks , 2004 .

[18]  Eytan Modiano,et al.  Analysis and algorithms for partial protection in mesh networks , 2011, IEEE/OSA Journal of Optical Communications and Networking.

[19]  Deep Medhi,et al.  Analysing GeoPath diversity and improving routing performance in optical networks , 2015, Comput. Networks.

[20]  Maria Luisa Villani,et al.  An approach for QoS-aware service composition based on genetic algorithms , 2005, GECCO '05.

[21]  Deep Medhi,et al.  Geodiverse routing with path delay and skew requirement under area‐based challenges , 2015, Networks.

[22]  Eytan Modiano,et al.  Analysis and Algorithms for Partial Protection in Mesh Networks , 2014 .