Risk Minimization Routing Against Geographically Correlated Failures

Regional failures, such as natural disaster or malicious attack, have become a major threat to the construction of future reliable communication network. The regional failures usually cause a large number of disconnected nodes simultaneously and influence the network for a long time. However, a routing scheme that is resilient to such geographically correlated failures is still unexplored. In this paper, we provide a comprehensive study of the disaster resilient routing dealing with the regional failure in operational IP backbone networks. It is notable that the path with minimal risk (i.e., minimal failure probability) is not necessarily the shortest path. The main challenge of finding such paths is that regional failure is unpredictable in terms of time, location, and the affected area. To this end, in combination with the computational geometry tool, we develop effective algorithms to find the minimal risk path between end node pairs to tolerate random regional failures. We show that in contrast to the conventional shortest path, a little longer path can be more effective to the disasters. After selecting such a path as the primary path, we turn to find a secondary backup path. In contrast to the conventional single link/node failure, a regional failure disrupts a large number of network components, simultaneously. As a result, how to find backup paths for re-establishment of the corrupted paths will raise a novel fairness issue. Specifically, during the backup path allocation, we focus on routing fairness to bound the worst-case user experience. A metric is proposed based on which an ILP is formulated. The extensive simulations validate that such an issue is non-negligible in face of regional failure scenarios.

[1]  Dimitri P. Bertsekas,et al.  Network optimization : continuous and discrete models , 1998 .

[2]  Ratul Mahajan,et al.  Measuring ISP topologies with rocketfuel , 2002, TNET.

[3]  Moshe Zukerman,et al.  Topology Design of Undersea Cables Considering Survivability Under Major Disasters , 2009, 2009 International Conference on Advanced Information Networking and Applications Workshops.

[4]  Biswanath Mukherjee,et al.  Disaster survivability in optical communication networks , 2013, Comput. Commun..

[5]  Stein Gjessing,et al.  Resilient Routing Layers for Network Disaster Planning , 2005, ICN.

[6]  E. Modiano,et al.  Assessing the impact of geographically correlated network failures , 2008, MILCOM 2008 - 2008 IEEE Military Communications Conference.

[7]  M. Tornatore,et al.  Design of Disaster-Resilient Optical Datacenter Networks , 2012, Journal of Lightwave Technology.

[8]  Eytan Modiano,et al.  Assessing the Vulnerability of the Fiber Infrastructure to Disasters , 2009, IEEE INFOCOM 2009.

[9]  Nick McKeown,et al.  Designing a Fault-Tolerant Network Using Valiant Load-Balancing , 2008, IEEE INFOCOM 2008 - The 27th Conference on Computer Communications.

[10]  Xiaohong Jiang,et al.  Assessing network vulnerability under probabilistic region failure model , 2011, 2011 IEEE 12th International Conference on High Performance Switching and Routing.

[11]  Xiaohong Jiang,et al.  Assessing physical network vulnerability under random line-segment failure model , 2012, 2012 IEEE 13th International Conference on High Performance Switching and Routing.

[12]  Jason P. Jue,et al.  OPN07-4: Maximum Survivability using Two Disjoint Paths under Multiple Failures in Mesh Networks , 2006, IEEE Globecom 2006.

[13]  Ning-Hai Bao,et al.  Reliability threshold based service bandwidth recovery scheme for post-disaster telecom networks , 2018, Optical Fiber Technology.

[14]  Murali S. Kodialam,et al.  Resilient routing of variable traffic with performance guarantees , 2009, 2009 17th IEEE International Conference on Network Protocols.

[15]  Arunabha Sen,et al.  Region-based connectivity - a new paradigm for design of fault-tolerant networks , 2009, 2009 International Conference on High Performance Switching and Routing.

[16]  Jian-Qiang Hu,et al.  Diverse routing in optical mesh networks , 2003, IEEE Trans. Commun..

[17]  Rajeev Motwani,et al.  Randomized Algorithms , 1995, SIGA.

[18]  Dimitri Papadimitriou,et al.  Inference of Shared Risk Link Groups , 2001 .

[19]  Eytan Modiano,et al.  Network Reliability under Random Circular Cuts , 2011, 2011 IEEE Global Telecommunications Conference - GLOBECOM 2011.

[20]  Byrav Ramamurthy,et al.  Shared risk link Group (SRLG)-diverse path provisioning under hybrid service level agreements in wavelength-routed optical mesh networks , 2005, IEEE/ACM Transactions on Networking.

[21]  Eytan Modiano,et al.  Diverse Routing in Networks with Probabilistic Failures , 2009, IEEE INFOCOM 2009.

[22]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[23]  Gil Zussman,et al.  The resilience of WDM networks to probabilistic geographical failures , 2011, INFOCOM 2011.

[24]  Pin-Han Ho,et al.  Optimal Allocation of Monitoring Trails for Fast SRLG Failure Localization in All-Optical Networks , 2010, 2010 IEEE Global Telecommunications Conference GLOBECOM 2010.

[25]  Stefan Savage,et al.  California fault lines: understanding the causes and impact of network failures , 2010, SIGCOMM '10.

[26]  A. Leon-Garcia,et al.  On congestion in mission critical networks , 2008, IEEE INFOCOM Workshops 2008.

[27]  Sanglu Lu,et al.  Designing a disaster-resilient network with software defined networking , 2014, 2014 IEEE 22nd International Symposium of Quality of Service (IWQoS).

[28]  Arun K. Somani,et al.  Diverse routing for shared risk resource groups (SRRG) failures in WDM optical networks , 2004, First International Conference on Broadband Networks.

[29]  Amund Kvalbein,et al.  Fast Recovery from Dual Link Failures in IP Networks , 2009, IEEE INFOCOM 2009.