Optimized self-healing framework for software defined networks

Software defined network (SDN) enables network programmability and provides fine grained control for managing the complex network infrastructure. With the centralized nature of SDN it poses the requirement for failure management at the data plane, control plane and at the centralized controller side. The self-healing attribute of the autonomic network can be combined with SDN to develop a software defined self-healing resilient network. The aim of this paper is to propose a self-healing SDN framework which can optimize the recovery by applying autonomic principles. The proposed work includes a rapid recovery (RR) mechanism to perform an immediate link recovery at the switch level without overburdening the controller. Additionally, it reduces the memory requirement of the switch for storing the backup path flow rules by aggregating all the disrupted flows. We presented the analytical model for calculating the failure recovery time and the backup flow rules required for recovery. Based on the analytical model, RR scheme reduces the total number of backup flow rules for all the disrupted flows of the failed link to a single flow rule.

[1]  Amin Vahdat,et al.  PortLand: a scalable fault-tolerant layer 2 data center network fabric , 2009, SIGCOMM '09.

[2]  Gunjan Tank,et al.  Software-Defined Networking-The New Norm for Networks , 2012 .

[3]  Piero Castoldi,et al.  OpenFlow-based segment protection in Ethernet networks , 2013, IEEE/OSA Journal of Optical Communications and Networking.

[4]  Didier Colle,et al.  Enabling fast failure recovery in OpenFlow networks , 2011, 2011 8th International Workshop on the Design of Reliable Communication Networks (DRCN).

[5]  Sam Lightstone,et al.  Autonomic computing for relational databases: the ten-year vision , 2003, IEEE International Conference on Industrial Informatics, 2003. INDIN 2003. Proceedings..

[6]  Li Xin,et al.  A framework of using OpenFlow to handle transient link failure , 2011, Proceedings 2011 International Conference on Transportation, Mechanical, and Electrical Engineering (TMEE).

[7]  Anish Arora,et al.  Closure and Convergence: A Foundation of Fault-Tolerant Computing , 1993, IEEE Trans. Software Eng..

[8]  Sujata Banerjee,et al.  DevoFlow: scaling flow management for high-performance networks , 2011, SIGCOMM.

[9]  Nick Feamster,et al.  CORONET: Fault tolerance for Software Defined Networks , 2012, 2012 20th IEEE International Conference on Network Protocols (ICNP).

[10]  Nick McKeown,et al.  OpenFlow: enabling innovation in campus networks , 2008, CCRV.

[11]  Didier Colle,et al.  OpenFlow: Meeting carrier-grade recovery requirements , 2013, Comput. Commun..

[12]  Didier Colle,et al.  Fast failure recovery for in-band OpenFlow networks , 2013, 2013 9th International Conference on the Design of Reliable Communication Networks (DRCN).

[13]  Debanjan Ghosh,et al.  Self-healing systems - survey and synthesis , 2007, Decis. Support Syst..

[14]  Schahram Dustdar,et al.  A survey on self-healing systems: approaches and systems , 2010, Computing.

[15]  David Walker,et al.  Composing Software Defined Networks , 2013, NSDI.

[16]  Jeffrey O. Kephart,et al.  The Vision of Autonomic Computing , 2003, Computer.

[17]  Pontus Sköldström,et al.  Scalable fault management for OpenFlow , 2012, 2012 IEEE International Conference on Communications (ICC).

[18]  Vijay Mann,et al.  SwitchReduce: Reducing switch state and controller involvement in OpenFlow networks , 2013, 2013 IFIP Networking Conference.