A distributed primary-segmented backup scheme for dependable real-time communication in multihop net

Several distributed real-time applications require faulttolerance apart from guaranteed timeliness, at acceptable levels of overhead. These applications require hard guarantees on recovery delays, due to network component failures, which cannot be ensured in traditional datagram services. Several schemes exist which attempt to guarantee failure recovery in a timely and resource efficient manner. These methods center around a priori reservation of network resources called spare resources along a backup channel, in addition to each primary communication channel. This backup channel is usually routed along a path disjoint with the primary channel. In this paper, we propose a distributed method of segmented backups for dependable realtime communication in multihop networks, which improves upon existing methods in terms of network resource utilization, average call acceptance rate, scalability and provides better QoS guarantees on bounded failure recovery time and propagation delays, without any compromise in faulttolerance levels. The distributed algorithm is one of finding a "minimal path" based on flooding with a cut-out mechanism that does not relay messages if they came along longer paths than those known. We further show that the complexity of the distributed algorithm is bounded and acceptable.

[1]  Kang G. Shin,et al.  Experimental evaluation of failure-detection schemes in real-time communication networks , 1997, Proceedings of IEEE 27th International Symposium on Fault Tolerant Computing.

[2]  Subrahmanyam Dravida,et al.  Fast restoration of ATM networks , 1994, IEEE J. Sel. Areas Commun..

[3]  Yakov Rekhter,et al.  A Border Gateway Protocol 4 (BGP-4) , 1994, RFC.

[4]  W. D. Grover,et al.  The selfhealing network : A fast distributed restoration technique for networks using digital crossconnect machines , 1987 .

[5]  Adrian Segall,et al.  Distributed network protocols , 1983, IEEE Trans. Inf. Theory.

[6]  Amin Vahdat,et al.  Detour: informed Internet routing and transport , 1999, IEEE Micro.

[7]  K. Mani Chandy,et al.  Distributed computation on graphs: shortest path algorithms , 1982, CACM.

[8]  Parameswaran Ramanathan,et al.  Delivery of time-critical messages using a multiple copy approach , 1992, TOCS.

[9]  C. Siva Ram Murthy,et al.  A new distributed route selection approach for channel establishment in real-time networks , 1999, TNET.

[10]  Seif Haridi,et al.  Distributed Algorithms , 1992, Lecture Notes in Computer Science.

[11]  Jon Crowcroft,et al.  Quality-of-Service Routing for Supporting Multimedia Applications , 1996, IEEE J. Sel. Areas Commun..

[12]  John Moy,et al.  OSPF Protocol Analysis , 1991, RFC.

[13]  Krishna P. Gummadi,et al.  An efficient primary-segmented backup scheme for dependable real-time communication in multihop networks , 2003, TNET.

[14]  D. Estrin,et al.  RSVP: a new resource reservation protocol , 1993, IEEE Communications Magazine.

[15]  Ronny Vogel,et al.  QoS-Based Routing of Multimedia Streams in Computer Networks , 1996, IEEE J. Sel. Areas Commun..

[16]  Raj Jain,et al.  Resource Reservation Protocol (RSVP) , 1999 .

[17]  Kang G. Shin,et al.  A Primary-Backup Channel Approach to Dependable Real-Time Communication in Multihop Networks , 1998, IEEE Trans. Computers.

[18]  John Moy,et al.  OSPF Version 2 , 1998, RFC.

[19]  Krishna P. Gummadi,et al.  A Segmented Backup Scheme for Dependable Real Time Communication in Multihop Networks , 2000, IPDPS Workshops.

[20]  Kang G. Shin,et al.  Efficient spare-resource allocation for fast restoration of real-time channels from network component failures , 1997, Proceedings Real-Time Systems Symposium.

[21]  Kang G. Shin,et al.  Fault-tolerant real-time communication in distributed computing systems , 1992, [1992] Digest of Papers. FTCS-22: The Twenty-Second International Symposium on Fault-Tolerant Computing.

[22]  Ken-ichi Sato,et al.  Self-healing ATM networks based on virtual path concept , 1994, IEEE J. Sel. Areas Commun..

[23]  Robert Tappan Morris,et al.  Resilient overlay networks , 2001, SOSP.

[24]  Abhijit Bose,et al.  Delayed internet routing convergence , 2000, SIGCOMM.

[25]  A. Banerjea Simulation Study of the Capacity Effects of Dispersity Routing for Fault Tolerant Realtime Channels , 1996, SIGCOMM.

[26]  Sriram Raghavan,et al.  A rearrangeable algorithm for the construction delay-constrained dynamic multicast trees , 1999, TNET.

[27]  Hari Balakrishnan,et al.  Resilient overlay networks , 2001, SOSP.

[28]  Tohru Kikuno,et al.  A Distributed Routing Protocol for Finding Two Node-Disjoint Paths in Computer Networks , 1999 .

[29]  Hector Garcia-Molina,et al.  Aggressive Transmissions of Short Messages Over Redundant Paths , 1994, IEEE Trans. Parallel Distributed Syst..

[30]  Parameswaran Ramanathan,et al.  Resource aggregation for fault tolerance in integrated services networks , 1998, CCRV.