Fault-tolerant hierarchical routing

This paper presents a self-stabilizing, fault-tolerant hierarchical routing algorithm. Hierarchical routing is less expensive than traditional all-pairs routing: it requires less memory, allows faster routing table lookups, and makes broadcast less costly. The algorithm presented here retains these benefits yet maintains routing capability between all pairs of connected nodes, even in the presence of faults such as link or node failures and repairs, and corruption of program variables. It also solves the area partition problem, in which nodes defined to be in the same subset of the network become isolated from one another by link or node failures. Being self-stabilizing, the protocol is guaranteed, starting from an arbitrary state, to reach in finite time a configuration in which the routing tables contain valid entries. The protocol automatically updates the shortest paths as link weights change dynamically, and it dynamically allocates and deallocates storage for routing information as the network size changes. The algorithm works on an arbitrary topology and under a distributed daemon model.
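The self-stabilization property claimed above can be illustrated with a small sketch. This is not the paper's hierarchical protocol: it is a flat, Bellman-Ford-style distance computation toward a single destination, run in synchronous rounds rather than under a distributed daemon, and it assumes positive link weights. The names `stabilize`, `graph`, and `cap` are hypothetical and chosen only for illustration. The point is that every node repeatedly recomputes its estimate from its neighbors' estimates, so corrupted initial values are eventually overwritten by correct ones.

```python
INF = float("inf")

def stabilize(graph, dest, dist):
    """Repeat local corrections until no estimate changes.

    graph: dict mapping node -> {neighbor: positive link weight}
    dest:  destination node
    dist:  dict of initial distance estimates, possibly corrupted
    Returns (stable estimates, number of rounds that changed something).
    """
    # Assumption: no shortest path can be longer than the sum of all link
    # weights, so any estimate at or above this cap is treated as unreachable.
    # This keeps estimates in a partitioned component from growing forever.
    cap = sum(w for nbrs in graph.values() for w in nbrs.values()) + 1
    rounds = 0
    while True:
        new = {}
        for v, nbrs in graph.items():
            if v == dest:
                new[v] = 0  # the destination always corrects itself to 0
            else:
                best = min((dist.get(u, INF) + w for u, w in nbrs.items()),
                           default=INF)
                new[v] = best if best < cap else INF
        if new == dist:
            return dist, rounds
        dist = new
        rounds += 1

# Hypothetical 5-node network; D and E are cut off from A, B, and C.
graph = {
    "A": {"B": 1, "C": 5},
    "B": {"A": 1, "C": 2},
    "C": {"A": 5, "B": 2},
    "D": {"E": 1},
    "E": {"D": 1},
}
corrupted = {"A": 7, "B": 0, "C": -3, "D": 1, "E": 0}  # arbitrary start state
stable, _ = stabilize(graph, "A", corrupted)
```

Starting from the corrupted state, the estimates for the connected nodes settle on the true shortest distances to `A`, and the partitioned nodes `D` and `E` end up marked unreachable. Rerunning `stabilize` after deleting a link or changing a weight models recovery from a topology change under the same assumptions.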
