Reliable protocols for distributed termination detection

This paper considers the problem of detecting termination in a network that has at most one faulty node and presents several message-efficient protocols. The protocols are then extended to tolerate multiple node failures. The network is assumed to remain connected in spite of the failures. The protocols are being implemented on a network of SUN workstations.
