The Totem multiple-ring ordering and topology maintenance protocol

The Totem multiple-ring protocol provides reliable totally ordered delivery of messages across multiple local-area networks interconnected by gateways. This consistent message order is maintained in the presence of network partitioning and remerging, and of processor failure and recovery. The protocol provides accurate topology change information as part of the global total order of messages. It addresses the issue of scalability and achieves a latency that increases logarithmically with system size by exploiting process group locality and selective forwarding of messages through the gateways. Pseudocode for the protocol and an evaluation of its performance are given. (Authors' Abstract)
