A dynamic hybrid protocol for total order in large-scale systems

Totally ordered multicast protocols have proved to be extremely useful in supporting fault-tolerant distributed applications. This paper compares the performance of the two main classes of protocols providing total order in large-scale systems (token-site and symmetric protocols) and proposes a new dynamic hybrid protocol that, when applied to systems where the topology/traffic patterns are not known a priori, offers a much lower latency than any of the previous classes of protocols in isolation.

[1]  Louise E. Moser,et al.  Fast message ordering and membership using a logical token-passing ring , 1993, [1993] Proceedings. The 13th International Conference on Distributed Computing Systems.

[2]  Louise E. Moser,et al.  Broadcast Protocols for Distributed Systems , 1990, IEEE Trans. Parallel Distributed Syst..

[3]  Bradford B. Glade,et al.  The Horus System , 1993 .

[4]  Andrew S. Tanenbaum,et al.  Group communication in the Amoeba distributed operating system , 1991, [1991] Proceedings. 11th International Conference on Distributed Computing Systems.

[5]  Paulo Veríssimo,et al.  xAMp: a multi-primitive group communications service , 1992, [1992] Proceedings 11th Symposium on Reliable Distributed Systems.

[6]  Liuba Shrira,et al.  Lazy replication: exploiting the semantics of distributed services , 1990, ACM SIGOPS European Workshop.

[7]  Richard D. Schlichting,et al.  Preserving and using context information in interprocess communication , 1989, TOCS.

[8]  Flaviu Cristian,et al.  Atomic Broadcast: From Simple Message Diffusion to Byzantine Agreement , 1995, Inf. Comput..

[9]  Shivakant Mishra,et al.  Protocol modularity in systems for managing replicated data , 1992, [1992 Proceedings] Second Workshop on the Management of Replicated Data.

[10]  Kenneth P. Birman,et al.  Reliable communication in the presence of failures , 1987, TOCS.

[11]  André Schiper,et al.  Virtually-synchronous communication based on a weak failure suspector , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[12]  André Schiper,et al.  Lightweight causal and atomic group multicast , 1991, TOCS.

[13]  Jo-Mei Chang,et al.  Reliable broadcast protocols , 1984, TOCS.

[14]  Peter W. M. John Statistical Methods in Engineering and Quality Assurance , 1990 .

[15]  Alan O. Freier,et al.  Multicast Transport Protocol , 1992, RFC.

[16]  Fred B. Schneider,et al.  Implementing fault-tolerant services using the state machine approach: a tutorial , 1990, CSUR.

[17]  Theodore Johnson,et al.  Two Approaches for High Concurrency in Multicast-Based Object Replication , 1994 .

[18]  Paulo Veríssimo,et al.  AMp: a highly parallel atomic multicast protocol , 1989, SIGCOMM '89.

[19]  Yair Amir,et al.  Transis: A Communication Sub-system for High Availability , 1992 .

[20]  Andrew T. Heybey The network simulator version 2 , 1990 .

[21]  Paulo Veríssimo,et al.  AMp: a highly parallel atomic multicast protocol , 1989, SIGCOMM 1989.

[22]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[23]  Danny Dolev,et al.  Early delivery totally ordered multicast in asynchronous environments , 1993, FTCS-23 The Twenty-Third International Symposium on Fault-Tolerant Computing.

[24]  Newtop: a fault-tolerant group communication protocol , 1995, Proceedings of 15th International Conference on Distributed Computing Systems.