On the Formal Specification of Group Membership Services

The problem of group membership has been the focus of much theoretical and experimental work on fault-tolerant distributed systems. This has resulted in a voluminous literature and several formal specifications of this problem have been given. In this paper, we examine the two most referenced formal specifications of group membership and show that they are unsatisfactory: One has flaws in the formalism and allows undesirable executions, and the other can be satisfied by useless protocols.

[1]  Shivakant Mishra,et al.  A Membership Protocol Based on Partial Order , 1992 .

[2]  Newtop: a fault-tolerant group communication protocol , 1995, Proceedings of 15th International Conference on Distributed Computing Systems.

[3]  Kenneth P. Birman,et al.  Using process groups to implement failure detection in asynchronous environments , 1991, PODC '91.

[4]  Tushar Deepak Chandra Unreliable failure detectors for asynchronous distributed systems , 1993 .

[5]  Özalp Babaoglu,et al.  RELACS: A communications infrastructure for constructing reliable applications in large-scale distributed systems , 1995, Proceedings of the Twenty-Eighth Annual Hawaii International Conference on System Sciences.

[6]  Bradford B. Glade,et al.  The Horus System , 1993 .

[7]  Kenneth P. Birman,et al.  Process Membership in Asynchronous Environments , 1993 .

[8]  Ragunathan Rajkumar,et al.  Processor group membership protocols: specification, design and implementation , 1993, Proceedings of 1993 IEEE 12th Symposium on Reliable Distributed Systems.

[9]  Yair Amir,et al.  Membership Algorithms for Multicast Communication Groups , 1992, WDAG.

[10]  Matti A. Hiltunen,et al.  Properties of membership services , 1995, Proceedings ISADS 95. Second International Symposium on Autonomous Decentralized Systems.

[11]  Leslie Lamport,et al.  Distributed snapshots: determining global states of distributed systems , 1985, TOCS.

[12]  Andrew S. Tanenbaum,et al.  Group communication in the Amoeba distributed operating system , 1991, [1991] Proceedings. 11th International Conference on Distributed Computing Systems.

[13]  F. Cristian Reaching Agreement on Processor Group Membership in Synchronous Distributed Systems Key Words: Communication Network { Distributed System { Failure Detection { Fault Tolerance { Real Time System { Replicated Data , 1991 .

[14]  Yair Amir,et al.  Transis: A Communication Sub-system for High Availability , 1992 .

[15]  Kenneth P. Birman,et al.  The process group approach to reliable distributed computing , 1992, CACM.

[16]  Aleta Marie Ricciardi,et al.  The Group Membership Problem in Asynchronous Systems , 1993 .

[17]  Louise E. Moser,et al.  Processor Membership in Asynchronous Distributed Systems , 1994, IEEE Trans. Parallel Distributed Syst..

[18]  Danny Dolev,et al.  A Framework for Partitionable Membership Service (Abstract). , 1996, PODC 1996.