Distributed computing column 37: reconfiguring state machines ... and the history of common knowledge

Reconfiguration means changing the set of processes executing a distributed system. We explain several methods for reconfiguring a system implemented using the state-machine approach, including some new ones. We discuss the relation between these methods and earlier reconfiguration algorithms—especially view changing in group communication.

[1]  Alberto Montresor,et al.  Group Communication in Partitionable Systems: Specification and Algorithms , 2001, IEEE Trans. Software Eng..

[2]  Idit Keidar,et al.  Efficient message ordering in dynamic networks , 1996, PODC '96.

[3]  Leslie Lamport,et al.  Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[4]  Nancy A. Lynch,et al.  Rambo II: rapidly reconfigurable atomic memory for dynamic networks , 2003, 2003 International Conference on Dependable Systems and Networks, 2003. Proceedings..

[5]  Louise E. Moser,et al.  The Totem single-ring ordering and membership protocol , 1995, TOCS.

[6]  J. Goldberg,et al.  SIFT: Design and analysis of a fault-tolerant computer for aircraft control , 1978, Proceedings of the IEEE.

[7]  Leslie Lamport,et al.  Lower bounds for asynchronous consensus , 2006, Distributed Computing.

[8]  Vincent Gramoli,et al.  Reconfigurable distributed storage for dynamic networks , 2009, J. Parallel Distributed Comput..

[9]  Leslie Lamport,et al.  Using Time Instead of Timeout for Fault-Tolerant Distributed Systems. , 1984, TOPL.

[10]  Kenneth P. Birman,et al.  Reliable communication in the presence of failures , 1987, TOCS.

[11]  Leslie Lamport,et al.  Brief Announcement: Vertical Paxos and Primary-Backup Replication , 2009 .

[12]  Danny Dolev,et al.  The Transis approach to high availability cluster communication , 1996, CACM.

[13]  Leslie Lamport,et al.  The part-time parliament , 1998, TOCS.

[14]  Marcos K. Aguilera,et al.  Dynamic atomic storage without consensus , 2009, PODC '09.

[15]  Maurice Herlihy,et al.  Axioms for concurrent objects , 1987, POPL '87.

[16]  Nancy A. Lynch,et al.  Specifying and using a partitionable group communication service , 2001, TOCS.

[17]  Nancy A. Lynch,et al.  RAMBO: A Reconfigurable Atomic Memory Service for Dynamic Networks , 2002, DISC.

[18]  Louise E. Moser,et al.  Extended virtual synchrony , 1994, 14th International Conference on Distributed Computing Systems.

[19]  Idit Keidar,et al.  Group communication specifications: a comprehensive study , 2001, CSUR.

[20]  Kenneth P. Birman,et al.  Exploiting virtual synchrony in distributed systems , 1987, SOSP '87.

[21]  Fred B. Schneider,et al.  Implementing fault-tolerant services using the state machine approach: a tutorial , 1990, CSUR.

[22]  Jon Howell,et al.  The SMART way to migrate replicated stateful services , 2006, EuroSys.

[23]  L. Lamport,et al.  Stoppable Paxos , 2008 .

[24]  Butler W. Lampson,et al.  How to Build a Highly Available System Using Consensus , 1996, WDAG.