Tight failure detection bounds on atomic object implementations

This article determines the weakest failure detectors needed to implement shared atomic objects in a distributed system with crash-prone processes. We first determine the weakest failure detector for the basic register object. We then use that result to determine the weakest failure detector for all popular atomic objects, including test-and-set, fetch-and-add, queue, consensus, and compare-and-swap, and show that it is the same for all of them.
