Weakest failure detectors via an egg-laying simulation (Preliminary Version)

In the k-set agreement task, n processes propose values and must decide on at most k of these values. In particular, consensus is 1-set agreement. At PODC 2008, Zieliński showed that the anti-Ω failure detector is necessary and sufficient to solve (n − 1)-set agreement in an asynchronous read/write shared memory system where at most t = n − 1 processes can fail by crashing. This paper shows that anti-Ω^t is the weakest failure detector to solve t-set agreement in a t-resilient asynchronous distributed system. Each query to anti-Ω^t returns a set S of process ids, |S| = n − t, such that some correct process eventually never appears in such a set S; thus, anti-Ω^(n−1) = anti-Ω and anti-Ω^1 = Ω. In fact, the paper shows a stronger result: for any agreement task T that has no t-resilient solution, any failure detector that can be used to solve T is at least as strong as anti-Ω^t. These results are obtained through a variant of Zieliński's technique, which simplifies some of the arguments and introduces an egg-laying simulation of independent interest. The simulation provides the first detailed analysis of immediate snapshot executions in a failure-detector-enriched environment. It also yields a simple proof that Ω is the weakest failure detector to solve consensus when t = 1.
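To make the task definition above concrete, here is a minimal sketch of a checker for the two safety conditions of k-set agreement as stated in the abstract: every decided value was proposed, and at most k distinct values are decided. The function name `satisfies_k_set_agreement` and the dictionary encoding of proposals and decisions are illustrative assumptions, not part of the paper; termination and failure detectors are not modeled.

```python
from typing import Hashable, Mapping


def satisfies_k_set_agreement(
    proposals: Mapping[int, Hashable],
    decisions: Mapping[int, Hashable],
    k: int,
) -> bool:
    """Check the safety conditions of k-set agreement on one finished run.

    proposals maps each process id to the value it proposed.
    decisions maps each process id that decided to its decided value
    (crashed processes may simply be absent).
    Hypothetical helper for illustration only.
    """
    proposed_values = set(proposals.values())
    decided_values = set(decisions.values())

    # Validity: every decided value is one of the proposed values.
    validity = decided_values <= proposed_values
    # Agreement: at most k distinct values are decided.
    agreement = len(decided_values) <= k
    return validity and agreement


proposals = {0: "a", 1: "b", 2: "c"}
decisions = {0: "a", 1: "a", 2: "b"}

ok_for_2 = satisfies_k_set_agreement(proposals, decisions, k=2)
ok_for_1 = satisfies_k_set_agreement(proposals, decisions, k=1)
```

With k = 1 the same check is the safety part of consensus, which is why the run above passes for k = 2 but not for k = 1.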
