Irreducibility and additivity of set agreement-oriented failure detector classes

Solving agreement problems (such as consensus and <i>k</i>-set agreement) in asynchronous distributed systems prone to process failures has been shown to be impossible. To circumvent this impossibility, distributed oracles (also called unreliable failure detectors) have been introduced. A failure detector provides information on failures, and a failure detector class is defined by a set of abstract properties that encapsulate (and hide) synchrony assumptions. Some failure detector classes have been shown to be the weakest to solve some agreement problems (e.g., Ω is the weakest class of failure detectors that allow solving the consensus problem in asynchronous systems where a majority of processes do not crash).This paper considers several failure detector classes and focuses on their additivity or their irreducibility. It mainly investigates two families of failure detector classes (denoted ◊ <i>S</i><sub>x</sub> and ◊ φ<sup>y</sup>, 0≤ <i>x</i>, <i>y</i> ≤ <i>n</i>), shows that they can be "added" to provide a failure detector of the class Ω<sup>z</sup> (a generalization of Ω). It also characterizes the power of such an "addition", namely, ◊ <b><i>S</i></b><sub>x</sub> + ◊ φ<sup>y</sup> ➝ Ω<sup>z</sup> ⇔ <i>x</i>+<i>y</i>+<i>z</i>><i>t</i>+1, where <i>t</i> is the maximum number of processes that can crash in a run. As an example, the paper shows that, while ◊ <b><i>S</i></b><sub>t</sub> allows solving 2-set agreement (and not consensus) and ◊ φ<sup>1</sup> allows solving <i>t</i>-set agreement (but not (<i>t</i>-1)-set agreement), their "addition" allows solving consensus. More generally, the paper studies the failure detector classes ◊ <b><i>S</i></b><sub>x</sub>, ◊ φ<sup>y</sup> and Ω<sup>z</sup>, and shows which reductions among these classes are possible and which are not. The paper presents also an Ω<sup>k</sup>-based <i>k</i>-set agreement protocol. In that sense, it can be seen as a step toward the characterization of the weakest failure detector that allows solving the <i>k</i>-set agreement problem.

[1]  Achour Mostéfaoui,et al.  Leader-Based Consensus , 2001, Parallel Process. Lett..

[2]  Achour Mostéfaoui,et al.  k-set agreement with limited accuracy failure detectors , 2000, PODC '00.

[3]  Rachid Guerraoui,et al.  Non-blocking atomic commit in asynchronous distributed systems with failure detectors , 2002, Distributed Computing.

[4]  Gil Neiger,et al.  Failure Detectors and the Wait-Free Hierarchy. , 1995, ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing.

[5]  Rachid Guerraoui,et al.  (Almost) All Objects Are Universal in Message Passing Systems , 2005, DISC.

[6]  Gil Neiger Failure detectors and the wait-free hierarchy (extended abstract) , 1995, PODC '95.

[7]  Maurice Herlihy,et al.  Tight Bounds for K-set Agreement with Limited-scope Failure Detectors , 2003, PODC.

[8]  Maurice Herlihy,et al.  The topological structure of asynchronous computability , 1999, JACM.

[9]  André Schiper Early consensus in an asynchronous system with a weak failure detector , 1997, Distributed Computing.

[10]  Sam Toueg,et al.  The weakest failure detector for solving consensus , 1992, PODC '92.

[11]  Michael E. Saks,et al.  Wait-free k-set agreement is impossible: the topology of public knowledge , 1993, STOC.

[12]  Eli Gafni,et al.  Generalized FLP impossibility result for t-resilient asynchronous computations , 1993, STOC.

[13]  Achour Mostéfaoui,et al.  The combined power of conditions and failure detectors to solve asynchronous set agreement , 2005, PODC '05.

[14]  Eli Gafni,et al.  Structured derivations of consensus algorithms for failure detectors , 1998, PODC '98.

[15]  Achour Mostéfaoui,et al.  From omega to Omega: A simple bounded quiescent reliable broadcast-based transformation , 2007, J. Parallel Distributed Comput..

[16]  Soma Chaudhuri,et al.  More Choices Allow More Faults: Set Consensus Problems in Totally Asynchronous Systems , 1993, Inf. Comput..

[17]  Rachid Guerraoui,et al.  Fast Indulgent Consensus with Zero Degradation , 2002, EDCC.

[18]  Achour Mostéfaoui,et al.  Solving Consensus Using Chandra-Toueg's Unreliable Failure Detectors: A General Quorum-Based Approach , 1999, DISC.

[19]  Achour Mostéfaoui,et al.  Conditions on input vectors for consensus solvability in asynchronous distributed systems , 2003, J. ACM.

[20]  Leslie Lamport,et al.  The part-time parliament , 1998, TOCS.

[21]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1985, JACM.

[22]  Sam Toueg,et al.  Unreliable failure detectors for reliable distributed systems , 1996, JACM.

[23]  C. Gorman,et al.  PI , 2021, Encyclopedic Dictionary of Archaeology.

[24]  Rachid Guerraoui,et al.  The information structure of indulgent consensus , 2004, IEEE Transactions on Computers.

[25]  Maurice Herlihy,et al.  Tight bounds for k-set agreement with limited-scope failure detectors , 2003, PODC '03.

[26]  Francis C. Chu Reducing &Ω to ◊ W , 1998 .