Robust and Speculative Byzantine Randomized Consensus with Constant Time Complexity in Normal Conditions

Randomized Byzantine Consensus can be an interesting building block in the implementation of asynchronous distributed systems. Despite its exponential worst-case complexity, which would make it less appealing in practice, a few experimental works have argued quite the opposite. To bridge the gap between theory and practice, we analyze a well-known state-of-the-art algorithm in normal system conditions, in which crash failures may occur but no malicious attacks, proving that it is fast on average. We then leverage our analysis to improve its best-case complexity from three to two phases, by reducing the communication operations through speculative executions. Our findings are confirmed through an experimental validation.

[1]  Michael Ben-Or,et al.  Another advantage of free choice (Extended Abstract): Completely asynchronous agreement protocols , 1983, PODC '83.

[2]  Leslie Lamport,et al.  The Byzantine Generals Problem , 1982, TOPL.

[3]  Paulo Veríssimo,et al.  Randomized Consensus in Wireless Environments: A Case Where More is Better , 2010, 2010 Third International Conference on Dependability.

[4]  P. A. P. Moran,et al.  An introduction to probability theory , 1968 .

[5]  Sam Toueg,et al.  Asynchronous consensus and broadcast protocols , 1985, JACM.

[6]  Hagit Attiya,et al.  Lower bounds for randomized consensus under a weak adversary , 2008, PODC '08.

[7]  Rachid Guerraoui,et al.  On the complexity of asynchronous gossip , 2008, PODC '08.

[8]  Gabriel Bracha,et al.  An asynchronous [(n - 1)/3]-resilient consensus protocol , 1984, PODC '84.

[9]  Miguel Correia,et al.  Turquois: Byzantine consensus in wireless ad hoc networks , 2010, 2010 IEEE/IFIP International Conference on Dependable Systems & Networks (DSN).

[10]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1983, PODS '83.

[11]  Feller William,et al.  An Introduction To Probability Theory And Its Applications , 1950 .

[12]  Miguel Correia,et al.  Experimental Comparison of Local and Shared Coin Randomized Consensus Protocols , 2006, 2006 25th IEEE Symposium on Reliable Distributed Systems (SRDS'06).

[13]  Péter Urbán,et al.  Chasing the FLP impossibility result in a LAN: or, How robust can a fault tolerant server be? , 2001, Proceedings 20th IEEE Symposium on Reliable Distributed Systems.

[14]  W. L. Nicholson,et al.  On the Normal Approximation to the Hypergeometric Distribution , 1956 .

[15]  Eli Upfal,et al.  Probability and Computing: Randomized Algorithms and Probabilistic Analysis , 2005 .

[16]  Ran Canetti,et al.  Fast asynchronous Byzantine agreement with optimal resilience , 1993, STOC.

[17]  Michael J. Fischer,et al.  The Consensus Problem in Unreliable Distributed Systems (A Brief Survey) , 1983, FCT.

[18]  James Aspnes,et al.  Fast deterministic consensus in a noisy environment , 2000, PODC '00.

[19]  Danny Dolev,et al.  An almost-surely terminating polynomial protocol for asynchronous byzantine agreement with optimal resilience , 2008, PODC '08.

[20]  Sam Toueg,et al.  A Modular Approach to Fault-Tolerant Broadcasts and Related Problems , 1994 .