Reach High Performance Consensus on a Multicasting Network with Dual Failure Mode

The distributed consensus problem is one of the most important problems in designing a fault-tolerant distributed system. In prior work, the consensus problem was solved in a Fully Connected Network (FCN) or a Generalized Connectivity Network (GCN) with fallible communication media, at a cost of two rounds of message exchange. However, these network topologies are not practical in the real world. We therefore revisit the consensus problem in a MultiCasting Network (MCN), which is the most practical network topology in real-world settings, and propose two new protocols: Relay Fault-Tolerance Channel (RFC) and Efficient MultiCasting Agreement Protocol (EMAP). Using RFC, only one round of message exchange is needed to gather enough information in an FCN, BCN, GCN, or MCN. We also enlarge the fault-tolerance capability by allowing both dormant faults and malicious faults to exist in a multicasting network. The proposed protocols use only one round of message exchange and can tolerate the maximum number of allowable faulty communication media.
