论文信息 - The perfectly synchronized round-based model of distributed computing

The perfectly synchronized round-based model of distributed computing

The perfectly synchronized round-based model provides the powerful abstraction of crash-stop failures with atomic and synchronous message delivery. This abstraction makes distributed programming very easy. We describe a technique to automatically transform protocols devised in the perfectly synchronized round-based model into protocols for the crash, send omission, general omission or Byzantine models. Our transformation is achieved using a round shifting technique with a constant time complexity overhead. The overhead depends on the target model: crashes, send omissions, general omissions or Byzantine failures. Rather surprisingly, we show that no other automatic non-uniform transformation from a weaker model, say from the traditional crash-stop model (with no atomic message delivery), onto an even stronger model than the general-omission one, say the send-omission model, can provide a better time complexity performance in a failure-free execution.

Rachid Guerraoui | Carole Delporte-Gallet | Hugues Fauconnier | Bastian Pochon

[1] Sape Mullender,et al. Distributed systems , 1989 .

[2] Leslie Lamport,et al. The Byzantine Generals Problem , 1982, TOPL.

[3] Danny Dolev,et al. 'Eventual' is earlier than 'immediate' , 1982, 23rd Annual Symposium on Foundations of Computer Science (sfcs 1982).

[4] Yoram Moses,et al. Programming simultaneous actions using common knowledge , 1987, Algorithmica.

[5] Nancy A. Lynch,et al. A Lower Bound for the Time to Assure Interactive Consistency , 1982, Inf. Process. Lett..

[6] Marcel-Catalin Rosu. Early-stopping Terminating Reliable Broadcast protocol for general-omission failures , 1996, PODC '96.

[7] Nancy A. Lynch,et al. Impossibility of distributed consensus with one faulty process , 1985, JACM.

[8] Sam Toueg,et al. Unreliable failure detectors for reliable distributed systems , 1996, JACM.

[9] Yoram Moses,et al. Fully Polynomial Byzantine Agreement for n > 3t Processors in t + 1 Rounds , 1998, SIAM J. Comput..

[10] Leslie Lamport,et al. Time, clocks, and the ordering of events in a distributed system , 1978, CACM.

[11] Rachid Guerraoui. On the hardness of failure-sensitive agreement problems , 2001, Inf. Process. Lett..

[12] Gil Neiger,et al. Automatically Increasing the Fault-Tolerance of Distributed Algorithms , 1990, J. Algorithms.

[13] Michel Raynal. Consensus in synchronous systems: a concise guided tour , 2002, 2002 Pacific Rim International Symposium on Dependable Computing, 2002. Proceedings..

[14] Fred B. Schneider,et al. Replication management using the state-machine approach , 1993 .

[15] Rida A. Bazzi,et al. Simplifying fault-tolerance: providing the abstraction of crash failures , 2001, JACM.

[16] Rachid Guerraoui,et al. Synchronous system and perfect failure detector: Solvability and efficiency issues , 2000, Proceeding International Conference on Dependable Systems and Networks. DSN 2000.

[17] Danny Dolev,et al. Early stopping in Byzantine agreement , 1990, JACM.

[18] Rachid Guerraoui,et al. A Note on Set Agreement with Omission Failures , 2003, Electron. Notes Theor. Comput. Sci..

[19] Leslie Lamport,et al. Reaching Agreement in the Presence of Faults , 1980, JACM.