Paper Information - Efficient Gossip and Robust Distributed Computation

Efficient Gossip and Robust Distributed Computation

This paper presents an efficient deterministic gossip algorithm for $p$ synchronous, crash-prone, message-passing processors. The algorithm has time complexity $T = O(\log^2 p)$ and message complexity $M = O(p^{1+\varepsilon})$, for any $\varepsilon > 0$. This substantially improves the message complexity of the previous best algorithm that has $M = O(p^{1.77})$, while maintaining the same time complexity. The strength of the new algorithm is demonstrated by constructing a deterministic algorithm for performing $n$ tasks in this distributed setting. Previous solutions used coordinator or check-pointing approaches, immediately incurring a work penalty $\Omega(n + f \cdot p)$ for $f$ crashes, or relied on strong communication primitives, such as reliable broadcast, or had work too close to the trivial $\Theta(p \cdot n)$ bound of oblivious algorithms. The new algorithm uses $p$ crash-prone processors to perform $n$ similar and idempotent tasks so long as one processor remains active. The work of the algorithm is $W = O(n + p \cdot \min\{f + 1, \log^3 p\})$ and its message complexity is $M = O(f p^{\varepsilon} + p \min\{f + 1, \log p\})$, for any $\varepsilon > 0$. This substantially improves the work complexity of previous solutions using simple point-to-point messaging, while “meeting or beating” the corresponding message complexity bounds. The new algorithms use communication graphs and permutations with certain combinatorial properties that are shown to exist. The algorithms are correct for any permutations, and in particular, the same expected bounds can be achieved using random permutations.
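To get a feel for how the bounds quoted in the abstract compare, the following is a minimal sketch that evaluates the expressions inside the asymptotic notation for concrete values of n, p, and f. It is only an illustration of growth rates, not the paper's algorithm: the constants hidden by the O, Ω, and Θ notation are dropped, the logarithm base (2) is an assumption, and all function names are hypothetical.

```python
import math


def new_work_bound(n, p, f):
    # Expression inside W = O(n + p * min{f + 1, log^3 p}); constants dropped.
    # Assumption: logarithm taken base 2 (the abstract does not fix a base).
    return n + p * min(f + 1, math.log2(p) ** 3)


def coordinator_penalty(n, p, f):
    # Expression inside the Omega(n + f * p) penalty the abstract attributes
    # to coordinator and check-pointing approaches; constants dropped.
    return n + f * p


def oblivious_work(n, p):
    # Expression inside the trivial Theta(p * n) bound: every processor
    # performs every task.
    return p * n


def new_message_bound(p, f, eps):
    # Expression inside M = O(f * p^eps + p * min{f + 1, log p}).
    return f * p ** eps + p * min(f + 1, math.log2(p))


if __name__ == "__main__":
    n, p, f = 10**6, 2**16, 60_000  # illustrative values only
    print("new work bound:      ", new_work_bound(n, p, f))
    print("coordinator penalty: ", coordinator_penalty(n, p, f))
    print("oblivious work:      ", oblivious_work(n, p))
    print("new message bound:   ", new_message_bound(p, f, eps=0.5))
```

With many crashes the min term caps the per-processor overhead at log^3 p, which is why the first expression grows more slowly than n + f * p once f exceeds log^3 p; for few crashes the two expressions are of the same order.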

Chryssis Georgiou | Dariusz R. Kowalski | Alexander A. Shvartsman
