Agreement in Epidemic Data Aggregation

Computing and spreading global information in large-scale distributed systems pose significant challenges when scalability, parallelism, resilience and consistency are demanded. Epidemic protocols are a robust and scalable computing and communication paradigm that can be effectively used for information dissemination and data aggregation in a fully decentralised context where each network node requires the local computation of a global synopsis function. Theoretical analysis of epidemic protocols for synchronous and static network models provide guarantees on the convergence to a global target and on the consistency among the network nodes. However, practical applications in real-world networks may require the explicit detection of both local convergence and global agreement (consensus). This work introduces the Epidemic Consensus Protocol (ECP) for the determination of consensus on the convergence of a decentralised data aggregation task. ECP adopts a heuristic method to locally detect convergence of the aggregation task and stochastic phase transitions to detect global agreement and reach consensus. The performance of ECP has been investigated by means of simulations and compared to a tree-based Three-Phase Commit protocol (3PC). Although, as expected, ECP exhibits total communication costs greater than the optimal tree-based protocol, it is shown to have better performance and scalability properties; ECP can achieve faster convergence to consensus for large system sizes and inherits the intrinsic decentralisation, fault-tolerance and robustness properties of epidemic protocols.

[1]  Rachid Guerraoui,et al.  The Generic Consensus Service , 2001, IEEE Trans. Software Eng..

[2]  Rachid Guerraoui,et al.  Consensus in Asynchronous Distributed Systems: A Concise Guided Tour , 1999, Advances in Distributed Systems.

[3]  Richard M. Karp,et al.  Randomized rumor spreading , 2000, Proceedings 41st Annual Symposium on Foundations of Computer Science.

[4]  Jeremy Clark,et al.  SoK: Research Perspectives and Challenges for Bitcoin and Cryptocurrencies , 2015, 2015 IEEE Symposium on Security and Privacy.

[5]  Márk Jelasity,et al.  Spanning Tree or Gossip for Aggregation: A Comparative Study , 2014, Euro-Par.

[6]  Dominique Gaïti,et al.  Decentralized Aggregation Protocols in Peer-to-Peer Networks: A Survey , 2009, MACE.

[7]  Gottfried Vossen,et al.  Transactional Information Systems: Theory, Algorithms, and the Practice of Concurrency Control and Recovery , 2002 .

[8]  Giuseppe Di Fatta,et al.  Agreement in Epidemic Information Dissemination , 2016, IDCS.

[9]  Giancarlo Fortino,et al.  Fault tolerant decentralised K-Means clustering for asynchronous large-scale networks , 2013, J. Parallel Distributed Comput..

[10]  Wei Ren,et al.  Information consensus in multivehicle cooperative control , 2007, IEEE Control Systems.

[11]  Shanika Karunasekera,et al.  Impacts of Asynchrony on Epidemic-Style Aggregation Protocols , 2010, 2010 IEEE 16th International Conference on Parallel and Distributed Systems.

[12]  Yin Chen,et al.  Corrective consensus: Converging to the exact average , 2010, 49th IEEE Conference on Decision and Control (CDC).

[13]  Shanika Karunasekera,et al.  Gossip-based asynchronous and robust aggregation protocol — A pessimistic approach , 2011, 2011 IEEE Consumer Communications and Networking Conference (CCNC).

[14]  Sanjay Ranka,et al.  Aggregation methods for large-scale sensor networks , 2008, TOSN.

[15]  John N. Tsitsiklis,et al.  Convergence Speed in Distributed Consensus and Averaging , 2009, SIAM J. Control. Optim..

[16]  G. Fortino,et al.  Symmetric Push-Sum Protocol for Decentralised Aggregation , 2011, AP2PS 2011.

[17]  Giuseppe Di Fatta,et al.  Convergence Detection in Epidemic Aggregation , 2013, Euro-Par Workshops.

[18]  Gianluca Iannaccone,et al.  Has internet delay gotten better or worse? , 2010, CFI.

[19]  Márk Jelasity,et al.  Gossip-based aggregation in large dynamic networks , 2005, TOCS.

[20]  Márk Jelasity,et al.  Epidemic-style proactive aggregation in large overlay networks , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[21]  Paulo Sérgio Almeida,et al.  Dependability in Aggregation by Averaging , 2010, ArXiv.

[22]  Jun Ye Yu,et al.  Performance comparison of randomized gossip, broadcast gossip and collection tree protocol for distributed averaging , 2013, 2013 5th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[23]  Márk Jelasity,et al.  PeerSim: A scalable P2P simulator , 2009, 2009 IEEE Ninth International Conference on Peer-to-Peer Computing.

[24]  Giuseppe Di Fatta,et al.  Epidemic failure detection and consensus for extreme parallelism , 2018, Int. J. High Perform. Comput. Appl..

[25]  Johannes Gehrke,et al.  Gossip-based computation of aggregate information , 2003, 44th Annual IEEE Symposium on Foundations of Computer Science, 2003. Proceedings..

[26]  Thomas Naughton,et al.  A Log-Scaling Fault Tolerant Agreement Algorithm for a Fault Tolerant MPI , 2011, EuroMPI.

[27]  Nancy A. Lynch,et al.  Impossibility of distributed consensus with one faulty process , 1985, JACM.

[28]  Sam Toueg,et al.  Unreliable failure detectors for reliable distributed systems , 1996, JACM.

[29]  Nancy A. Lynch,et al.  Consensus in the presence of partial synchrony , 1988, JACM.