Peer counting and sampling in overlay networks: random walk methods

In this article we address the problem of counting the number of peers in a peer-to-peer system, and more generally of aggregating statistics of individual peers over the whole system. This functionality is useful in many applications, but hard to achieve when each node has only a limited, local knowledge of the whole system. We propose two generic techniques to solve this problem. The Random Tour method is based on the return time of a continuous time random walk to the node originating the query. The Sample and Collide method is based on counting the number of random samples gathered until a target number of redundant samples are obtained. It is inspired by the "birthday paradox" technique of [6], upon which it improves by achieving a target variance with fewer samples. The latter method relies on a sampling sub-routine which returns randomly chosen peers. Such a sampling algorithm is of independent interest. It can be used, for instance, for neighbour selection by new nodes joining the system. We use a continuous time random walk to obtain such samples. We analyse the complexity and accuracy of the two methods. We illustrate in particular how expansion properties of the overlay affect their performance.

[1]  Thomas L. Saaty,et al.  Elements of queueing theory , 2003 .

[2]  N. S. Barnett,et al.  Private communication , 1969 .

[3]  Upendra Dave,et al.  Applied Probability and Queues , 1987 .

[4]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[5]  Noga Alon,et al.  The Probabilistic Method , 2015, Fundamentals of Ramsey Theory.

[6]  T. Lindvall Lectures on the Coupling Method , 1992 .

[7]  J. Bolot,et al.  Scalable feedback control for multicast video distribution in the Internet , 1994, SIGCOMM.

[8]  B. Mohar Some applications of Laplace eigenvalues of graphs , 1997 .

[9]  Ernst W. Biersack,et al.  Optimal multicast feedback , 1998, Proceedings. IEEE INFOCOM '98, the Conference on Computer Communications. Seventeenth Annual Joint Conference of the IEEE Computer and Communications Societies. Gateway to the 21st Century (Cat. No.98.

[10]  Donald F. Towsley,et al.  Multicast session membership size estimation , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[11]  Albert-László Barabási,et al.  Statistical mechanics of complex networks , 2001, ArXiv.

[12]  Moni Naor,et al.  Viceroy: a scalable and dynamic emulation of the butterfly , 2002, PODC '02.

[13]  Eitan Altman,et al.  Optimal on-line estimation of the size of a dynamic multicast group , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[14]  Mike Preufl On Obtaining Global Information in a Peer-to-Peer Fully Distributed Environment ??? , 2002 .

[15]  Mike Preuss,et al.  On Obtaining Global Information in a Peer-to-Peer Fully Distributed Environment (Research Note) , 2002, Euro-Par.

[16]  Miguel Castro,et al.  Secure routing for structured peer-to-peer overlay networks , 2002, OSDI '02.

[17]  Anne-Marie Kermarrec,et al.  Peer-to-Peer Membership Management for Gossip-Based Protocols , 2003, IEEE Trans. Computers.

[18]  Dahlia Malkhi,et al.  Estimating network size from local information , 2003, Information Processing Letters.

[19]  Gurmeet Singh Manku,et al.  Routing networks for distributed hash tables , 2003, PODC '03.

[20]  Anne-Marie Kermarrec,et al.  Lightweight probabilistic broadcast , 2003, TOCS.

[21]  M,et al.  Estimating Aggregates on a Peer-to-Peer Network , 2003 .

[22]  Indranil Gupta Practical Algorithms for Size Estimation in Large and Dynamic Groups , 2004 .

[23]  Miguel Castro,et al.  Security for Structured Peer-to-peer Overlay Networks , 2004 .

[24]  A Robust Aggregation Tree on Distributed Hash Tables , 2004 .

[25]  Márk Jelasity,et al.  Epidemic-style proactive aggregation in large overlay networks , 2004, 24th International Conference on Distributed Computing Systems, 2004. Proceedings..

[26]  Donald F. Towsley,et al.  The effect of network topology on the spread of epidemics , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[27]  Indranil Gupta,et al.  Decentralized Schemes for Size Estimation in Large and Dynamic Groups , 2005, Fourth IEEE International Symposium on Network Computing and Applications.

[28]  Stephen P. Boyd,et al.  Gossip algorithms: design, analysis and applications , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[29]  Anne-Marie Kermarrec,et al.  Efficient application-level multicast on a network-aware self-organizing overlay , 2005 .

[30]  Anne-Marie Kermarrec,et al.  Peer to peer size estimation in large and dynamic networks: A comparative study , 2006, 2006 15th IEEE International Conference on High Performance Distributed Computing.

[31]  N. Linial,et al.  Expander Graphs and their Applications , 2006 .

[32]  D. Dolev,et al.  On multicast trees: structure and size estimation , 2006, IEEE/ACM Transactions on Networking.

[33]  Anne-Marie Kermarrec,et al.  Gossip-based peer sampling , 2007, TOCS.