Node isolation model and age-based neighbor selection in unstructured P2P networks

Previous analytical studies of unstructured P2P resilience have assumed exponential user lifetimes and only con-sidered age-independent neighbor replacement. In this paper, we overcome these limitations by introducing a general node-isolation model for heavy-tailed user lifetimes and arbitrary neighbor-se-lection algorithms. Using this model, we analyze two age-biased neighbor-selection strategies and show that they significantly improve the residual lifetimes of chosen users, which dramatically reduces the probability of user isolation and graph partitioning compared with uniform selection of neighbors. In fact, the second strategy based on random walks on age-proportional graphs demonstrates that, for lifetimes with infinite variance, the system monotonically increases its resilience as its age and size grow. Specifically, we show that the probability of isolation converges to zero as these two metrics tend to infinity. We finish the paper with simulations in finite-size graphs that demonstrate the effect of this result in practice.

[1]  Rajmohan Rajaraman,et al.  Accessing Nearby Copies of Replicated Objects in a Distributed Environment , 1997, SPAA '97.

[2]  Fabián E. Bustamante,et al.  Friendships that Last: Peer Lifespan and its Role in P2P Protocols , 2003, WCW.

[3]  Robert Morris,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM 2001.

[4]  Dmitri Loguinov,et al.  Modeling Heterogeneous User Churn and Local Resilience of Unstructured P2P Networks , 2006, Proceedings of the 2006 IEEE International Conference on Network Protocols.

[5]  Harold Exton,et al.  Handbook of Hypergeometric Integrals , 1979 .

[6]  Eli Upfal,et al.  Building low-diameter peer-to-peer networks , 2003, IEEE J. Sel. Areas Commun..

[7]  Miguel Castro,et al.  Performance and dependability of structured peer-to-peer overlays , 2004, International Conference on Dependable Systems and Networks, 2004.

[8]  Ronald W. Wolff,et al.  Stochastic Modeling and the Theory of Queues , 1989 .

[9]  Anne-Marie Kermarrec,et al.  Network awareness and failure resilience in self-organizing overlay networks , 2003, 22nd International Symposium on Reliable Distributed Systems, 2003. Proceedings..

[10]  Dmitri Loguinov,et al.  Residual-Based Measurement of Peer and Link Lifetimes in Gnutella Networks , 2007, IEEE INFOCOM 2007 - 26th IEEE International Conference on Computer Communications.

[11]  Dmitri Loguinov,et al.  On lifetime-based node failure and stochastic resilience of decentralized peer-to-peer networks , 2007, TNET.

[12]  Krishna P. Gummadi,et al.  The impact of DHT routing geometry on resilience and proximity , 2003, SIGCOMM '03.

[13]  Harold Exton,et al.  Handbook of Hypergeometric Integrals: Theory, Applications, Tables, Computer Programs , 1978 .

[14]  Dan Rubenstein,et al.  Can unstructured P2P protocols survive flash crowds? , 2005, IEEE/ACM Transactions on Networking.

[15]  Carl D. Meyer,et al.  Matrix Analysis and Applied Linear Algebra , 2000 .

[16]  Simon S. Lam,et al.  Failure recovery for structured P2P networks: protocol design and performance evaluation , 2004, SIGMETRICS '04/Performance '04.

[17]  David R. Karger,et al.  Koorde: A Simple Degree-Optimal Distributed Hash Table , 2003, IPTPS.

[18]  Walter Willinger,et al.  Proof of a fundamental result in self-similar traffic modeling , 1997, CCRV.

[19]  Ramesh Govindan,et al.  Incrementally improving lookup latency in distributed hash table systems , 2003, SIGMETRICS '03.

[20]  John Kubiatowicz,et al.  Handling churn in a DHT , 2004 .

[21]  Dmitri Loguinov,et al.  On Lifetime-Based Node Failure and Stochastic Resilience of Decentralized Peer-to-Peer Networks , 2005, IEEE/ACM Transactions on Networking.

[22]  S. Resnick Adventures in stochastic processes , 1992 .

[23]  Ben Y. Zhao,et al.  Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks , 2005, IPTPS.

[24]  Bruce M. Maggs,et al.  The feasibility of supporting large-scale live streaming applications with dynamic application end-points , 2004, SIGCOMM 2004.

[25]  Scott Shenker,et al.  Making gnutella-like P2P systems scalable , 2003, SIGCOMM '03.

[26]  Scott Shenker,et al.  Minimizing churn in distributed systems , 2006, SIGCOMM.

[27]  Mudhakar Srivatsa,et al.  Large Scaling Unstructured Peer-to-Peer Networks with Heterogeneity-Aware Topology and Routing , 2006, IEEE Transactions on Parallel and Distributed Systems.

[28]  Mark Handley,et al.  Topologically-aware overlay construction and server selection , 2002, Proceedings.Twenty-First Annual Joint Conference of the IEEE Computer and Communications Societies.

[29]  Anja Feldmann,et al.  Fitting Mixtures of Exponentials to Long-Tail Distributions to Analyze Network , 1998, Perform. Evaluation.

[30]  Robert Tappan Morris,et al.  Bandwidth-efficient management of DHT routing tables , 2005, NSDI.

[31]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[32]  Masaaki Kijima,et al.  Markov processes for stochastic modeling , 1997 .

[33]  Ming Zhong,et al.  Non-uniform random membership management in peer-to-peer networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[34]  Robert Tappan Morris,et al.  Comparing the Performance of Distributed Hash Tables Under Churn , 2004, IPTPS.

[35]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM 2001.

[36]  Christos Gkantsidis,et al.  Random walks in peer-to-peer networks , 2004, IEEE INFOCOM 2004.

[37]  David R. Karger,et al.  Analysis of the evolution of peer-to-peer systems , 2002, PODC '02.

[38]  Thomas P. Hettmansperger,et al.  Tailweight, Statistical Inference and Families of Distributions — A Brief Survey , 1975 .

[39]  David Mazières,et al.  Kademlia: A Peer-to-Peer Information System Based on the XOR Metric , 2002, IPTPS.

[40]  Paul Francis,et al.  On Heterogeneous Overlay Construction and Random Node Selection in Unstructured P2P Networks , 2006, Proceedings IEEE INFOCOM 2006. 25TH IEEE International Conference on Computer Communications.

[41]  Ramesh Govindan,et al.  Improving lookup latency in distributed hash table systems using random sampling , 2005, IEEE/ACM Transactions on Networking.

[42]  Dmitri Loguinov,et al.  On static and dynamic partitioning behavior of large-scale networks , 2005, 13TH IEEE International Conference on Network Protocols (ICNP'05).

[43]  Ben Y. Zhao,et al.  Tapestry: a resilient global-scale overlay for service deployment , 2004, IEEE Journal on Selected Areas in Communications.

[44]  Paul Francis,et al.  Chunkyspread: Multi-tree Unstructured Peer-to-Peer Multicast , 2006, IPTPS.

[45]  Seif Haridi,et al.  A Statistical Theory of Chord Under Churn , 2005, IPTPS.

[46]  William Feller,et al.  An Introduction to Probability Theory and Its Applications , 1951 .

[47]  László Lovász,et al.  Random Walks on Graphs: A Survey , 1993 .

[48]  Bernhard Schölkopf,et al.  Learning from labeled and unlabeled data on a directed graph , 2005, ICML.