Towards a Better Understanding of Churn in Peer-to-Peer Networks

The dynamics of peer participation, or churn, are an inherent property of Peer-to-Peer (P2P) systems that should be incorporated in both the design and evaluation of P2P systems. This requires a proper characterization of churn in real-world P2P networks. However, the few previous measurement-based studies on the characterization of P2P systems have used either unrepresentative group of peers or coarse-grain mea- surements. In this extended abstract, we characterize churn in the Gnutella network based on fine-grained monitoring of the entire population. We developed a new crawler that can capture a complete snapshot of Gnutella network within a few minutes. This not only improves the accuracy, by reducing distortion in captured snap- shots, but it also increase the granularity of captured dy- namics. We present our preliminary characterizations of peer uptime and discuss their implications. In par- ticular, we show that peer uptime follows a power-law distribution rather than the commonly assumed Poisson distribution. In a nutshell, a large portion of up peers are highly stable, yet the remaining peers turnover very quickly. Keywords—Peer-to-Peer, Churn, Characterization, Gnutella

[1]  Thomas E. Anderson,et al.  A Comparison of File System Workloads , 2000, USENIX Annual Technical Conference, General Track.

[2]  John Kubiatowicz,et al.  Handling churn in a DHT , 2004 .

[3]  Jacky C. Chu,et al.  Availability and locality measurements of peer-to-peer file systems , 2002, SPIE ITCom.

[4]  Daniel Stutzbach,et al.  Evaluating the Accuracy of Captured Snapshots by Peer-to-Peer Crawlers , 2005, PAM.

[5]  Robert Tappan Morris,et al.  Comparing the Performance of Distributed Hash Tables Under Churn , 2004, IPTPS.

[6]  Krishna P. Gummadi,et al.  Measuring and analyzing the characteristics of Napster and Gnutella hosts , 2003, Multimedia Systems.

[7]  David R. Karger,et al.  Analysis of the evolution of peer-to-peer systems , 2002, PODC '02.

[8]  Ian T. Foster,et al.  Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design , 2002, ArXiv.

[9]  Jia Wang,et al.  Analyzing peer-to-peer traffic across large networks , 2004, IEEE/ACM Trans. Netw..

[10]  Miguel Castro,et al.  Controlling the Cost of Reliability in Peer-to-Peer Overlays , 2003, IPTPS.

[11]  Krishna P. Gummadi,et al.  Measurement, modeling, and analysis of a peer-to-peer file-sharing workload , 2003, SOSP '03.

[12]  Jia Wang,et al.  Analyzing peer-to-peer traffic across large networks , 2002, IMW '02.

[13]  Scott Shenker,et al.  Making gnutella-like P2P systems scalable , 2003, SIGCOMM '03.

[14]  Stefan Savage,et al.  Understanding Availability , 2003, IPTPS.