Availability and Popularity Measurements of Peer-to-Peer File Systems

Although peer-to-peer networking applications continue t o increase in popularity, there have been few measurement studi e of their performance. We present the first study of the popularity of files stored and transferred among peers in Napste r and Gnutella over month-long periods. Our analysis indicat es that the popularity of files is skewed in all four cases and fits well to a log-quadratic distribution. This predicts that ca ches of the most popular songs would increase performance of the sys tem. We also took baseline measurements of file types and size s for comparison over time with future studies. Not surprisin gly, audio files are most popular, however a significant fraction o f stored data is occupied by videos. Finally, we measured the distribution of time peers in Gnutella were available for do wnloading. We found that node availability is strongly influen c d by time-of-day effects, and that most user’s tend to be avail ble for only very short contiguous lengths of time.

[1]  Yakov Rekhter,et al.  Address Allocation for Private Internets , 1994, RFC.

[2]  Li Fan,et al.  Web caching and Zipf-like distributions: evidence and implications , 1999, IEEE INFOCOM '99. Conference on Computer Communications. Proceedings. Eighteenth Annual Joint Conference of the IEEE Computer and Communications Societies. The Future is Now (Cat. No.99CH36320).

[3]  J. Frankel,et al.  The gnutella protocol specification v0.4 document revision 1.2 , 2000 .

[4]  Eytan Adar,et al.  Free Riding on Gnutella , 2000, First Monday.

[5]  Brian Neil Levine,et al.  Evaluation of a novel two-step server selection metric , 2001, Proceedings Ninth International Conference on Network Protocols. ICNP 2001.

[6]  Stefan Saroiu,et al.  A Measurement Study of Peer-to-Peer File Sharing Systems , 2001 .

[7]  Jacky C. Chu,et al.  Availability and locality measurements of peer-to-peer file systems , 2002, SPIE ITCom.

[8]  Evangelos P. Markatos,et al.  Tracing a Large-Scale Peer to Peer System: An Hour in the Life of Gnutella , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[9]  Ian T. Foster,et al.  Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design , 2002, ArXiv.

[10]  Nathaniel Leibowitz,et al.  ARE FILE SWAPPING NETWORKS CACHEABLE? CHARACTERIZING P2P TRAFFIC , 2002 .

[11]  Donald F. Towsley,et al.  Modeling peer-peer file sharing systems , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).