Connecting P2P to the web: Lessons from a prototype Gnutella-WWW gateway

Purpose – The purpose of this paper is to describe a means to improve the accessibility of files across different delivery platforms, making it possible to use a single search modality. The paper shows that both peer‐to‐peer file sharing networks and the worldwide web provide extensive information resources, and either network may contain data that satisfy a searcher's information need.Design/methodology/approach – The paper proposes a gateway between the worldwide web and peer‐to‐peer networks that permits searchers on one side to be able to seamlessly search and retrieve files on the other side of the gateway. The design and prototype implementation of such a gateway to Gnutella is detailed, along with access statistics from test deployments and lessons learned.Findings – The prototype implementation was found to demonstrate the feasibility of a seamless gateway between the Gnutella network and the worldwide web. Gnutella users saw millions of web search results and initiated retrievals via the gateway ...

[1]  Daniel Stutzbach,et al.  Characterizing the two-tier gnutella topology , 2005, SIGMETRICS '05.

[2]  David Mazières,et al.  Kademlia: A Peer-to-Peer Information System Based on the XOR Metric , 2002, IPTPS.

[3]  Geoff Coulson,et al.  Free Riding on Gnutella Revisited: The Bell Tolls? , 2005, IEEE Distributed Syst. Online.

[4]  Ion Stoica,et al.  The Case for a Hybrid P2P Search Infrastructure , 2004, IPTPS.

[5]  Ian T. Foster,et al.  Mapping the Gnutella Network: Properties of Large-Scale Peer-to-Peer Systems and Implications for System Design , 2002, ArXiv.

[6]  Daniel Stutzbach,et al.  Capturing Accurate Snapshots of the Gnutella Network , 2006, INFOCOM.

[7]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[8]  Jia Wang,et al.  Analyzing peer-to-peer traffic across large networks , 2004, IEEE/ACM Trans. Netw..

[9]  Yong Yu,et al.  A Statistical Study of Today's Gnutella , 2006, APWeb.

[10]  David R. Karger,et al.  Chord: a scalable peer-to-peer lookup protocol for internet applications , 2003, TNET.

[11]  Eytan Adar,et al.  Free Riding on Gnutella , 2000, First Monday.

[12]  Adele E. Howe,et al.  SAVVYSEARCH: A Metasearch Engine That Learns Which Search Engines to Query , 1997, AI Mag..

[13]  Chi-Hang Chan,et al.  Bridging the P2P and WWW Divide with DISCOVIR - DIStributed COntent-based Visual Information Retrieval , 2003, WWW.

[14]  Ian T. Foster,et al.  Mapping the Gnutella Network , 2002, IEEE Internet Comput..

[15]  Daniel Stutzbach,et al.  Characterizing files in the modern Gnutella network , 2007, Multimedia Systems.

[16]  Krishna P. Gummadi,et al.  Measuring and analyzing the characteristics of Napster and Gnutella hosts , 2003, Multimedia Systems.

[17]  Ian T. Foster,et al.  Mapping the Gnutella Network: Macroscopic Properties of Large-Scale Peer-to-Peer Systems , 2002, IPTPS.

[18]  Mary K. Vernon,et al.  Characterizing the query behavior in peer-to-peer file sharing systems , 2004, IMC '04.

[19]  Jacky C. Chu,et al.  Availability and locality measurements of peer-to-peer file systems , 2002, SPIE ITCom.

[20]  Yong Wang,et al.  Analyzing the Characteristics of Gnutella Overlays , 2007, Fourth International Conference on Information Technology (ITNG'07).

[21]  Oren Etzioni,et al.  Multi-Engine Search and Comparison Using the MetaCrawler , 1995, World Wide Web J..

[22]  Ophir Frieder,et al.  An analysis of peer-to-peer file-sharing system queries , 2007, SIGIR.

[23]  Dragos Ilie,et al.  Gnutella Network Traffic : Measurements and Characteristics , 2006 .

[24]  Scott Shenker,et al.  Enhancing P2P File-Sharing with an Internet-Scale Query Processor , 2004, VLDB.

[25]  Arne A. Nilsson,et al.  Measurement and Analysis of Gnutella Signaling Traffic , 2004 .

[26]  Stefan Saroiu,et al.  A Measurement Study of Peer-to-Peer File Sharing Systems , 2001 .