iCluster: A Self-organizing Overlay Network for P2P Information Retrieval

We present iCluster, a self-organizing peer-to-peer overlay network that supports full-fledged information retrieval in a dynamic environment. iCluster organizes peers with common interests into clusters and exploits this clustering information at query time to achieve low network traffic and high recall. We define the criteria for peer similarity and peer selection, and we present the protocols for organizing peers into clusters and for searching within the clustered organization. iCluster is evaluated in a realistic peer-to-peer environment using real-world data and queries. The results demonstrate significant performance improvements (in terms of clustering efficiency, communication load, and retrieval accuracy) over a state-of-the-art peer-to-peer clustering method. Compared to exhaustive search by flooding, iCluster trades a small loss in retrieval accuracy for a much lower message load.
