Peer-to-Peer Clustering for Semantic Overlay Network Generation

The peer-to-peer (P2P) paradigm presents an attractive solution for applications that require scalability, fault-tolerance and autonomy. P2P systems in their basic unstructured form suffer high costs when it comes to efficiently locating content, mainly due to the lack of global knowledge. It is therefore crucial to organize content in an unsupervised way by creating groups of peers with similar content, in order to support efficient search mechanisms. In this paper, we discuss the need for content organization in unstructured P2P networks and present the requirements that must be fulfilled by any approach. We propose P2P clustering as a potential solution to Semantic Overlay Network (SON) generation for organizing P2P networks, and we present our unsupervised approach for decentralized SON creation towards this end.

[1]  Hector Garcia-Molina,et al.  Semantic Overlay Networks for P2P Systems , 2004, AP2PC.

[2]  Xueqi Cheng,et al.  WonGoo: a pure peer-to-peer full text information retrieval system based on semantic overlay networks , 2004, Third IEEE International Symposium on Network Computing and Applications, 2004. (NCA 2004). Proceedings..

[3]  Ben Y. Zhao,et al.  Tapestry: a resilient global-scale overlay for service deployment , 2004, IEEE Journal on Selected Areas in Communications.

[4]  Scott Shenker,et al.  Making gnutella-like P2P systems scalable , 2003, SIGCOMM '03.

[5]  Ian Clarke,et al.  Freenet: A Distributed Anonymous Information Storage and Retrieval System , 2000, Workshop on Design Issues in Anonymity and Unobservability.

[6]  Lakshmish Ramaswamy,et al.  Connectivity based node clustering in decentralized peer-to-peer networks , 2003, Proceedings Third International Conference on Peer-to-Peer Computing (P2P2003).

[7]  Pascal Felber,et al.  Efficient search in unstructured peer-to-peer networks , 2004, SPAA '04.

[8]  Edith Cohen,et al.  Associative search in peer to peer networks: harnessing latent semantics , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[9]  Wolfgang Müller,et al.  Classifying Documents by Distributed P2P Clustering , 2003, GI Jahrestagung.

[10]  Robert Morris,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM 2001.

[11]  Linpeng Huang,et al.  Distributed Information Retrieval Based on Hierarchical Semantic Overlay Network , 2004, GCC.

[12]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[13]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM 2001.

[14]  Christos Gkantsidis,et al.  Hybrid search schemes for unstructured peer-to-peer networks , 2005, Proceedings IEEE 24th Annual Joint Conference of the IEEE Computer and Communications Societies..

[15]  Jun Wang,et al.  A category overlay infrastructure for peer-to-peer content search , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[16]  Gerhard Weikum,et al.  p2pDating: Real life inspired semantic overlay networks for Web search , 2007, Inf. Process. Manag..

[17]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[18]  Bei Yu,et al.  Efficient semantic-based content search in P2P network , 2004, IEEE Transactions on Knowledge and Data Engineering.

[19]  Hector Garcia-Molina,et al.  Routing indices for peer-to-peer systems , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[20]  Christos Doulkeridis,et al.  DESENT: decentralized and distributed semantic overlay generation in P2P networks , 2007, IEEE Journal on Selected Areas in Communications.

[21]  Richard P. Martin,et al.  PlanetP: using gossiping to build content addressable peer-to-peer information sharing communities , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[22]  Karl Aberer,et al.  GridVine: Building Internet-Scale Semantic Overlay Networks , 2004, SEMWEB.

[23]  Mario A. Nascimento,et al.  Taxonomy-Based Routing Indices for Peer-to-Peer Networks , 2004, Workshop on Peer-to-Peer Information Retrieval.

[24]  Hector Garcia-Molina,et al.  Improving search in peer-to-peer networks , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[25]  Sandhya Dwarkadas,et al.  Peer-to-peer information retrieval using self-organizing semantic overlay networks , 2003, SIGCOMM '03.

[26]  Evangelos P. Markatos,et al.  Tracing a Large-Scale Peer to Peer System: An Hour in the Life of Gnutella , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[27]  Karl Aberer,et al.  P-Grid: a self-organizing structured P2P system , 2003, SGMD.

[28]  Peter Triantafillou,et al.  Towards High Performance Peer-to-Peer Content and Resource Sharing Systems , 2003, CIDR.

[29]  Felix Naumann,et al.  Semantic Overlay Clusters within Super-Peer Networks , 2003, DBISP2P.

[30]  Steffen Staab,et al.  Remindin': semantic query routing in peer-to-peer networks based on social metaphors , 2004, WWW '04.

[31]  Wolfgang Nejdl,et al.  Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks , 2003, WWW '03.

[32]  Peter Druschel,et al.  Pastry: Scalable, distributed object location and routing for large-scale peer-to- , 2001 .

[33]  Alexander Löser,et al.  Taxonomy-based routing overlays in P2P networks , 2004, Proceedings. International Database Engineering and Applications Symposium, 2004. IDEAS '04..

[34]  Eytan Adar,et al.  Free Riding on Gnutella , 2000, First Monday.