A measure for cluster cohesion in semantic overlay networks

Semantic overlay networks cluster peers that are semantically, thematically or socially close into groups by means of a rewiring procedure that is periodically executed by each peer. Rewiring proceeds by establishing new connections to similar peers, and by discarding connections that are outdated or pointing to dissimilar peers. This process aims at improving cluster quality (how well peers with similar content are clustered together) and by this, at improving the flow of information in the network by reducing the number of messages that are exchanged. Therefore, measuring the quality of clustering is an important issue by itself. This is exactly the issue this work is dealing with. In this paper, we introduce a new clustering measure that takes into account the whole neighborhood of a peer (rather than its direct neighbors) thus, providing better insight on the quality of the underlying clustered organisation. Our experimental evaluation with real-word data and queries confirms our assumption that the new measure is better suited for measuring clustering quality than other known measures, such as the (generalised) clustering coefficient.

[1]  Hassan Charaf,et al.  Analytical model for semantic overlay networks in peer-to-peer systems , 2006, ICSE 2006.

[2]  David D. Jensen,et al.  Graph clustering with network structure indices , 2007, ICML '07.

[3]  Evaggelia Pitoura,et al.  Recall-based cluster reformulation by selfish peers , 2008, 2008 IEEE 24th International Conference on Data Engineering Workshop.

[4]  Chi-Hang Chan,et al.  Advanced Peer Clustering and Firework Query Model in the Peer-to-Peer Network , 2003, WWW.

[5]  Euripides G. M. Petrakis,et al.  Information Retrieval and Filtering over Self-organising Digital Libraries , 2008, ECDL.

[6]  Jie Lu,et al.  Content-based retrieval in hybrid peer-to-peer networks , 2003, CIKM '03.

[7]  Ulrik Brandes,et al.  Engineering graph clustering: Models and experimental evaluation , 2008, JEAL.

[8]  Christoph Schmitz Self-Organization of a Small World by Topic , 2004, LWA.

[9]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[10]  Anand Sivasubramaniam,et al.  Semantic small world: an overlay network for peer-to-peer search , 2004, Proceedings of the 12th IEEE International Conference on Network Protocols, 2004. ICNP 2004..

[11]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[12]  Christoph Tempich,et al.  On Ranking Peers in Semantic Overlay Networks , 2005, Wissensmanagement.

[13]  Jie Wu,et al.  Small Worlds: The Dynamics of Networks between Order and Randomness , 2003 .

[14]  Christos Doulkeridis,et al.  Scalable Semantic Overlay Generation for P2P-Based Digital Libraries , 2006, ECDL.

[15]  Peter Triantafillou,et al.  Towards High Performance Peer-to-Peer Content and Resource Sharing Systems , 2003, CIDR.

[16]  Felix Naumann,et al.  Semantic Overlay Clusters within Super-Peer Networks , 2003, DBISP2P.

[17]  David K. Y. Yau,et al.  Small-world overlay P2P networks: Construction, management and handling of dynamic flash crowds , 2006, Comput. Networks.

[18]  Bruce M. Maggs,et al.  Efficient content location using interest-based locality in peer-to-peer systems , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[19]  A. Fronczak,et al.  Higher order clustering coefficients in Barabási–Albert networks , 2002, cond-mat/0212237.

[20]  Gerhard Weikum,et al.  p2pDating: Real life inspired semantic overlay networks for Web search , 2007, Inf. Process. Manag..

[21]  Alex Hansen,et al.  A quantitative measure for path structures of complex networks , 2007 .

[22]  Jim Dowling,et al.  Discovery of Stable Peers in a Self-organising Peer-to-Peer Gradient Topology , 2006, DAIS.

[23]  Pavel Zezula,et al.  Adaptive Approximate Similarity Searching through Metric Social Networks , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[24]  Hector Garcia-Molina,et al.  Efficient search in peer to peer networks , 2004 .

[25]  W. Bruce Croft,et al.  Cluster-based language models for distributed retrieval , 1999, SIGIR '99.

[26]  Sandhya Dwarkadas,et al.  Peer-to-peer information retrieval using self-organizing semantic overlay networks , 2003, SIGCOMM '03.

[27]  Euripides G. M. Petrakis,et al.  iCluster: A Self-organizing Overlay Network for P2P Information Retrieval , 2008, ECIR.

[28]  Tore Risch,et al.  EDUTELLA: a P2P networking infrastructure based on RDF , 2002, WWW.

[29]  Joemon M. Jose,et al.  An architecture for information retrieval over semi-collaborating Peer-to-Peer networks , 2004, SAC '04.