An ant-based approach to cluster peers in P2P database systems

Peer-to-Peer (P2P) systems with distinct advantages such as scalability, robustness and low cost have developed quickly in recent years. The resource search problem, that is to find out a peer having a desired resource, is a fundamental problem to be initially solved in most P2P applications. P2P systems can also be categorized under database systems. One of the most important problems of P2P database systems is the query processing algorithms. Even a rather small improvement over these algorithms can have a significant impact on the performance of a P2P database system. One of the most effective methods to reduce query processing cost is peer clustering. Hence, clustering peers with similar contents makes the queries directed to the appropriate cluster and thus to be efficiently answered. In this paper, we present a new peer clustering algorithm for P2P database systems that uses ant-based clustering algorithms. We have adjusted ant-based clustering algorithm to peer clustering problem. Performance evaluation of the algorithm shows its strength in clustering peers in P2P database systems to be of a higher quality and accuracy compared to the existing systems, to have a relatively short running time and to quickly converge to an optimum. Moreover, our algorithm achieves a high scalability to the increasing problem size.

[1]  Mehdi Goli,et al.  A new vertical fragmentation algorithm based on ant collective behavior in distributed database systems , 2011, Knowledge and Information Systems.

[2]  Gerhard Weikum,et al.  p2pDating: Real life inspired semantic overlay networks for Web search , 2007, Inf. Process. Manag..

[3]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[4]  Verena Kantere,et al.  Coordinating Peer Databases Using ECA Rules , 2003, DBISP2P.

[5]  Wolfgang Ziegler,et al.  Swarm Intelligence From Natural To Artificial Systems , 2016 .

[6]  Patrick Valduriez,et al.  Data Management in Large-Scale P2P Systems , 2004, VECPAR.

[7]  Richard P. Martin,et al.  PlanetP: using gossiping to build content addressable peer-to-peer information sharing communities , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[8]  Karl Aberer,et al.  GridVine: Building Internet-Scale Semantic Overlay Networks , 2004, SEMWEB.

[9]  A.H. Channa,et al.  Texture Segmentation using Ant Tree Clustering , 2006, 2006 IEEE International Conference on Engineering of Intelligent Systems.

[10]  Marco Dorigo,et al.  Swarm intelligence: from natural to artificial systems , 1999 .

[11]  Julia Handl,et al.  Ant-based and swarm-based clustering , 2007, Swarm Intelligence.

[12]  Jun Wang,et al.  A category overlay infrastructure for peer-to-peer content search , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[13]  Pascale Kuntz,et al.  A Stochastic Heuristic for Visualising Graph Clusters in a Bi-Dimensional Space Prior to Partitioning , 1999, J. Heuristics.

[14]  Karl Aberer,et al.  P-Grid: a self-organizing structured P2P system , 2003, SGMD.

[15]  Baldo Faieta,et al.  Diversity and adaptation in populations of clustering ants , 1994 .

[16]  Ajith Abraham,et al.  Web usage mining using artificial ant colony clustering and linear genetic programming , 2003, The 2003 Congress on Evolutionary Computation, 2003. CEC '03..

[17]  Seyed Mohammad Taghi Rouhani Rankoohi,et al.  A new ant colony optimization based algorithm for data allocation problem in distributed databases , 2009, Knowledge and Information Systems.

[18]  Peter Triantafillou,et al.  Towards High Performance Peer-to-Peer Content and Resource Sharing Systems , 2003, CIDR.

[19]  Felix Naumann,et al.  Semantic Overlay Clusters within Super-Peer Networks , 2003, DBISP2P.

[20]  Julia Handl,et al.  Improved Ant-Based Clustering and Sorting , 2002, PPSN.

[21]  Steffen Staab,et al.  Remindin': semantic query routing in peer-to-peer networks based on social metaphors , 2004, WWW '04.

[22]  Philippe Fournier-Viger,et al.  Data Mining and Multi-agent Integration , 2009 .

[23]  Daniel Merkle,et al.  Decentralized packet clustering in networks , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[24]  Mohamed S. Kamel,et al.  An aggregated clustering approach using multi-ant colonies algorithms , 2006, Pattern Recognit..

[25]  Phokion G. Kolaitis Schema mappings, data exchange, and metadata management , 2005, PODS '05.

[26]  Juan Julián Merelo Guervós,et al.  Parallel Problem Solving from Nature - PPSN IX: 9th International Conference, Reykjavik, Iceland, September 9-13, 2006, Proceedings , 2006, PPSN.

[27]  Karl Aberer,et al.  Databases, Information Systems, and Peer-to-Peer Computing , 2003, Lecture Notes in Computer Science.

[28]  Federica Mandreoli,et al.  SUNRISE: Exploring PDMS Networks with Semantic Routing Indexes , 2007 .

[29]  Lakshmish Ramaswamy,et al.  Connectivity based node clustering in decentralized peer-to-peer networks , 2003, Proceedings Third International Conference on Peer-to-Peer Computing (P2P2003).

[30]  Pascal Felber,et al.  Efficient search in unstructured peer-to-peer networks , 2004, SPAA '04.

[31]  Edith Cohen,et al.  Associative search in peer to peer networks: harnessing latent semantics , 2003, IEEE INFOCOM 2003. Twenty-second Annual Joint Conference of the IEEE Computer and Communications Societies (IEEE Cat. No.03CH37428).

[32]  Florian Schintke,et al.  Peer-to-Peer Computing , 2010, Euro-Par.

[33]  Chandrasekar Ramachandran,et al.  An Improved Probabilistic Ant based Clustering for Distributed Databases , 2007, IJCAI.

[34]  Linpeng Huang,et al.  Distributed Information Retrieval Based on Hierarchical Semantic Overlay Network , 2004, GCC.

[35]  Sandhya Dwarkadas,et al.  Peer-to-peer information retrieval using self-organizing semantic overlay networks , 2003, SIGCOMM '03.

[36]  Jean-Louis Deneubourg,et al.  The dynamics of collective sorting robot-like ants and ant-like robots , 1991 .

[37]  Ana L. C. Bazzan,et al.  A Multiagent, Multiobjective Clustering Algorithm , 2009, Data Mining and Multi-agent Integration.

[38]  Wolfgang Müller,et al.  Classifying Documents by Distributed P2P Clustering , 2003, GI Jahrestagung.

[39]  Wolfgang Nejdl,et al.  Super-peer-based routing and clustering strategies for RDF-based peer-to-peer networks , 2003, WWW '03.

[40]  A. Sima Etaner-Uyar,et al.  Parallelization of an ant-based clustering approach , 2010, Kybernetes.

[41]  Christos Doulkeridis,et al.  Peer-to-Peer Clustering for Semantic Overlay Network Generation , 2006, PRIS.

[42]  Leandro Nunes de Castro,et al.  Towards Improving Clustering Ants: An Adaptive Ant Clustering Algorithm , 2005, Informatica.

[43]  Alon Y. Halevy,et al.  Piazza: data management infrastructure for semantic web applications , 2003, WWW '03.

[44]  Aris M. Ouksel,et al.  Distributed databases and peer-to-peer databases: past and present , 2008, SGMD.

[45]  Verena Kantere,et al.  GrouPeer: Dynamic clustering of P2P databases , 2009, Inf. Syst..

[46]  R. Sivakumar,et al.  Ant-based Clustering Algorithms: A Brief Survey , 2010 .

[47]  Hector Garcia-Molina,et al.  Semantic Overlay Networks for P2P Systems , 2004, AP2PC.

[48]  Xueqi Cheng,et al.  WonGoo: a pure peer-to-peer full text information retrieval system based on semantic overlay networks , 2004, Third IEEE International Symposium on Network Computing and Applications, 2004. (NCA 2004). Proceedings..

[49]  Diomidis Spinellis,et al.  A survey of peer-to-peer content distribution technologies , 2004, CSUR.