Intelligent content-based retrieval for P2P networks

Most current peer-to-peer (P2P) systems are designed for file sharing among network participants, for which a simple metadata search mechanism is sufficient to locate and retrieve shared files. However, sharing document information such as news articles, scientific publications, and company reports requires a content-based search mechanism that supports efficient retrieval. In this paper, we propose an intelligent P2P content-based document retrieval system known as iSearch-P2P. iSearch-P2P uses the Fuzzy Adaptive Resonance Theory (Fuzzy ART) neural network to cluster documents in support of content-based publishing and retrieval over P2P networks. With intelligent content-based search, iSearch-P2P supports scalability and avoids the indexing and query flooding problems of most existing P2P systems. We describe the architecture, the publishing and retrieval processes, the implementation, and the performance evaluation of the proposed iSearch-P2P system.
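The abstract names Fuzzy ART as the document clustering technique but does not give its details. As a rough illustration of how Fuzzy ART can group document feature vectors, the following Python sketch follows the standard formulation (complement coding, choice function, vigilance test, learning rule). The class name `FuzzyART`, the parameter defaults, and the toy three-term document vectors are assumptions made for illustration; they are not taken from the iSearch-P2P implementation.

```python
from typing import List, Sequence


class FuzzyART:
    """Minimal Fuzzy ART clusterer (illustrative sketch, not the iSearch-P2P code)."""

    def __init__(self, rho: float = 0.75, alpha: float = 0.001, beta: float = 1.0):
        self.rho = rho      # vigilance: higher values yield more, tighter clusters
        self.alpha = alpha  # choice parameter (small positive constant)
        self.beta = beta    # learning rate; 1.0 corresponds to fast learning
        self.weights: List[List[float]] = []  # one weight vector per category

    @staticmethod
    def _complement_code(a: Sequence[float]) -> List[float]:
        # Map a in [0, 1]^M to I = (a, 1 - a), so that |I| is always M.
        return list(a) + [1.0 - x for x in a]

    def train(self, a: Sequence[float]) -> int:
        """Present one input vector and return the index of its category."""
        i = self._complement_code(a)
        norm_i = sum(i)

        # Score every existing category with the choice function
        # T_j = |I ^ w_j| / (alpha + |w_j|), where ^ is the element-wise minimum.
        scored = []
        for j, w in enumerate(self.weights):
            match = sum(min(x, y) for x, y in zip(i, w))
            scored.append((match / (self.alpha + sum(w)), j, match))
        scored.sort(key=lambda t: (-t[0], t[1]))

        # Try categories in order of decreasing choice value; the first one
        # that passes the vigilance test |I ^ w_j| / |I| >= rho resonates.
        for _, j, match in scored:
            if match / norm_i >= self.rho:
                w = self.weights[j]
                self.weights[j] = [
                    self.beta * min(x, y) + (1.0 - self.beta) * y
                    for x, y in zip(i, w)
                ]
                return j

        # No category resonated: commit a new one initialised to the input.
        self.weights.append(i)
        return len(self.weights) - 1


# Hypothetical normalised term weights for four short documents.
docs = [
    [0.9, 0.1, 0.0],
    [0.8, 0.2, 0.0],
    [0.0, 0.1, 0.9],
    [0.1, 0.0, 0.8],
]
art = FuzzyART(rho=0.75)
labels = [art.train(d) for d in docs]
print(labels)
```

With these toy vectors, the first two documents fall into one category and the last two into another. Raising `rho` makes the vigilance test stricter, so the same data would split into more categories.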
