P-Terse: A Peer-to-Peer Based Text Retrieval and Search System

P-TERSE, a peer-to-peer (P2P) text retrieval and search prototype system is introduced in this paper. Compared with existing P2P systems, P-TERSE has three novel features: 1) The text content of the shared documents is searchable. 2) The system is open for extensions. 3) Our search and query processing techniques are implemented in the system. These techniques are designed for achieving high efficiency and scalability. The presentation of the system includes the design strategies of the system and the technologies that are implemented. We also discuss the on-going research and development work related to P-TERSE.

[1]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[2]  Richard P. Martin,et al.  PlanetP: using gossiping to build content addressable peer-to-peer information sharing communities , 2003, High Performance Distributed Computing, 2003. Proceedings. 12th IEEE International Symposium on.

[3]  Karl Aberer,et al.  GridVine: Building Internet-Scale Semantic Overlay Networks , 2004, SEMWEB.

[4]  Jie Wu,et al.  Small Worlds: The Dynamics of Networks between Order and Randomness , 2003 .

[5]  Ian T. Foster,et al.  Locating Data in (Small-World?) Peer-to-Peer Scientific Collaborations , 2002, IPTPS.

[6]  David J. DeWitt,et al.  Computing PageRank in a Distributed Internet Search Engine System , 2004, VLDB.

[7]  Norbert Fuhr,et al.  Combining CORI and the Decision-Theoretic Approach for Advanced Resource Selection , 2004, ECIR.

[8]  Jon M. Kleinberg,et al.  The small-world phenomenon: an algorithmic perspective , 2000, STOC '00.

[9]  Antony I. T. Rowstron,et al.  Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems , 2001, Middleware.

[10]  Robert Morris,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM 2001.

[11]  Gerhard Weikum,et al.  MINERVA: Collaborative P2P Search , 2005, VLDB.

[12]  Aoying Zhou,et al.  KEYNOTE: Keyword Search by Node Selection for Text Retrieval on DHT-Based P2P Networks , 2006, DASFAA.

[13]  Aoying Zhou,et al.  A Distributed Ranking Strategy in Peer-to-Peer Based Information Retrieval Systems , 2004, APWeb.

[14]  Ben Y. Zhao,et al.  Tapestry: a fault-tolerant wide-area application infrastructure , 2002, CCRV.

[15]  Zhichen Xu,et al.  pSearch: information retrieval in structured overlays , 2003, CCRV.

[16]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM 2001.

[17]  Aoying Zhou,et al.  C2: A New Overlay Network Based on CAN and Chord , 2003, GCC.

[18]  Beng Chin Ooi,et al.  Explore the "Small world phenomena" in pure P2P information sharing systems , 2003, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings..

[19]  Torsten Suel,et al.  Efficient query evaluation on large textual collections in a peer-to-peer environment , 2005, Fifth IEEE International Conference on Peer-to-Peer Computing (P2P'05).

[20]  Aoying Zhou,et al.  SIPPER: Selecting Informative Peers in Structured P2P Environment for Content-Based Retrieval , 2006, 22nd International Conference on Data Engineering (ICDE'06).