Scalable keyword search based on semantic in DHT based peer-to-peer system

The common way for keyword search in Distributed Hash Tables (DHTs) based Peer-to-Peer (P2P) system is to construct distributed inverted index by keywords. But it suffers from the problem of unscalable resources (e.g. bandwidth, storage) consumption. In this paper, we present SKS, a scalable keyword search approach in DHTs based P2P system. SKS introduces the ontology to organize the specific domain, which captures the semantic relations between words. SKS constructs distributed inverted index by concepts, which decreases the number of index entries publishing for documents and avoids the intersection of inverted lists between nodes when executing multi-keyword search. With the concept index SKS transforms the keyword search to the match process of concepts, implementing semantic search. Simulation experiment shows that SKS is more efficient than the approach of distributed inverted index by keywords in indices publishing overhead and query overhead.

[1]  Michael B. Jones,et al.  SkipNet: A Scalable Overlay Network with Practical Locality Properties , 2003, USENIX Symposium on Internet Technologies and Systems.

[2]  Guangwen Yang,et al.  Making Peer-to-Peer Keyword Searching Feasible Using Multi-level Partitioning , 2004, IPTPS.

[3]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[4]  David McLean,et al.  An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources , 2003, IEEE Trans. Knowl. Data Eng..

[5]  David R. Karger,et al.  On the Feasibility of Peer-to-Peer Web Indexing and Search , 2003, IPTPS.

[6]  Ricardo Baeza-Yates,et al.  Information Retrieval: Data Structures and Algorithms , 1992 .

[7]  Kang-Won Lee,et al.  Supporting efficient keyword-based file search in peer-to-peer file sharing systems , 2004, IEEE Global Telecommunications Conference, 2004. GLOBECOM '04..

[8]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[9]  Sandhya Dwarkadas,et al.  Hybrid Global-Local Indexing for Efficient Peer-to-Peer Information Retrieval , 2004, NSDI.

[10]  Karl Aberer,et al.  ALVIS peers: a scalable full-text peer-to-peer retrieval engine , 2006, P2PIR '06.

[11]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[12]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[13]  Omprakash D. Gnawali A Keyword-Set Search System for Peer-to-Peer Networks , 2002 .