OSQR: A framework for ontology-based semantic query routing in unstructured P2P networks

Efficient searching for information is an important goal in unstructured peer-to-peer (P2P) networks. While several P2P systems have been proposed for data sharing purposes, many support only semantics-free keyword searches or coarser grained file name searches. In this paper, we present an ontology based semantic query routing algorithm that performs efficient semantic search in unstructured P2P overlay networks. In our proposed system, the queries are routed in the network by forwarding to peers with highly relevant content in their local storages. To aid in this semantic query routing, we propose a scheme where each peer in the network adheres to a global ontology and semantically tags its local document collection with concepts in the ontology. Based on the semantic tags, peer level semantic summaries are generated, exchanged with neighboring peers and propagated along search paths which aid in efficient local query processing and overlay query routing. An extensive set of simulations performed to evaluate the effectiveness of the system on P2P networks show 380% and 717% improvement in average recall rate, and 410% and 725% improvement in average precision over Ontology Index based Query Routing [20] and Random Walk [18], respectively for dynamic networks at comparable message overheads. Thus, our approach represents a significant advance in practical terms.

[1]  Ning Qian,et al.  Search Using Semantic Inference in Unstructured P2P Networks , 2010, 2010 International Conference on Computational Intelligence and Software Engineering.

[2]  Anand Sivasubramaniam,et al.  Semantic small world: an overlay network for peer-to-peer search , 2004, Proceedings of the 12th IEEE International Conference on Network Protocols, 2004. ICNP 2004..

[3]  Juan Li,et al.  OntSum: A Semantic Query Routing Scheme in P2P Networks Based on Concise Ontology Indexing , 2007, 21st International Conference on Advanced Information Networking and Applications (AINA '07).

[4]  Jafar Habibi,et al.  An Ontology Based Local Index in P2P Networks , 2006, SKG.

[5]  Zhichen Xu,et al.  pSearch: information retrieval in structured overlays , 2003, CCRV.

[6]  Jafar Habibi,et al.  Efficient Semantic Based Search in Unstructured Peer-to-Peer Networks , 2008, 2008 Second Asia International Conference on Modelling & Simulation (AMS).

[7]  Dimitrios Gunopulos,et al.  A local search mechanism for peer-to-peer networks , 2002, CIKM '02.

[8]  Hai Jin,et al.  SemreX: Efficient search in a semantic overlay for literature retrieval , 2008, Future Gener. Comput. Syst..

[9]  Hector Garcia-Molina,et al.  Improving search in peer-to-peer networks , 2002, Proceedings 22nd International Conference on Distributed Computing Systems.

[10]  Yang Zhang,et al.  Ontology based P2P Semantic Search Routing Algorithm , 2010, 2010 International Conference on Networking, Sensing and Control (ICNSC).

[11]  Tore Risch,et al.  EDUTELLA: a P2P networking infrastructure based on RDF , 2002, WWW.

[12]  Elizabeth R. Jessup,et al.  Matrices, Vector Spaces, and Information Retrieval , 1999, SIAM Rev..

[13]  Hector Garcia-Molina,et al.  Semantic Overlay Networks for P2P Systems , 2004, AP2PC.

[14]  Sushil K. Prasad,et al.  SPUN: A P2P Probabilistic Search Algorithm Based on Successful Paths in Unstructured Networks , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[15]  W. Bruce Croft,et al.  Discovering key concepts in verbose queries , 2008, SIGIR '08.

[16]  Mark Handley,et al.  A scalable content-addressable network , 2001, SIGCOMM '01.

[17]  David R. Karger,et al.  Chord: A scalable peer-to-peer lookup service for internet applications , 2001, SIGCOMM '01.

[18]  Edith Cohen,et al.  Search and replication in unstructured peer-to-peer networks , 2002, ICS '02.

[19]  Yi-fang Brook Wu,et al.  Domain-specific keyphrase extraction , 2005, CIKM '05.

[20]  James Won-Ki Hong,et al.  Semantic overlay network for peer-to-peer hybrid information search and retrieval , 2011, 12th IFIP/IEEE International Symposium on Integrated Network Management (IM 2011) and Workshops.