Improving Search Engines' Document Ranking Employing Semantics and an Inference Network

The users search mainly diverse information from several topics and their needs are difficult to be satisfied from the techniques currently employed in commercial search engines and without intervention from the user. In this paper, a novel framework is presented for performing re-ranking in the results of a search engine based on feedback from the user. The proposed scheme combines smoothly techniques from the area of Inference Networks and data from semantic knowledge bases. The novelty lies in the construction of a probabilistic network for each query which takes as input the belief of the user to each result (initially, all are equivalent) and produces as output a new ranking for the search results. We have constructed an implemented prototype that supports different Web search engines and it can be extended to support any search engine. Finally extensive experiments were performed using the proposed methods depicting the improvement of the ranking of the search engines results.

[1]  Wei Ji Ma,et al.  Bayesian inference with probabilistic population codes , 2006, Nature Neuroscience.

[2]  Naomie Salim,et al.  Ligand-Based Virtual Screening Using Bayesian Inference Network and Reweighted Fragments , 2012, TheScientificWorldJournal.

[3]  Charles L. A. Clarke,et al.  Overview of the TREC 2011 Web Track | NIST , 2011 .

[4]  Thorsten Joachims,et al.  Dynamic ranked retrieval , 2011, WSDM '11.

[5]  Jaime B. Teevan,et al.  Improving Information Retrieval with Textual Analysis: Bayesian Models and Beyond , 2001 .

[6]  Evangelos Theodoridis,et al.  Improved text annotation with Wikipedia entities , 2013, SAC '13.

[7]  Adele E. Howe,et al.  SAVVYSEARCH: A Metasearch Engine That Learns Which Search Engines to Query , 1997, AI Mag..

[8]  Gerhard Weikum,et al.  WWW 2007 / Track: Semantic Web Session: Ontologies ABSTRACT YAGO: A Core of Semantic Knowledge , 2022 .

[9]  Sang-goo Lee,et al.  Exploiting Taxonomic Knowledge for Personalized Search: A Bayesian Belief Network-based Approach , 2011, J. Inf. Sci. Eng..

[10]  Giannis Tzimas,et al.  Dynamic refinement of search engines results utilizing the user intervention , 2012, J. Syst. Softw..

[11]  Charles L. A. Clarke,et al.  Overview of the TREC 2010 Web Track , 2010, TREC.

[12]  Simone Paolo Ponzetto,et al.  BabelNet: Building a Very Large Multilingual Semantic Network , 2010, ACL.

[13]  V. Latora,et al.  Complex networks: Structure and dynamics , 2006 .

[14]  King-Lup Liu,et al.  Building efficient and effective metasearch engines , 2002, CSUR.

[15]  Michael A. West,et al.  Bayesian Inference on Network Traffic Using Link Count Data , 1998 .

[16]  Lakhmi C. Jain,et al.  Innovations in Bayesian Networks , 2008 .

[17]  Christina Lioma,et al.  Graph-based term weighting for information retrieval , 2011, Information Retrieval.

[18]  Sofia Stamou,et al.  Reducing information redundancy in search results , 2013, SAC '13.

[19]  Ricardo Baeza-Yates,et al.  Modern Information Retrieval - the concepts and technology behind search, Second edition , 2011 .

[20]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[21]  Paolo Ferragina,et al.  TAGME: on-the-fly annotation of short text fragments (by wikipedia entities) , 2010, CIKM.

[22]  Luis M. de Campos,et al.  An information retrieval model based on simple Bayesian networks , 2003, Int. J. Intell. Syst..

[23]  I. S. P. Daryle Niedermayer,et al.  An Introduction to Bayesian Networks and Their Contemporary Applications , 2008, Innovations in Bayesian Networks.

[24]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[25]  Olivier Chapelle,et al.  A dynamic bayesian network click model for web search ranking , 2009, WWW '09.

[26]  Tie-Yan Liu,et al.  Learning to rank for information retrieval , 2009, SIGIR.

[27]  Naomie Salim,et al.  Implementing Relevance Feedback in Ligand-Based Virtual Screening Using Bayesian Inference Network , 2011, Journal of biomolecular screening.

[28]  Charles L. A. Clarke,et al.  Overview of the TREC 2011 Web Track , 2011, TREC.