A Novel Approach to Query Expansion based on Semantic Similarity Measures

In this paper, we present a framework supporting information retrieval over corpora of documents using an automatic semantic query expansion approach. The main idea is to expand the set of words used as query terms exploiting the notion of semantic similarity between the concepts related to the search terms. We leverage existing lexical resources and similarity metrics computed among terms to generate - by a proper mapping into a vectorial space - an index for the fast retrieval of a set of terms “semantically correlated” to a given query term. The vector of expanded terms is then exploited in the query stage to retrieve documents that are significantly related to specific combinations of the query terms. Preliminary experimental results concerning efficiency and effectiveness of the proposed approach are reported and discussed.

[1]  Vincenzo Moscato,et al.  A recommendation strategy based on user behavior in digital ecosystems , 2010, MEDES.

[2]  Claudio Carpineto,et al.  A Survey of Automatic Query Expansion in Information Retrieval , 2012, CSUR.

[3]  Jian-Yun Nie,et al.  Integrating Multiple Resources for Diversified Query Expansion , 2014, ECIR.

[4]  Mayank Singh,et al.  Ontology Based Information Retrieval in Semantic Web: A Survey , 2013 .

[5]  Mandar Mitra,et al.  Improving query expansion using WordNet , 2013, J. Assoc. Inf. Sci. Technol..

[6]  Christos Faloutsos,et al.  FastMap: a fast algorithm for indexing, data-mining and visualization of traditional and multimedia datasets , 1995, SIGMOD '95.

[7]  Antonio Maria Rinaldi,et al.  A content-based approach for document representation and retrieval , 2008, DocEng '08.

[8]  A. R. Rivas,et al.  Study of Query Expansion Techniques and Their Application in the Biomedical Information Retrieval , 2014, TheScientificWorldJournal.

[9]  Pablo Castells,et al.  An Ontology-Based Information Retrieval Model , 2005, ESWC.

[10]  Fabio Persia,et al.  SemTree: An index for supporting semantic retrieval of documents , 2015, 2015 31st IEEE International Conference on Data Engineering Workshops.

[11]  Flora Amato,et al.  Exploiting Cloud Technologies and Context Information for Recommending Touristic Paths , 2013, IDC.

[12]  Escuela Politécnica Superior,et al.  Semantically enhanced Information Retrieval: an ontology-based approach , 2009 .

[13]  Vincenzo Moscato,et al.  A Combined Relevance Feedback Approach for User Recommendation in E-commerce Applications , 2010, 2010 Third International Conference on Advances in Computer-Human Interactions.

[14]  Susan T. Dumais,et al.  The vocabulary problem in human-system communication , 1987, CACM.

[15]  Ben He,et al.  High performance query expansion using adaptive co-training , 2013, Inf. Process. Manag..

[16]  Flora Amato,et al.  Knowledge Representation and Management for E-Government Documents , 2008, E-Government, ICT Professionalism and Competences Service Science.

[17]  Flora Amato,et al.  A Lexicon-Grammar Based Methodology for Ontology Population for e-Health Applications , 2015, 2015 Ninth International Conference on Complex, Intelligent, and Software Intensive Systems.

[18]  Flora Amato,et al.  A system for semantic retrieval and long-term preservation of multimedia documents in the e-government domain , 2009, Int. J. Web Grid Serv..

[19]  M. E. Maron,et al.  On Relevance, Probabilistic Indexing and Information Retrieval , 1960, JACM.

[20]  Antonio Picariello,et al.  Information Retrieval from the Web: An Interactive Paradigm , 2005, Multimedia Information Systems.

[21]  Kai-Hsiang Yang,et al.  Using google distance for query expansion in expert finding , 2014, Ninth International Conference on Digital Information Management (ICDIM 2014).

[22]  Paolo Napoletano,et al.  Weighted Word Pairs for query expansion , 2015, Inf. Process. Manag..

[23]  S. Srinivasan,et al.  A Survey of Text Mining : Retrieval , Extraction and Indexing Techniques , 2012 .

[24]  Pablo Castells,et al.  An Adaptation of the Vector-Space Model for Ontology-Based Information Retrieval , 2007, IEEE Transactions on Knowledge and Data Engineering.

[25]  Sergio Ilarri,et al.  An Approach for Automatic Query Expansion Based on NLP and Semantics , 2014, DEXA.

[26]  Josiane Mothe,et al.  Query expansion in information retrieval : What can we learn from a deep analysis of queries ? , 2014, COLING 2014.

[27]  Philip Resnik,et al.  Semantic Similarity in a Taxonomy: An Information-Based Measure and its Application to Problems of Ambiguity in Natural Language , 1999, J. Artif. Intell. Res..

[28]  James Allan,et al.  Entity query feature expansion using knowledge base links , 2014, SIGIR.