A conceptual representation of documents and queries for information retrieval systems by using light ontologies

This article presents a vector space model approach to representing documents and queries, based on concepts instead of terms and using WordNet as a light ontology. Such representation reduces information overlap with respect to classic semantic expansion techniques. Experiments carried out on the MuchMore benchmark and on the TREC-7 and TREC-8 Ad-Hoc collections demonstrate the effectiveness of the proposed approach.

[1]  Jun Zhai,et al.  Application of Fuzzy Ontology Framework to Information Retrieval for SCM , 2008, 2008 International Symposiums on Information Processing.

[2]  Steffen Staab,et al.  Handbook on Ontologies (International Handbooks on Information Systems) , 2004 .

[3]  Troels Andreasen,et al.  Conceptual Indexing of Text Using Ontologies and Lexical Resources , 2009, FQAS.

[4]  George Lakoff,et al.  Women, Fire, and Dangerous Things , 1987 .

[5]  Xiang Zhang,et al.  An Ontology-Driven Information Retrieval Mechanism for Semantic Information Portals , 2005, 2005 First International Conference on Semantics, Knowledge and Grid.

[6]  Ellen M. Voorhees,et al.  Using WordNet to disambiguate word senses for text retrieval , 1993, SIGIR.

[7]  Nenad Stojanovic,et al.  A logic-based approach for query refinement in ontology-based information retrieval systems , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[8]  W. Bruce Croft,et al.  Query expansion using local and global document analysis , 1996, SIGIR '96.

[9]  Michael K. Buckland,et al.  Annual Review of Information Science and Technology , 2006, J. Documentation.

[10]  Philip S. Yu,et al.  On effective conceptual indexing and similarity search in text data , 2001, Proceedings 2001 IEEE International Conference on Data Mining.

[11]  P. Smith,et al.  A review of ontology based query expansion , 2007, Inf. Process. Manag..

[12]  Yi Yu,et al.  Semantic Information Retrieval Based on Fuzzy Ontology for Electronic Commerce , 2008, J. Softw..

[13]  Nicola Guarino,et al.  Ontologies and Knowledge Bases. Towards a Terminological Clarification , 1995 .

[14]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[15]  Tadeusz P. Dobrowiecki,et al.  An Ontology-Based Information Retrieval System , 2003, IEA/AIE.

[16]  W. Bruce Croft,et al.  A framework for selective query expansion , 2004, CIKM '04.

[17]  Steffen Staab,et al.  International Handbooks on Information Systems , 2013 .

[18]  Mohand Boughanem,et al.  An Information Retrieval Driven by Ontology: from Query to Document Expansion , 2007, RIAO.

[19]  Miguel Ángel García Cumbreras,et al.  Integrating MeSH Ontology to Improve Medical Information Retrieval , 2007, CLEF.

[20]  Fang Wu,et al.  Design and Implementation of Ontology-Based Query Expansion for Information Retrieval , 2007, CONFENIS.

[21]  Olfa Dridi Ontology-based information retrieval: Overview and new proposition , 2008, 2008 Second International Conference on Research Challenges in Information Science.

[22]  Bijan Parsia,et al.  Ichigen-San: An Ontology-Based Information Retrieval System , 2006, APWeb.

[23]  Pablo Castells,et al.  An Ontology-Based Information Retrieval Model , 2005, ESWC.

[24]  Justin Zobel,et al.  Techniques for Efficient Query Expansion , 2004, SPIRE.

[25]  Raymond Y. K. Lau,et al.  Mining Fuzzy Ontology for a Web-Based Granular Information Retrieval System , 2009, RSKT.

[26]  Pablo Castells,et al.  An Adaptation of the Vector-Space Model for Ontology-Based Information Retrieval , 2007, IEEE Transactions on Knowledge and Data Engineering.

[27]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[28]  Ting Liu,et al.  Word Sense Language Model for Information Retrieval , 2006, AIRS.

[29]  Jaime G. Carbonell,et al.  Document and Query Expansion Models for Blog Distillation , 2008, TREC.

[30]  Elie Sanchez,et al.  A Fuzzy Ontology-Approach to improve Semantic Information Retrieval , 2007, URSW.

[31]  S. T. Dumais,et al.  Using latent semantic analysis to improve access to textual information , 1988, CHI '88.

[32]  Takenobu Tokunaga,et al.  Query expansion using heterogeneous thesauri , 2000, Inf. Process. Manag..

[33]  G. Miller,et al.  Folk Psychology or Semantic Entailment? Comment on Rips and Conrad (1989). , 1990 .

[34]  Nenad Stojanovic Approach for defining relevance in the ontology-based information retrieval , 2005, The 2005 IEEE/WIC/ACM International Conference on Web Intelligence (WI'05).

[35]  Loïc Maisonnasse,et al.  Incomplete and Fuzzy Conceptual Graphs to Automatically Index Medical Reports , 2007, NLDB.

[36]  Maria Lapata The Semantics of Relationships: An Interdisciplinary Perspective , 2003 .

[37]  Jan O. Pedersen Information Retrieval Based on Word Senses , 1995 .

[38]  Nenad Stojanovic An Approach for the Efficient Retrieval in Ontology-Enhanced Information Portals , 2004, PAKM.

[39]  Zongmin Ma,et al.  Soft Computing in Ontologies and Semantic Web (Studies in Fuzziness and Soft Computing) , 2006 .

[40]  L. Talmy Lexicalisation patterns: semantic structure in lexical forms , 1985 .

[41]  Yi Yu,et al.  Fuzzy Ontology Models Based on Fuzzy Linguistic Variable for Knowledge Management and Information Retrieval , 2008, Intelligent Information Processing.

[42]  Maurizio Panti,et al.  A conceptual indexing method for content-based retrieval , 1999, Proceedings. Tenth International Workshop on Database and Expert Systems Applications. DEXA 99.

[43]  Mohand Boughanem,et al.  Conceptual Indexing Based on Document Content Representation , 2005, CoLIS.

[44]  Troels Andreasen,et al.  On Conceptual Indexing for Data Summarization , 2009, IFSA/EUSFLAT Conf..

[45]  Mauro Dragoni,et al.  Evolving Neural Networks for Word Sense Disambiguation , 2008, 2008 Eighth International Conference on Hybrid Intelligent Systems.

[46]  Amanda Spink,et al.  A study of results overlap and uniqueness among major Web search engines , 2006, Inf. Process. Manag..

[47]  Xiaojun Wan,et al.  Single Document Summarization with Document Expansion , 2007, AAAI.

[48]  T. V. Geetha,et al.  Semantics Based Information Retrieval Using Conceptual Indexing of Documents , 2003, IDEAL.

[49]  Gerard Salton,et al.  Dynamic information and library processing , 1975 .

[50]  Gerard Salton,et al.  Automatic Information Organization And Retrieval , 1968 .

[51]  Tao Tao,et al.  Language Model Information Retrieval with Document Expansion , 2006, NAACL.

[52]  Asunción Gómez-Pérez,et al.  Ontology-based legal information retrieval to improve the information access in e-government , 2006, WWW '06.

[53]  Roberto Navigli,et al.  Word sense disambiguation: A survey , 2009, CSUR.

[54]  Ellen M. Voorhees,et al.  The fifth text REtrieval conference (TREC-5) , 1997 .

[55]  Joo-Hwee Lim,et al.  Using Bayesian Network for Conceptual Indexing: Application to Medical Document Indexing with UMLS Metathesaurus , 2008, CLEF.

[56]  Robert M. Colomb,et al.  Using Ontologies to Index Conceptual Structures for Tendering Automation , 2002, Australasian Database Conference.

[57]  Mohand Boughanem,et al.  Mercure at TREC7 , 1998, Text Retrieval Conference.

[58]  Jianqiang Wang,et al.  CLEF-2005 CL-SR at Maryland: Document and Query Expansion using Side Collections and Thesauri , 2005, CLEF.

[59]  Donna K. Harman,et al.  Overview of the Sixth Text REtrieval Conference (TREC-6) , 1997, Inf. Process. Manag..

[60]  Joemon M. Jose,et al.  Automatic query expansion based on divergence , 2001, CIKM '01.

[61]  Nicola Guarino,et al.  Sweetening Ontologies with DOLCE , 2002, EKAW.

[62]  Zhendong Niu,et al.  Concept Based Query Expansion , 2013, 2013 Ninth International Conference on Semantics, Knowledge and Grids.

[63]  Martin Holub A new approach to conceptual document indexing: building a hierarchical system of concepts based on document clusters , 2003, ISICT.

[64]  Karthik Ramani,et al.  Ontology-based design information extraction and retrieval , 2007, Artificial Intelligence for Engineering Design, Analysis and Manufacturing.

[65]  James Nga-Kwok Liu,et al.  A Fuzzy-Rough Method for Concept-Based Document Expansion , 2004, Rough Sets and Current Trends in Computing.

[66]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[67]  Charles Ruhl On Monosemy: A Study in Linguistic Semantics , 1989 .

[68]  Stein L. Tomassen Research on Ontology-Driven Information Retrieval , 2006, OTM Workshops.

[69]  Julio Gonzalo,et al.  Indexing with WordNet synsets can improve text retrieval , 1998, WordNet@ACL/COLING.

[70]  John Tait,et al.  Word sense disambiguation in information retrieval revisited , 2003, SIGIR.

[71]  W. Bruce Croft,et al.  Lexical ambiguity and information retrieval , 1992, TOIS.

[72]  Yiyu Yao,et al.  Conceptual Query Expansion , 2005, AWIC.

[73]  Adam Kilgarriff,et al.  "I Don’t Believe in Word Senses" , 1997, Comput. Humanit..

[74]  Mohand Boughanem,et al.  Mercure at trec9: Web and Filtering tasks , 2000, TREC.

[75]  Martha W. Evens,et al.  Relational Models of the Lexicon , 1989 .

[76]  Gina-Anne Levow Issues in pre- and post-translation document expansion: untranslatable cognates and missegmented words , 2003, IRAL.