Enhanced semantic representation for improved ontology-based information retrieval

This research addresses the semantic and knowledge gap problem in information retrieval by proposing an ontology-based semantic feature-matching approach, which uses natural language processing, named entity recognition and user-oriented ontologies. The approach comprises four steps: i user-oriented ontology building; ii semantic feature extraction for identifying information objects; iii semantic feature selection using user-oriented ontologies to enhance the semantic representation of the information objects, and iv measuring the similarity between the information objects using their enhanced semantic representations. The experiment conducted explores the retrieval performance of the proposed approach and shows that it consistently outperforms its corresponding term-based approach by demonstrating improved precision, recall and F-score.

[1]  Quan Wang,et al.  Regularized latent semantic indexing , 2011, SIGIR.

[2]  Rossitza Setchi,et al.  An Ontology Based Approach to Measuring the Semantic Similarity between Information Objects in Personal Information Collections , 2010, KES.

[3]  Roi Blanco,et al.  Enhanced results for web search , 2011, SIGIR.

[4]  Gábor Nagypál Improving Information Retrieval Effectiveness by Using Domain Knowledge Stored in Ontologies , 2005, OTM Workshops.

[5]  Paul Buitelaar,et al.  Ontology-based information extraction and integration from heterogeneous data sources , 2008, Int. J. Hum. Comput. Stud..

[6]  Danushka Bollegala,et al.  A Relational Model of Semantic Similarity between Words using Automatically Extracted Lexical Pattern Clusters from the Web , 2009, EMNLP.

[7]  David Sánchez,et al.  Ontology-based semantic similarity: A new feature-based approach , 2012, Expert Syst. Appl..

[8]  Clement T. Yu,et al.  Personalized Web search for improving retrieval effectiveness , 2004, IEEE Transactions on Knowledge and Data Engineering.

[9]  Pablo Castells,et al.  An Ontology-Based Information Retrieval Model , 2005, ESWC.

[10]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[11]  Rossitza Setchi,et al.  User-oriented ontology-based clustering of stored memories , 2012, Expert Syst. Appl..

[12]  Adam L. Berger,et al.  A Maximum Entropy Approach to Natural Language Processing , 1996, CL.

[13]  Thomas R. Gruber,et al.  A translation approach to portable ontology specifications , 1993, Knowl. Acquis..

[14]  Karthik Ramani,et al.  Ontology-based design information extraction and retrieval , 2007, Artificial Intelligence for Engineering Design, Analysis and Manufacturing.

[15]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[16]  Gerhard Weikum,et al.  YAGO: A Large Ontology from Wikipedia and WordNet , 2008, J. Web Semant..

[17]  W. Bruce Croft,et al.  LDA-based document models for ad-hoc retrieval , 2006, SIGIR.

[18]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[19]  John D. Lafferty,et al.  A correlated topic model of Science , 2007, 0708.3601.

[20]  Fuchun Peng,et al.  Personalize web search results with user's location , 2010, SIGIR '10.

[21]  Asunción Gómez-Pérez,et al.  METHONTOLOGY: From Ontological Art Towards Ontological Engineering , 1997, AAAI 1997.

[22]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[23]  Klaus Berberich,et al.  Improving local search ranking through external logs , 2011, SIGIR.

[24]  Euripides G. M. Petrakis,et al.  X-Similarity: Computing Semantic Similarity between Concepts from Different Ontologies , 2006, J. Digit. Inf. Manag..