Query-driven approach of contextual ontology module learning using web snippets

The main objective of this work is to automatically build ontology modules that cover search terms of users in ontology-based question answering on the Web. Indeed, some arising approaches of ontology module extraction aim at solving the problem of identifying ontology fragment candidates that are relevant for the application. The main problem is that these approaches consider only the input of predefined ontologies, instead of the underlying semantics represented in texts. This work proposes an approach of contextual ontology module learning covering particular search terms by analyzing past user queries and by searching for web snippets provided by the traditional search engines. The obtained contextual modules will be used for query reformulation. The proposal has been evaluated on the ground of two criteria: the semantic cotopy measure of discovered ontology modules and the precision measure of the search results obtained by using the resulted ontology modules for query reformulation. The experiments have been carried out according to two case studies: an open domain web search and the medical digital library “PubMed”.

[1]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[2]  Mohammed Bennamoun,et al.  Ontology learning from text: A look back and into the future , 2012, CSUR.

[3]  Jan H. M. Korst,et al.  Automatic Ontology Population by Googling , 2005, BNAIC.

[4]  Benoît Lemaire,et al.  Effects of High-Order Co-occurrences on Word Semantic Similarities , 2006, ArXiv.

[5]  Suresh Manandhar,et al.  An Unsupervised Method for General Named Entity Recognition and Automated Concept Discovery , 2004 .

[6]  Steffen Staab,et al.  Measuring Similarity between Ontologies , 2002, EKAW.

[7]  Alan L. Rector,et al.  Web ontology segmentation: analysis, classification and use , 2006, WWW '06.

[8]  David Sánchez,et al.  Learning relation axioms from text: An automatic Web-based approach , 2012, Expert Syst. Appl..

[9]  C. Mastroianni,et al.  A reference architecture for knowledge management-based Web systems , 2003, Proceedings of the Fourth International Conference on Web Information Systems Engineering, 2003. WISE 2003..

[10]  Doug Downey,et al.  Locating Complex Named Entities in Web Text , 2007, IJCAI.

[11]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[12]  Henda Hajjami Ben Ghézala,et al.  Modular Ontological Warehouse for Adaptative Information Search , 2012, MEDI.

[13]  Doug Downey,et al.  Web-scale information extraction in knowitall: (preliminary results) , 2004, WWW '04.

[14]  J. Silva,et al.  A Local Maxima method and a Fair Dispersion Normalization for extracting multi-word units from corpora , 2009 .

[15]  Hajer Baazaoui Zghal,et al.  Evolutive Content-based Search System - Semantic Search System based on Case-based-Reasoning and Ontology Enrichment , 2018, KDIR.

[16]  Marta Sabou,et al.  Dynamic Integration of Multiple Evidence Sources for Ontology Learning , 2012, J. Inf. Data Manag..

[17]  Stefano Spaccapietra,et al.  Modular Ontologies: Concepts, Theories and Techniques for Knowledge Modularization , 2009, Modular Ontologies.

[18]  David Sánchez,et al.  Learning non-taxonomic relationships from web documents for domain ontology construction , 2008, Data Knowl. Eng..

[19]  Hartmut Ehrig,et al.  Fundamental Theory for Typed Attributed Graphs and Graph Transformation based on Adhesive HLR Categories , 2006, Fundam. Informaticae.

[20]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[21]  David Snchez Domain Ontology Learning from the Web , 2008 .

[22]  Alfredo Cuzzocrea,et al.  Combining multidimensional user models and knowledge representation and management techniques for making web services knowledge-aware , 2006, Web Intell. Agent Syst..

[23]  Marti A. Hearst Automated Discovery of WordNet Relations , 2004 .

[24]  Enrico Motta,et al.  Modularization: a Key for the Dynamic Selection of Relevant Knowledge Components , 2006, WoMO.

[25]  Susan T. Dumais,et al.  The latent semantic analysis theory of knowledge , 1997 .

[26]  Farookh Khadeer Hussain,et al.  SOF: a semi‐supervised ontology‐learning‐based focused crawler , 2013, Concurr. Comput. Pract. Exp..

[27]  Henda Ben Ghezala,et al.  Survey on ontology learning from Web and open issues , 2009 .

[28]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[29]  Ali Rahnama,et al.  Ontology learning: revisted , 2012 .

[30]  Henda Hajjami Ben Ghézala,et al.  Contextual Ontology Module Learning from Web Snippets and Past User Queries , 2011, KES.

[31]  Mark Stevenson,et al.  Comparing Information Extraction Pattern Models , 2006 .

[32]  Mark A. Musen,et al.  Specifying Ontology Views by Traversal , 2004, International Semantic Web Conference.

[33]  Charles T. Meadow,et al.  Text information retrieval systems , 1992 .

[34]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .