Generating core domain ontologies from normalized dictionaries

This paper proposes a general framework for automatic core domain ontology generation from LMF (ISO 24613) standardized dictionaries. The originality of this work lies not only in the use of a unique and finely structured source containing multi-domain and lexical knowledge of morphological, syntactic and semantic levels, lending itself to ontological interpretations, but also in the proper building of the taxonomic backbone of the domain ontology. To this end, we have integrated a validation stage into the proposed process in order to maintain the consistency of the resulting formalized domain ontology core throughout this process and support the checking of anomalies in the handled source. Furthermore, this generation process has been implemented in an iterative and incremental system based on domain- and language-independent rules. The reliability of the proposed process is proven through many experiments that have been conducted on various domains using normalized dictionaries, but without lack of generality, we choose to illustrate an experiment carried out on the Arabic language. This choice is explained by both the great deficiency of work on building of Arabic ontologies and the availability within our research team of an LMF-standardized Arabic dictionary.

[1]  Ollivier Haemmerlé,et al.  Méthodologie de transformation d'un thesaurus en une ontologie de domaine , 2008, Rev. d'Intelligence Artif..

[2]  N. Guarino,et al.  Formal Ontology in Information Systems: Proceedings of the 1st International Conference June 6-8, 1998, Trento, Italy , 1998 .

[3]  Ángel Flores,et al.  A Review of Common Problems in Linguistic Resources and a New Way to Represent Ontological Relations , 2007 .

[4]  Raphael Volz,et al.  Semi-automatic Ontology Acquisition from a Corporate Intranet , 2000 .

[5]  Martin Hepp,et al.  Reusing ontologies and language components for ontology generation , 2010, Data Knowl. Eng..

[6]  Abdelmajid Ben Hamadou,et al.  Evaluating the Content of LMF Standardized Dictionaries , 2017, ACM Trans. Asian Low Resour. Lang. Inf. Process..

[7]  Yue Ma,et al.  Formal Description of Resources for Ontology-based Semantic Annotation , 2010, LREC.

[8]  Mauricio Barcellos Almeida A proposal to evaluate ontology content , 2009, Appl. Ontology.

[9]  Frehiwot Fisseha,et al.  Reengineering Thesauri for New Applications: The AGROVOC Example , 2006, J. Digit. Inf..

[10]  Takahira Yamaguchi,et al.  DODDLE II: A Domain Ontology Development Environment Using a MRD and Text Corpus , 2004, IEICE Trans. Inf. Syst..

[11]  Paul Buitelaar,et al.  Towards Linguistically Grounded Ontologies , 2009, ESWC.

[12]  Eric Nichols,et al.  Robust Ontology Acquisition from Machine-Readable Dictionaries , 2005, IJCAI.

[13]  Aldo Gangemi,et al.  Ontology Learning and Its Application to Automated Terminology Translation , 2003, IEEE Intell. Syst..

[14]  Ping Li,et al.  On Transformation from The Thesaurus into Domain Ontology , 2012 .

[15]  Gio Wiederhold,et al.  Ontology Maintenance with an Algebraic Methodology: a Case Study * , 2003 .

[16]  A. Michiels,et al.  Exploiting a Large Data Base by Longman , 1980, COLING.

[17]  Nicoletta Calzolari,et al.  Detecting Patterns in a Lexical Data Base , 1984, ACL.

[18]  Robert A. Amsler,et al.  A Taxonomy for English Nouns and Verbs , 1981, ACL.

[19]  Asunción Gómez-Pérez,et al.  Multilingual and Localization Support for Ontologies , 2009, ESWC.

[20]  Martin Chodorow,et al.  Extracting Semantic Hierarchies from a Large On-Line Dictionary , 1985, ACL.

[21]  Qin Lu,et al.  Experiments of Ontology Construction with Formal Concept Analysis , 2005, OntoLex@IJCNLP.

[22]  Abdelmajid Ben Hamadou,et al.  Towards Generation of Domain Ontology from LMF Standardized Dictionaries , 2010, SEKE.

[23]  Yarden Katz,et al.  Pellet: A practical OWL-DL reasoner , 2007, J. Web Semant..

[24]  Peter F. Patel-Schneider,et al.  Transforming XML Schema to OWL Using Patterns , 2011, 2011 IEEE Fifth International Conference on Semantic Computing.

[25]  Nathalie Aussenac-Gilles,et al.  Ontology Learning by Analyzing XML Document Structure and Content , 2009, KEOD.

[26]  Maria Teresa Pazienza,et al.  Linguistic Watermark 3.0: An RDF Framework and a Software Library for Bridging Language and Ontologies in the Semantic Web , 2008, SWAP.

[27]  Abdelmajid Ben Hamadou,et al.  LMF Standardized Model for the Editorial Electronic Dictionaries of Arabic , 2018, NLPCS.

[28]  Nathalie Aussenac-Gilles,et al.  The TERMINAE Method and Platform for Ontology Engineering from Texts , 2008, Ontology Learning and Population.

[29]  Maria Teresa Pazienza,et al.  Let's talk about our "being": A linguistic-based ontology framework for coordinating agents , 2007, Appl. Ontology.

[30]  Graeme Hirst,et al.  Ontology and the Lexicon , 2004, Handbook on Ontologies.

[31]  Pedro M. Domingos,et al.  Unsupervised Ontology Induction from Text , 2010, ACL.

[32]  Philipp Cimiano,et al.  Ontology learning and population from text - algorithms, evaluation and applications , 2006 .

[33]  Eneko Agirre,et al.  Building Accurate Semantic Taxonomies from Monolingual MRDs , 1998, COLING-ACL.