Enriching ontologies with multilingual information

This paper presents a novel approach to ontology localization with the objective of obtaining multilingual ontologies. Within the ontology development process, ontology localization has been defined as the activity of adapting an ontology to a concrete linguistic and cultural community. Depending on the ontology layers (terminological and/or conceptual) involved in the localization activity, three heterogeneous multilingual ontology metamodels have been identified, and we propose one of them. Our proposal consists in associating the ontology metamodel with an external model for representing and structuring lexical and terminological data in different natural languages. We call this model the Linguistic Information Repository (LIR). The main advantage of this modelling approach is its flexibility: it allows (1) the enrichment of any ontology element with as much linguistic information as the final application needs, and (2) the establishment of links among linguistic elements within and across different natural languages. The LIR model has been designed as an ontology of linguistic elements and is currently available in the Web Ontology Language (OWL). The set of lexical and terminological data that it provides to ontology elements enables the localization of any ontology to a given linguistic and cultural universe. The LIR has been evaluated against the multilingual requirements of the Food and Agriculture Organization of the United Nations in the framework of the NeOn project. It has been shown to solve multilingual representation problems related to the establishment of well-defined relations among lexicalizations within and across languages, as well as conceptualization mismatches among different languages. Finally, we present an extension to the Ontology Metadata Vocabulary, called LexOMV, that reports on multilinguality at the ontology metadata level. By adding this contribution to the LIR model, we account for multilinguality at the three levels of an ontology: the data level, the knowledge representation level and the metadata level.
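The idea of keeping linguistic data in a model external to the ontology, and linking lexicalizations within and across languages, can be illustrated with a minimal Python sketch. This is not the LIR model itself: the class names `Lexicalization` and `Repository`, the relation name `hasTranslation`, and the `http://example.org/onto#River` URI are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Lexicalization:
    """One label for an ontology element in one natural language (hypothetical class)."""
    label: str
    language: str  # language code such as "en" or "es"
    # (relation name, target lexicalization) pairs linking to other labels
    links: list = field(default_factory=list)


@dataclass
class Repository:
    """External store mapping ontology element URIs to their lexicalizations."""
    entries: dict = field(default_factory=dict)

    def add(self, element_uri, label, language):
        # Attach a new label to an ontology element without touching the ontology.
        lex = Lexicalization(label, language)
        self.entries.setdefault(element_uri, []).append(lex)
        return lex

    def link(self, source, relation, target):
        # Record a named relation between two lexicalizations.
        source.links.append((relation, target))

    def labels(self, element_uri, language):
        # Return all labels of an element in the requested language.
        return [
            lex.label
            for lex in self.entries.get(element_uri, [])
            if lex.language == language
        ]


# Hypothetical ontology element and labels, for illustration only.
RIVER = "http://example.org/onto#River"
repo = Repository()
river_en = repo.add(RIVER, "river", "en")
river_es = repo.add(RIVER, "río", "es")
repo.link(river_en, "hasTranslation", river_es)
```

Because the labels live outside the ontology, an application can attach as many or as few of them as it needs, and `repo.labels(RIVER, "es")` retrieves only the labels for one language.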
