Semi-Automatic Ontology Instantiation in the Domain of Risk Management

One of the challenging tasks in Ontological Engineering is to support, automatically or semi-automatically, the process of Ontology Learning and Ontology Population from semi-structured documents (texts). In this paper we describe a Semi-Automatic Ontology Instantiation method for natural language text in the domain of Risk Management. The method consists of three steps: 1) annotation with part-of-speech tags, 2) extraction of semantic relation instances, and 3) the ontology instantiation process. It is based on a combination of NLP techniques, with human intervention between steps 2 and 3 for control and validation. Since it relies heavily on linguistic knowledge, it is not domain dependent, which makes it portable across the different fields in which risk management is applied. The proposed methodology uses the ontology of the PRIMA project (supported by the European community) as a Generic Domain Ontology and populates it from an available corpus. A first validation of the approach is carried out through an experiment with Chemical Fact Sheets from the Environmental Protection Agency.
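The three steps can be illustrated with a minimal sketch. It is not the paper's implementation: the toy lexicon, the NOUN VERB NOUN pattern, the `Ontology` class, and all function names are hypothetical stand-ins chosen for illustration, and the `validate` callback plays the role of the human check between steps 2 and 3.

```python
from dataclasses import dataclass, field

# Hypothetical toy lexicon standing in for a real part-of-speech tagger.
TOY_LEXICON = {
    "chlorine": "NOUN", "irritation": "NOUN",
    "benzene": "NOUN", "cancer": "NOUN",
    "causes": "VERB", "cause": "VERB", "may": "AUX",
}


def pos_tag(sentence):
    """Step 1: annotate each token with a part-of-speech tag."""
    tokens = sentence.lower().rstrip(".").split()
    # Unknown words get the placeholder tag "X".
    return [(tok, TOY_LEXICON.get(tok, "X")) for tok in tokens]


def extract_relations(tagged):
    """Step 2: propose (subject, verb, object) candidates.

    Assumption: a relation instance is any NOUN VERB NOUN sequence
    once all other tags (auxiliaries, unknowns) are dropped.
    """
    content = [(w, t) for w, t in tagged if t in ("NOUN", "VERB")]
    triples = []
    for (w1, t1), (w2, t2), (w3, t3) in zip(content, content[1:], content[2:]):
        if (t1, t2, t3) == ("NOUN", "VERB", "NOUN"):
            triples.append((w1, w2, w3))
    return triples


@dataclass
class Ontology:
    """Hypothetical container: relation name -> set of (subject, object)."""
    instances: dict = field(default_factory=dict)

    def add(self, subj, rel, obj):
        self.instances.setdefault(rel, set()).add((subj, obj))


def instantiate(ontology, candidates, validate):
    """Step 3: add only the candidates accepted by the validator."""
    for subj, verb, obj in candidates:
        if validate((subj, verb, obj)):
            ontology.add(subj, verb, obj)
    return ontology
```

In an interactive setting, `validate` could prompt an expert to accept or reject each candidate triple; here it is left as a plain callable so that the control step stays explicit and separate from extraction.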
