Document annotation and ontology population from linguistic extractions

In this paper, we present a workbench for semi-automatic ontology population from textual documents. It provides an environment for mapping linguistic extractions onto the domain ontology by means of knowledge acquisition rules. A rule is triggered when a relevant linguistic tag is encountered. The tag is then mapped to a concept, to one of its attributes, or to a semantic relation between several concepts. The rules instantiate these concepts, attributes and relations in the knowledge base, which is constrained by the domain ontology. This paper describes the underlying knowledge capture process and presents the first experiments carried out on a real client application in the legal publishing domain.
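
The rule mechanism described above can be illustrated with a minimal Python sketch. It is not the workbench's actual implementation: the class names (`Ontology`, `KnowledgeBase`), the rule table `RULES`, the tag names and the sample extractions are all hypothetical, chosen only to show how a tag could trigger a rule that instantiates a concept, an attribute or a relation, with the ontology rejecting anything it does not define.

```python
from dataclasses import dataclass, field


@dataclass
class Ontology:
    """Hypothetical, simplified domain ontology."""
    concepts: set
    attributes: dict  # concept name -> set of allowed attribute names
    relations: dict   # relation name -> (domain concept, range concept)


@dataclass
class KnowledgeBase:
    """Stores instances and relations, checking each one against the ontology."""
    ontology: Ontology
    instances: dict = field(default_factory=dict)
    triples: list = field(default_factory=list)

    def add_instance(self, inst_id, concept):
        if concept not in self.ontology.concepts:
            raise ValueError(f"unknown concept: {concept}")
        self.instances[inst_id] = {"concept": concept, "attrs": {}}

    def set_attribute(self, inst_id, name, value):
        concept = self.instances[inst_id]["concept"]
        if name not in self.ontology.attributes.get(concept, set()):
            raise ValueError(f"{concept} has no attribute {name}")
        self.instances[inst_id]["attrs"][name] = value

    def add_relation(self, name, subj, obj):
        domain, range_ = self.ontology.relations[name]
        if (self.instances[subj]["concept"] != domain
                or self.instances[obj]["concept"] != range_):
            raise ValueError(f"{name} does not link {subj} to {obj}")
        self.triples.append((subj, name, obj))


# Assumed rule table: linguistic tag -> (kind of ontology element, its name).
RULES = {
    "CourtName": ("concept", "Court"),
    "DecisionRef": ("concept", "Decision"),
    "DecisionDate": ("attribute", "date"),
    "IssuedBy": ("relation", "issuedBy"),
}


def populate(extractions, rules, kb):
    """Apply the rule matching each tagged extraction; skip tags with no rule."""
    for ex in extractions:
        rule = rules.get(ex["tag"])
        if rule is None:
            continue
        kind, target = rule
        if kind == "concept":
            kb.add_instance(ex["id"], target)
        elif kind == "attribute":
            kb.set_attribute(ex["of"], target, ex["text"])
        elif kind == "relation":
            kb.add_relation(target, ex["from"], ex["to"])
    return kb


# Invented sample data, loosely inspired by the legal publishing domain.
ontology = Ontology(
    concepts={"Court", "Decision"},
    attributes={"Decision": {"date"}},
    relations={"issuedBy": ("Decision", "Court")},
)
extractions = [
    {"tag": "CourtName", "id": "court1", "text": "Example Court"},
    {"tag": "DecisionRef", "id": "dec1", "text": "Decision no. 1"},
    {"tag": "DecisionDate", "of": "dec1", "text": "2004-03-12"},
    {"tag": "IssuedBy", "from": "dec1", "to": "court1"},
    {"tag": "Punctuation", "text": ","},  # no rule: ignored
]
kb = populate(extractions, RULES, KnowledgeBase(ontology))
```

In this sketch the ontology acts purely as a validator: a rule that tries to create an undeclared concept, attribute or relation raises an error instead of silently adding it, which is one simple way to keep the knowledge base constrained by the ontology.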
