Paper Information - An Approach for Populating and Enriching Ontology-Based Repositories

An Approach for Populating and Enriching Ontology-Based Repositories

Publicly available text-based documents (e.g. news, meeting transcripts) are a very important source of knowledge, especially for organizations. These documents mention domain entities such as persons, places, professional positions, decisions and actions. Querying these documents (instead of browsing, searching and finding) is a very relevant task for any person in general, and particularly for professionals dealing with knowledge-intensive tasks. Querying the data in text-based documents, however, is not supported by common technology. To enable it, the content of such documents has to be explicitly and formally captured as facts in a knowledge base. Using automatic NLP processes to capture such facts is a common approach, but their relatively low precision and recall give rise to data quality problems. Furthermore, the facts found in the documents are often insufficient to answer complex queries, hence the need to enrich the captured facts with facts from third-party repositories (e.g. public LOD). This paper describes the process adopted to clean, populate and enrich a knowledge base repository that is then exploited to answer complex queries. This process is triggered by a prior NLP parsing process and driven by the (rich) ontology describing the repository.

Nuno Silva | Paulo Maio | Alda Canito

[1] Nuno Silva, et al. I3OM - An Iterative, Incremental and Interactive Approach for Ontology Navigation based on Ontology Modularization, 2012, KEOD.

[2] Nuno Silva, et al. Enhancing LOD Complex Query Building with Context, 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[3] Boris Motik, et al. User-Driven Ontology Evolution Management, 2002, EKAW.

[4] Atanas Kiryakov, et al. Semantic Annotation, Indexing, and Retrieval, 2003, SEMWEB.

[5] Thiago Alexandre Salgueiro Pardo, et al. Computational Processing of the Portuguese Language - 11th International Conference, PROPOR 2014, São Carlos/SP, Brazil, October 6-8, 2014. Proceedings, 2014, Lecture Notes in Computer Science.

[6] Michael J. Cafarella, et al. Ontology-driven, unsupervised instance population, 2008, J. Web Semant.

[7] Gregory Zacharewicz, et al. An ontology-driven framework towards building enterprise semantic information layer, 2013, Adv. Eng. Informatics.

[8] Jorge Baptista, et al. Computational Processing of the Portuguese Language, 2012, Lecture Notes in Computer Science.

[9] Boris Motik, et al. OWL 2 Web Ontology Language: structural specification and functional-style syntax, 2008.

[10] Yarden Katz, et al. Pellet: A practical OWL-DL reasoner, 2007, J. Web Semant.