Paper information - Information Extraction from the Web: An Ontology-Based Method Using Inductive Logic Programming

Information Extraction from the Web: An Ontology-Based Method Using Inductive Logic Programming

Extracting relevant information from text, and from web pages in particular, is an intensive and time-consuming task that requires substantial semantic resources. To be efficient, automatic information extraction systems therefore have to exploit semantic resources (or ontologies) and employ machine-learning techniques that make them more adaptive. This paper presents an ontology-based information extraction method that uses Inductive Logic Programming to induce symbolic predicates, expressed in Horn clause logic, that subsume information extraction rules. Such rules allow the system to extract class and relation instances from English corpora for ontology population. Several experiments were conducted, and the preliminary results are promising: they show that the proposed approach improves on previous work in extracting instances of classes and relations, either separately or together.

Bernard Espinasse | Rinaldo Lima | Frederico Luiz Gonçalves de Freitas | Hilário Oliveira | Laura Pentagrossa
