Incremental Enrichment of Ontologies through Feature-based Pattern Variations

ABSTRACT In this paper, we propose a model that enriches an ontology by incrementally extending its relations through variations of patterns. To generalize the initial patterns, combinations of features are considered as candidate patterns. The candidate patterns are used to extract relations from Wikipedia and are then ranked by reliability based on corpus frequency. The selected patterns are used to extract relations, and the extracted relations are in turn used to extend the patterns of each relation. By varying patterns during this incremental enrichment process, the range of pattern selection is broadened and refined, which can increase both the coverage and the accuracy of the extracted relations. In experiments with single-feature pattern models, we observe that lexical, headword, and hypernym features provide reliable information, while POS and syntactic features provide general information that is useful for enriching relations. Based on observations of which feature types are appropriate for each syntactic unit type, we propose, as ongoing work, a pattern model based on the composition of features.

Key Words: ontology enrichment, relation extraction, pattern variation, ontology evolution, incremental relational learning
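The incremental loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the feature names (`lex`, `pos`), the toy corpus, the thresholds, and in particular the reliability score (the share of a pattern's corpus matches that are already known relations) are assumptions made for illustration, since the abstract only states that reliability is based on corpus frequency.

```python
from itertools import product

def tok(lex, pos):
    # A context token as a feature dictionary (hypothetical feature set).
    return {"lex": lex, "pos": pos}

def variations(context, features=("lex", "pos")):
    """Candidate patterns: choose one feature per context token."""
    choices = [[(f, t[f]) for f in features] for t in context]
    return set(product(*choices))

def matches(pattern, context):
    return len(pattern) == len(context) and all(
        t.get(f) == v for (f, v), t in zip(pattern, context))

def enrich(corpus, seeds, min_reliability=0.6, min_freq=2, rounds=5):
    """corpus: list of (arg1, context, arg2); seeds: set of (arg1, arg2)."""
    relations, patterns = set(seeds), set()
    for _ in range(rounds):
        # 1. Generalize contexts of known relations into candidate patterns.
        candidates = set()
        for a, ctx, b in corpus:
            if (a, b) in relations:
                candidates |= variations(ctx)
        # 2. Keep candidates that are frequent and mostly match known pairs
        #    (assumed reliability measure, see lead-in).
        selected = set()
        for p in candidates:
            hits = [(a, b) for a, ctx, b in corpus if matches(p, ctx)]
            known = sum(1 for pair in hits if pair in relations)
            if len(hits) >= min_freq and known / len(hits) >= min_reliability:
                selected.add(p)
        # 3. Extract relations with the selected patterns.
        extracted = {(a, b) for a, ctx, b in corpus
                     if any(matches(p, ctx) for p in selected)}
        if extracted <= relations and selected <= patterns:
            break  # nothing new was learned in this round
        patterns |= selected
        relations |= extracted
    return relations, patterns

# Toy corpus: each entry is (argument 1, context tokens, argument 2).
corpus = [
    ("smoking", [tok("causes", "VBZ")], "cancer"),
    ("rain", [tok("causes", "VBZ")], "flooding"),
    ("virus", [tok("causes", "VBZ")], "flu"),
    ("rain", [tok("triggers", "VBZ")], "flooding"),
    ("virus", [tok("triggers", "VBZ")], "flu"),
    ("stress", [tok("triggers", "VBZ")], "insomnia"),
] + [(city, [tok("hosts", "VBZ")], site) for city, site in [
    ("Paris", "Louvre"), ("Rome", "Vatican"), ("Oslo", "Nobel"),
    ("Bern", "Bundeshaus"), ("Cairo", "Museum")]]

seeds = {("smoking", "cancer"), ("rain", "flooding")}
relations, patterns = enrich(corpus, seeds)
```

In this toy run, the lexical pattern for "causes" is selected first and yields a new pair, which in a later round makes the lexical pattern for "triggers" reliable enough to be selected as well. The POS-only pattern matches every sentence and stays below the threshold, which loosely mirrors the abstract's distinction between reliable and general features.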
