Instance-Based Ontology Population Exploiting Named-Entity Substitution

We present an approach to ontology population based on a lexical substitution technique. It consists in estimating the plausibility of sentences where the named entity to be classified is substituted with the ones contained in the training data, in our case, a partially populated ontology. Plausibility is estimated by using Web data, while the classification algorithm is instance-based. We evaluated our method on two different ontology population tasks. Experiments show that our solution is effective, outperforming existing methods, and it can be applied to practical ontology population problems.

[1]  Zellig S. Harris,et al.  Distributional Structure , 1954 .

[2]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[3]  C. Fellbaum An Electronic Lexical Database , 1998 .

[4]  Sergey Brin,et al.  Extracting Patterns and Relations from the World Wide Web , 1998, WebDB.

[5]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[6]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[7]  Philip Resnik,et al.  Tagger Evaluation Given Hierarchical Tag Sets , 2000, Comput. Humanit..

[8]  Eduard H. Hovy,et al.  Fine Grained Classification of Named Entities , 2002, COLING.

[9]  Diana McCarthy,et al.  Lexical Substitution as a Task for WSD Evaluation , 2002, SENSEVAL.

[10]  Suresh Manandhar,et al.  Extending a Lexical Ontology by a Combination of Distributional Semantics Signatures , 2002, EKAW.

[11]  Axiomatizing WordNet Glosses in the OntoWordNet Project , 2003 .

[12]  Johanna Völker,et al.  Towards large-scale, open-domain and ontology-based named entity classification , 2005 .

[13]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[14]  Steffen Staab,et al.  Gimme' the context: context-driven automatic semantic annotation with C-PANKOW , 2005, WWW '05.

[15]  Bernardo Magnini,et al.  Weakly Supervised Approaches for Ontology Population , 2008, EACL.

[16]  Roberto Navigli,et al.  SemEval-2007 Task 10: English Lexical Substitution Task , 2007, Fourth International Workshop on Semantic Evaluations (SemEval-2007).

[17]  Claudio Giuliano,et al.  Instance Based Lexical Entailment for Ontology Population , 2007, EMNLP-CoNLL.

[18]  Carlo Strapparava,et al.  FBK-irst: Lexical Substitution Task Exploiting Domain and Syntagmatic Coherence , 2007, SemEval@ACL.

[19]  Diana McCarthy,et al.  SemEval-2007 Task 10: English Lexical Substitution Task , 2007, *SEMEVAL.