Extraction of Hypernymy Information from Text∗

This chapter presents the results of three studies in extracting hypernymy information from a text. In the first, a method based on a single extraction pattern applied to the web is compared with a set of patterns applied to a big corpus. In the second study, it is examined how relation extraction can be performed reliably from a text without having access to a word sense tagger. And in a third experiment, it is checked what the effect of elaborate syntactic information has on the extraction process. Both using more data and the removal of ambiguities from the training data is found to be beneficial for the extraction process. But it is surprising to find a positive effect of additional syntactic information.

[1]  Gertjan van Noord,et al.  The Alpino Dependency Treebank , 2001, CLIN.

[2]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[3]  Marti A. Hearst Automatic Acquisition of Hyponyms from Large Text Corpora , 1992, COLING.

[4]  David Madigan,et al.  Large-Scale Bayesian Logistic Regression for Text Categorization , 2007, Technometrics.

[5]  Daniel Jurafsky,et al.  Learning Syntactic Patterns for Automatic Hypernym Discovery , 2004, NIPS.

[6]  Isa Maks,et al.  Integrating Lexical Units, Synsets and Ontology in the Cornetto Database , 2008, LREC.

[7]  Piek Vossen,et al.  EuroWordNet: A multilingual database with lexical semantic networks , 1998, Springer Netherlands.

[8]  E. F. Tjong Kim Sang To Use a Treebank or Not - Which Is Better for Hypernym Extraction? , 2009 .

[9]  Katja Hofmann,et al.  Automatic Extraction of Dutch Hypernym-Hyponym Pairs , 2007, CLIN 2007.

[10]  Piet Mertens,et al.  Verbum ex machina. Actes de la 13e conférence sur le traitement automatique des langues naturelles. , 2006 .

[11]  Sharon A. Caraballo Automatic construction of a hypernym-labeled noun hierarchy from text , 1999, ACL.

[12]  Gertjan van Noord,et al.  At Last Parsing Is Now Operational , 2006, JEPTALNRECITAL.

[13]  Frank Van Eynde Part of Speech Tagging en Lemmatisering , 2003 .

[14]  Gertjan van Noord Huge Parsed Corpora in LASSY , 2008 .

[15]  Valentin Jijkoun,et al.  Information Extraction for Question Answering: Improving Recall Through Syntactic Patterns , 2004, COLING.

[16]  Lonneke van der Plas,et al.  Automatic Acquisition of Lexico-semantic Knowledge for QA , 2005, IJCNLP.

[17]  Katja Hofmann,et al.  Automatic extension of non-english wordnets , 2007, SIGIR.

[18]  Dan Roth,et al.  Exploring evidence for shallow parsing , 2001, CoNLL.

[19]  Gilad Mishne,et al.  Learning domain ontologies for Web service descriptions: an experiment in bioinformatics , 2005, WWW '05.

[20]  Julie Weeds,et al.  Unsupervised Acquisition of Predominant Word Senses , 2007, CL.