Leveraging WordNet Paths for Neural Hypernym Prediction

We formulate hypernym prediction as a sequence generation task in which the sequences are taxonomy paths in WordNet. Our experiments with encoder-decoder models show that training to generate taxonomy paths can improve the performance of direct hypernym prediction. The hypo2path model, though simple, achieves state-of-the-art performance, outperforming the best benchmark by 4.11 points in hit-at-one (H@1).
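To make the formulation concrete, the following is a minimal sketch of how a hyponym could be paired with a taxonomy path as a target sequence, and how H@1 could be scored. It is an illustration under stated assumptions, not the authors' implementation: the toy hypernym map stands in for WordNet, the path is assumed to run from the root down to the direct hypernym, and the names `TOY_HYPERNYMS`, `taxonomy_path`, `make_training_pair`, and `hits_at_one` are hypothetical.

```python
# Toy child -> direct-hypernym map. This is a hypothetical stand-in for
# WordNet, which is far larger and allows multiple hypernyms per synset.
TOY_HYPERNYMS = {
    "dog": "canine",
    "canine": "carnivore",
    "carnivore": "mammal",
    "mammal": "animal",
    "animal": "entity",
    "cat": "feline",
    "feline": "carnivore",
}


def taxonomy_path(word, hypernyms):
    """Return the path from the root down to the direct hypernym of `word`.

    Assumption: the target sequence is ordered root-first, so the direct
    hypernym is the last token. The paper may order or tokenize it differently.
    """
    path = []
    node = hypernyms.get(word)
    while node is not None:
        path.append(node)
        node = hypernyms.get(node)
    return list(reversed(path))


def make_training_pair(word, hypernyms):
    """Build one (source, target) pair for an encoder-decoder model:
    the source is the hyponym, the target is its taxonomy path."""
    return word, taxonomy_path(word, hypernyms)


def hits_at_one(predicted_paths, gold_hypernyms):
    """Fraction of examples whose top predicted path ends in the gold
    direct hypernym (under the root-first ordering assumed above)."""
    if not gold_hypernyms:
        raise ValueError("gold_hypernyms must not be empty")
    hits = sum(
        1
        for path, gold in zip(predicted_paths, gold_hypernyms)
        if path and path[-1] == gold
    )
    return hits / len(gold_hypernyms)


if __name__ == "__main__":
    source, target = make_training_pair("dog", TOY_HYPERNYMS)
    print(source, "->", " ".join(target))
```

In this sketch a model that emits the full path still yields a direct hypernym prediction, because the last token of the generated path can be read off and compared with the gold hypernym.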
