MCVAE: Margin-based Conditional Variational Autoencoder for Relation Classification and Pattern Generation

Relation classification is a basic yet important task in natural language processing. Existing relation classification approaches mainly rely on distant supervision, which assumes that a bag of sentences mentioning a pair of entities and extracted from a given corpus should express the same relation type of this entity pair. The training of these models needs a lot of high-quality bag-level data. However, in some specific domains, such as medical domain, it is difficult to obtain sufficient and high-quality sentences in a text corpus that mention two entities with a certain medical relation between them. In such a case, it is hard for existing discriminative models to capture the representative features (i.e., common patterns) from diversely expressed entity pairs with a given relation. Thus, the classification performance cannot be guaranteed when limited features are obtained from the corpus. To address this challenge, in this paper, we propose to employ a generative model, called conditional variational autoencoder (CVAE), to handle the pattern sparsity. We define that each relation has an individually learned latent distribution from all possible sentences expressing this relation. As these distributions are learned based on the purpose of input reconstruction, the model's classification ability may not be strong enough and should be improved. By distinguishing the differences among different relation distributions, a margin-based regularizer is designed, which leads to a margin-based CVAE (MCVAE) that can significantly enhance the classification ability. Besides, MCVAE can automatically generate semantically meaningful patterns that describe the given relations. Experiments on two real-world datasets validate the effectiveness of the proposed MCVAE on the tasks of relation classification and relation-specific pattern generation.

[1]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[2]  Diego Marcheggiani,et al.  Discrete-State Variational Autoencoders for Joint Discovery and Factorization of Relations , 2016, TACL.

[3]  Isabelle Tellier,et al.  Unsupervised Relation Extraction in Specialized Corpora Using Sequence Mining , 2016, IDA.

[4]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[5]  Samy Bengio,et al.  Generating Sentences from a Continuous Space , 2015, CoNLL.

[6]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[7]  William Yang Wang,et al.  Robust Distant Supervision Relation Extraction via Deep Reinforcement Learning , 2018, ACL.

[8]  Roberto Basili,et al.  Kernel-based relation extraction from investigative data , 2009, AND '09.

[9]  Christophe Gravier,et al.  Unsupervised Open Relation Extraction , 2017, ESWC.

[10]  Zhiyuan Liu,et al.  Neural Relation Extraction with Selective Attention over Instances , 2016, ACL.

[11]  Alessandro Moschitti,et al.  Convolution Kernels on Constituent, Dependency and Sequential Structures for Relation Extraction , 2009, EMNLP.

[12]  Alex Graves,et al.  DRAW: A Recurrent Neural Network For Image Generation , 2015, ICML.

[13]  Marie-Francine Moens,et al.  Structured Learning for Temporal Relation Extraction from Clinical Records , 2017, EACL.

[14]  Xiaoyan Zhu,et al.  A Unified Active Learning Framework for Biomedical Relation Extraction , 2012, Journal of Computer Science and Technology.

[15]  Alexander Löser,et al.  Interactive Relation Extraction in Main Memory Database Systems , 2016, COLING.

[16]  Zhifang Sui,et al.  A Soft-label Method for Noise-tolerant Distantly Supervised Relation Extraction , 2017, EMNLP.

[17]  Zhe Gan,et al.  Variational Autoencoder for Deep Learning of Images, Labels and Captions , 2016, NIPS.

[18]  Diederik P. Kingma,et al.  Stochastic Gradient VB and the Variational Auto-Encoder , 2013 .

[19]  Ramesh Nallapati,et al.  Multi-instance Multi-label Learning for Relation Extraction , 2012, EMNLP.

[20]  Shantanu Kumar,et al.  A Survey of Deep Learning Methods for Relation Extraction , 2017, ArXiv.

[21]  Mark Steedman,et al.  Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning , 2012 .

[22]  Philip S. Yu,et al.  On the Generative Discovery of Structured Medical Knowledge , 2018, KDD.

[23]  Andrew McCallum,et al.  Modeling Relations and Their Mentions without Labeled Text , 2010, ECML/PKDD.

[24]  Luke S. Zettlemoyer,et al.  Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.

[25]  Ying Tan,et al.  Variational Autoencoder for Semi-Supervised Text Classification , 2017, AAAI.

[26]  Alexander Löser,et al.  Effective Selectional Restrictions for Unsupervised Relation Extraction , 2013, IJCNLP.

[27]  Roberto Basili,et al.  Kernel-Based Learning for Domain-Specific Relation Extraction , 2009, AI*IA.

[28]  Dmitry Zelenko,et al.  Kernel methods for relation extraction , 2003 .

[29]  Haofen Wang,et al.  Effective Chinese Relation Extraction by Sentence Rolling and Candidate Ranking , 2013, CSWS.

[30]  Ralph Grishman,et al.  Semi-supervised Relation Extraction with Large-scale Word Clustering , 2011, ACL.

[31]  Lidong Bing,et al.  Using Graphs of Classifiers to Impose Constraints on Semi-supervised Relation Extraction , 2016, AKBC@NAACL-HLT.

[32]  Max Welling,et al.  Semi-supervised Learning with Deep Generative Models , 2014, NIPS.

[33]  Alessandro Moschitti,et al.  Embedding Semantic Similarity in Tree Kernels for Domain Adaptation of Relation Extraction , 2013, ACL.

[34]  Razvan C. Bunescu,et al.  Subsequence Kernels for Relation Extraction , 2005, NIPS.

[35]  Xiaojun Chen,et al.  Supervised Neural Models Revitalize the Open Relation Extraction , 2019, ArXiv.

[36]  Jian Su,et al.  Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[37]  Alessandro Moschitti,et al.  Self-Crowdsourcing Training for Relation Extraction , 2017, ACL.

[38]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[39]  Li Zhao,et al.  Reinforcement Learning for Relation Classification From Noisy Data , 2018, AAAI.

[40]  Jun Zhao,et al.  Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.

[41]  Bo Zhang,et al.  Max-Margin Deep Generative Models for (Semi-)Supervised Learning , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[42]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[43]  Jun Zhao,et al.  Distant Supervision for Relation Extraction with Sentence-Level Attention and Entity Descriptions , 2017, AAAI.