End-to-end relation extraction based on bootstrapped multi-level distant supervision

Distant supervised relation extraction has been widely used to identify new relation facts from free text, since the existence of knowledge base helps these models to build a large dataset with few human intervention and low costs of manpower and time. However, the existing Distant Supervised models are all based on the single-node classifier so that they suffer from the serious false categorization problem especially for the existence of thousands of relations. In this paper, we novelly propose an end-to-end model for relation extraction based on distant supervision. Our model divides the original categorization task into a number of sub-tasks which focus on the construction of tree-like categorization structure in multiple levels. With the tree-like structure, an unlabelled relation instance can be categorized step by step along a path from the root node to a leaf node. An additional benefit of this structure is that it can be used to select negative samples from training data for each child node. In addition, to the best of our knowledge, no effort has been made to update the categorization model with new identified relation facts, which hinders the improvement of the extraction precision and recall. Although bootstrapping methods can contribute to improving the situation, they need additional calculation to evaluate the quality of extracted patterns or tuples when selecting new instances for next iterations. In this paper, we propose to do bootstrapped distant supervision to update the distant supervision model with new learned relation facts iteratively, and we can use scores directly gotten in the model to evaluate the quality of instances instead of additional calculation. As a result, we can further improve the extraction precision and recall. To save the time costs and manpower, we also propose an adaptive method by mapping function to choose the suitable thresholds for each iteration without manual choice rather than using the fixed thresholds. Experimental results conducted on three real datasets prove that our approach outperforms state-of-the-art approaches by reaching 12+% better extraction quality.

[1]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[2]  Luis Gravano,et al.  Snowball: extracting relations from large plain-text collections , 2000, DL '00.

[3]  Luke S. Zettlemoyer,et al.  Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations , 2011, ACL.

[4]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[5]  Jie Zhao,et al.  Business Relations in the Web: Semantics and a Case Study , 2010, J. Softw..

[6]  Bowen Zhou,et al.  Classifying Relations by Ranking with Convolutional Neural Networks , 2015, ACL.

[7]  Jun Zhao,et al.  Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.

[8]  Xianpei Han,et al.  Global Distant Supervision for Relation Extraction , 2016, AAAI.

[9]  Yoshua Bengio,et al.  A Neural Probabilistic Language Model , 2003, J. Mach. Learn. Res..

[10]  Yi Yang,et al.  Overcoming Semantic Drift in Information Extraction , 2014, EDBT.

[11]  Guodong Zhou,et al.  Tree Kernel-Based Relation Extraction with Context-Sensitive Structured Parse Tree Information , 2007, EMNLP.

[12]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[13]  Lawrence Carin,et al.  Multi-Task Learning for Classification with Dirichlet Process Priors , 2007, J. Mach. Learn. Res..

[14]  Robinson Piramuthu,et al.  HD-CNN: Hierarchical Deep Convolutional Neural Networks for Large Scale Visual Recognition , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[15]  Jun Zhao,et al.  Distant Supervision for Relation Extraction with Sentence-Level Attention and Entity Descriptions , 2017, AAAI.

[16]  Nitish Srivastava,et al.  Discriminative Transfer Learning with Tree-based Priors , 2013, NIPS.

[17]  Andrew McCallum,et al.  Modeling Relations and Their Mentions without Labeled Text , 2010, ECML/PKDD.

[18]  Ronen Feldman,et al.  Using Corpus Statistics on Entities to Improve Semi-supervised Relation Extraction from the Web , 2007, ACL.

[19]  Luis Gravano,et al.  Extracting Relations from Large Plain-Text Collections , 1999 .

[20]  Yoshua Bengio,et al.  Neural Probabilistic Language Models , 2006 .

[21]  Andrew Y. Ng,et al.  Semantic Compositionality through Recursive Matrix-Vector Spaces , 2012, EMNLP.

[22]  Aron Culotta,et al.  Dependency Tree Kernels for Relation Extraction , 2004, ACL.

[23]  Zhiyuan Liu,et al.  Neural Relation Extraction with Selective Attention over Instances , 2016, ACL.

[24]  Jun Zhao,et al.  Relation Classification via Convolutional Deep Neural Network , 2014, COLING.

[25]  Luis Gravano,et al.  Extracting relations from large text collections , 2005 .

[26]  Rong Xu,et al.  Combining automatic table classification and relationship extraction in extracting anticancer drug-side effect pairs from full-text articles , 2015, J. Biomed. Informatics.

[27]  Daniel Jurafsky,et al.  Distant supervision for relation extraction without labeled data , 2009, ACL.

[28]  Joshua B. Tenenbaum,et al.  Learning to Learn with Compound HD Models , 2011, NIPS.

[29]  Jian Su,et al.  Exploring Various Knowledge in Relation Extraction , 2005, ACL.

[30]  Pietro Perona,et al.  Unsupervised learning of visual taxonomies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Haixun Wang,et al.  Probase: a probabilistic taxonomy for text understanding , 2012, SIGMOD Conference.

[32]  Guodong Zhou,et al.  Tree kernel-based semantic relation extraction with rich syntactic and semantic information , 2010, Inf. Sci..

[33]  Jie Ma,et al.  Bootstrapped Multi-level Distant Supervision for Relation Extraction , 2018, WISE.

[34]  Jun'ichi Tsujii,et al.  From Protein-Protein Interaction to Molecular Event Extraction , 2009, BioNLP@HLT-NAACL.

[35]  Ramesh Nallapati,et al.  Multi-instance Multi-label Learning for Relation Extraction , 2012, EMNLP.

[36]  Eugene Agichtein,et al.  When a Knowledge Base Is Not Enough: Question Answering over Knowledge Bases with External Text Data , 2016, SIGIR.

[37]  Razvan C. Bunescu,et al.  A Shortest Path Dependency Kernel for Relation Extraction , 2005, HLT.

[38]  Estevam R. Hruschka,et al.  Toward an Architecture for Never-Ending Language Learning , 2010, AAAI.