A novel deep learning method for extracting unspecific biomedical relation

Biomedical relation extraction is an important research subject in Natural language processing (NLP). Deep learning technology has shown greater value in improving accuracy of relation extraction results recently. Existing methods mostly focus on extracting (1) specific relation from short texts (eg, drug‐drug interaction and protein‐protein interaction) and (2) unspecific relation from full text corpora. However, extracting unspecific relation from short text, which is more and more important in practical use, is rarely studied. In this paper, a new model called MAT‐LSTM is proposed to extract unspecific relation from short text in biomedical literatures. Experiments on two Biocreative benchmark datasets and one BioNLP benchmark datasets were made to measure the validity of the proposed model MAT‐LSTM, and better performance is achieved. The MAT‐LSTM model is also applied practically in extracting unspecific relation contained in the PubMed literatures. The results extracted from PubMed by using the proposed model were verified by experts mostly, indicating the practical value of the MAT‐LSTM model.

[1]  Zhiyuan Liu,et al.  Neural Relation Extraction with Selective Attention over Instances , 2016, ACL.

[2]  C. Ko,et al.  Fabrication of Novel Hydrogel with Berberine-Enriched Carboxymethylcellulose and Hyaluronic Acid as an Anti-Inflammatory Barrier Membrane , 2016, BioMed research international.

[3]  Xin Rong,et al.  word2vec Parameter Learning Explained , 2014, ArXiv.

[4]  Sampo Pyysalo,et al.  Overview of the Entity Relations (REL) supporting task of BioNLP Shared Task 2011 , 2011, BioNLP@ACL.

[5]  Jürgen Schmidhuber,et al.  Learning to Forget: Continual Prediction with LSTM , 2000, Neural Computation.

[6]  Yaoyun Zhang,et al.  CD-REST: a system for extracting chemical-induced disease relation in literature , 2016, Database J. Biol. Databases Curation.

[7]  Xin Yao,et al.  Clickbait Convolutional Neural Network , 2018, Symmetry.

[8]  Guanglu Sun,et al.  Internet Traffic Classification Based on Incremental Support Vector Machines , 2018, Mob. Networks Appl..

[9]  Zhiyong Lu,et al.  Text mining for precision medicine: automating disease-mutation relationship extraction from biomedical literature , 2016, J. Am. Medical Informatics Assoc..

[10]  Terri K. Attwood,et al.  BioIE: extracting informative sentences from the biomedical literature , 2005, Bioinform..

[11]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[12]  Xiao Sun,et al.  Multichannel Convolutional Neural Network for Biological Relation Extraction , 2016, BioMed research international.

[13]  Hongfang Liu,et al.  Attention-based Neural Networks for Chemical Protein Relation Extraction , 2017 .

[14]  Shuai Liu,et al.  Fractal generation method based on asymptote family of generalized Mandelbrot set and its application , 2017 .

[15]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[16]  Shasha Li,et al.  Drug-Drug Interaction Extraction via Recurrent Neural Network with Multiple Attention Layers , 2017, ADMA.

[17]  Feng Xiao,et al.  Network traffic classification based on transfer learning , 2018, Comput. Electr. Eng..

[18]  Razvan C. Bunescu,et al.  Consolidating the set of known human protein-protein interactions in preparation for large-scale mapping of the human interactome , 2005, Genome Biology.

[19]  Zhiyong Lu,et al.  BioCreative V CDR task corpus: a resource for chemical disease relation extraction , 2016, Database J. Biol. Databases Curation.

[20]  Casimir A. Kulikowski,et al.  A method for exploring implicit concept relatedness in biomedical knowledge network , 2016, BMC Bioinformatics.

[21]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[22]  Shuai Liu,et al.  A Novel Distance Metric: Generalized Relative Entropy , 2017, Entropy.

[23]  Zhiyong Lu,et al.  PubTator: a web-based text mining tool for assisting biocuration , 2013, Nucleic Acids Res..

[24]  P. Warner Ordinal logistic regression , 2008, Journal of Family Planning and Reproductive Health Care.

[25]  Xiaochun Cheng,et al.  Numeric characteristics of generalized M-set with its asymptote , 2014, Appl. Math. Comput..

[26]  Jun Zhao,et al.  Distant Supervision for Relation Extraction via Piecewise Convolutional Neural Networks , 2015, EMNLP.

[27]  Patrick Schorderet,et al.  NEAT: a framework for building fully automated NGS pipelines and analyses , 2016, BMC Bioinformatics.

[28]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[29]  Peer Bork,et al.  Extraction of regulatory gene/protein networks from Medline , 2006, Bioinform..

[30]  Tian Bai,et al.  Gene-Disease Interaction Retrieval from Multiple Sources: A Network Based Method , 2016, BioMed research international.

[31]  Xiaoyan Zhu,et al.  Building Disease-Specific Drug-Protein Connectivity Maps from Molecular Interaction Networks and PubMed Abstracts , 2009, PLoS Comput. Biol..