Recognizing Nested Named Entity in Biomedical Texts: A Neural Network Model with Multi-Task Learning

Many named entities usually contain nested entities in biomedical texts. Nested entities pose challenge to the task of named entity recognition. Traditional methods try to solve the problem as a graph-structure prediction problem. However, these methods fail to sufficiently capture the boundaries information between nested entities, which limits the performance of the task. In this paper, we take a different view by solving each unique entity type as a separate task, using multi-task learning with dispatched attention to facilitate information exchange between tasks. Results on GENIA corpus show that the proposed method is highly effective, obtaining the best results in the literature.

[1]  Evgueni A. Haroutunian,et al.  Information Theory and Statistics , 2011, International Encyclopedia of Statistical Science.

[2]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[3]  Xiaohui Liang,et al.  A hybrid neural network model for predicting kidney disease in hypertension patients based on electronic health records , 2019, BMC Medical Informatics and Decision Making.

[4]  Benjamin Roth,et al.  Position-aware Self-attention with Relative Positional Encodings for Slot Filling , 2018, ArXiv.

[5]  Sophia Ananiadou,et al.  A Neural Layered Model for Nested Named Entity Recognition , 2018, NAACL.

[6]  Luke S. Zettlemoyer,et al.  Deep Semantic Role Labeling: What Works and What’s Next , 2017, ACL.

[7]  Yue Zhang,et al.  Context-Sensitive Twitter Sentiment Classification Using Neural Network , 2016, AAAI.

[8]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[9]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[10]  Christopher D. Manning,et al.  Nested Named Entity Recognition , 2009, EMNLP.

[11]  Dan Roth,et al.  Joint Mention Extraction and Classification with Mention Hypergraphs , 2015, EMNLP.

[12]  Wei Lu,et al.  Neural Segmental Hypergraphs for Overlapping Mention Recognition , 2018, EMNLP.

[13]  Han Ren,et al.  Context-augmented convolutional neural networks for twitter sarcasm detection , 2018, Neurocomputing.

[14]  Beatrice Alex,et al.  Recognising Nested Named Entities in Biomedical Text , 2007, BioNLP@ACL.

[15]  Dan Roth,et al.  The Importance of Syntactic Parsing and Inference in Semantic Role Labeling , 2008, CL.

[16]  Jun'ichi Tsujii,et al.  GENIA corpus - a semantically annotated corpus for bio-textmining , 2003, ISMB.

[17]  Deyu Zhou,et al.  Position-aware deep multi-task learning for drug-drug interaction extraction , 2018, Artif. Intell. Medicine.

[18]  Chien-Chang Chen,et al.  Detecting spamming reviews using long short-term memory recurrent neural network framework , 2018, ICEEG '18.

[19]  Beth Levin,et al.  English Verb Classes and Alternations: A Preliminary Investigation , 1993 .

[20]  Hongxia Jin,et al.  A Neural Transition-based Model for Nested Mention Recognition , 2018, EMNLP.

[21]  Mirella Lapata,et al.  Neural Semantic Role Labeling with Dependency Path Embeddings , 2016, ACL.

[22]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[23]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[24]  Diego Marcheggiani,et al.  Encoding Sentences with Graph Convolutional Networks for Semantic Role Labeling , 2017, EMNLP.

[25]  Wei Lu,et al.  Labeling Gaps Between Words: Recognizing Overlapping Mentions with Mention Separators , 2017, EMNLP.

[26]  Danqi Chen,et al.  Position-aware Attention and Supervised Data Improve Slot Filling , 2017, EMNLP.

[27]  Claire Cardie,et al.  Nested Named Entity Recognition Revisited , 2018, NAACL.

[28]  Hui Jiang,et al.  A FOFE-based Local Detection Approach for Named Entity Recognition and Mention Detection , 2016, ArXiv.

[29]  Laizhong Cui,et al.  Combine HowNet lexicon to train phrase recursive autoencoder for sentence-level sentiment analysis , 2017, Neurocomputing.