Named Entity Recognition in Chinese Electronic Medical Record Using Attention Mechanism

In order to improve the accuracy of the NER (Named entity recognition) model on the Chinese EMR (Electronic Medical Record), we propose an effective deep learning model based on the attention mechanism and bootstrapping method to identify entity and relationship on the Chinese EMR. For the attention-based deep learning model, we first convert words into word vectors and input them into Bi-directional Long Short-Term Memory (BiLSTM). Then we adopt attention mechanism to assign different attention weights on hidden layer states at different words, thus the semantic features can focus on the words related to entity vocabulary, making up for the deficiencies of the BiLSTM model. Moreover, we employ the transformation matrix in Conditional Random Field (CRF) to solve the labeling bias problem in BiLSTM, thus we can obtain a global optimal tagging for the input sentences. Next, we use the bootstrapping methods to expand the labeled EMR corpus for overcoming the scarcity of the EMR corpus. Finally, the extensive experimental evaluation on Chinese EMR shows the superiority of the proposed approach under multiple evaluation criteria.

[1]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[2]  Li Shaozi,et al.  Chinese organization name recognition based on co-training algorithm , 2008, 2008 3rd International Conference on Intelligent System and Knowledge Engineering.

[3]  Christopher D. Manning,et al.  Effective Approaches to Attention-based Neural Machine Translation , 2015, EMNLP.

[4]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[5]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[6]  Junling Hu,et al.  Bootstrapped Named Entity Recognition for Product Attribute Extraction , 2011, EMNLP.

[7]  Hao Wang,et al.  Sentiment Analysis of Chinese Microblog Based on Stacked Bidirectional LSTM , 2019, IEEE Access.

[8]  Nanyun Peng,et al.  Improving Named Entity Recognition for Chinese Social Media with Word Segmentation Representation Learning , 2016, ACL.

[9]  Laurence T. Yang,et al.  A Distributed HOSVD Method With Its Incremental Computation for Big Data in Cyber-Physical-Social Systems , 2018, IEEE Transactions on Computational Social Systems.

[10]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[11]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[12]  Van Nostrand,et al.  Error Bounds for Convolutional Codes and an Asymptotically Optimum Decoding Algorithm , 1967 .

[13]  Lei Huang,et al.  A multiclass classification method based on deep learning for named entity recognition in electronic medical records , 2016, 2016 New York Scientific Data Summit (NYSDS).

[14]  Shaocheng Tong,et al.  Composite Adaptive Fuzzy Output Feedback Control Design for Uncertain Nonlinear Strict-Feedback Systems With Input Saturation , 2015, IEEE Transactions on Cybernetics.

[15]  Hwee Tou Ng,et al.  An Unsupervised Neural Attention Model for Aspect Extraction , 2017, ACL.

[16]  Laurence T. Yang,et al.  A Multi-Order Distributed HOSVD with Its Incremental Computing for Big Services in Cyber-Physical-Social Systems , 2020, IEEE Transactions on Big Data.

[17]  Eugene Charniak,et al.  Effective Self-Training for Parsing , 2006, NAACL.

[18]  Bowen Zhou,et al.  A Structured Self-attentive Sentence Embedding , 2017, ICLR.

[19]  Joakim Nivre,et al.  Multilingual Named Entity Recognition using Hybrid Neural Networks , 2016 .

[20]  Laurence T. Yang,et al.  A Tensor-Based Big Service Framework for Enhanced Living Environments , 2016, IEEE Cloud Computing.

[21]  Hong Peng,et al.  Chinese Named Entity Recognition using Support Vector Machines , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[22]  Sivaji Bandyopadhyay,et al.  Bengali Named Entity Recognition Using Support Vector Machine , 2008, IJCNLP.

[23]  Makoto Miwa,et al.  End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures , 2016, ACL.

[24]  Lang Jun Number Type Recognition of Chinese Personal Noun Phrase , 2008 .

[25]  Laurence T. Yang,et al.  Improved Multi-Order Distributed HOSVD with Its Incremental Computing for Smart City Services , 2018, IEEE Transactions on Sustainable Computing.

[26]  Ellen Riloff,et al.  A Bootstrapping Method for Learning Semantic Lexicons using Extraction Pattern Contexts , 2002, EMNLP.

[27]  Laurence T. Yang,et al.  A Tensor Computation and Optimization Model for Cyber-Physical-Social Big Data , 2019, IEEE Transactions on Sustainable Computing.

[28]  Doug Downey,et al.  Unsupervised named-entity extraction from the Web: An experimental study , 2005, Artif. Intell..

[29]  Hongyu Guo,et al.  The Unreasonable Effectiveness of Word Representations for Twitter Named Entity Recognition , 2015, NAACL.

[30]  Wei Xu,et al.  Bidirectional LSTM-CRF Models for Sequence Tagging , 2015, ArXiv.

[31]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.