Adversarial training based lattice LSTM for Chinese clinical named entity recognition

Clinical named entity recognition (CNER), which intends to automatically detect clinical entities in electronic health record (EHR), is a committed step for further clinical text mining. Recently, more and more deep learning models are used to Chinese CNER. However, these models do not make full use of the information in EHR, for these models are either word-based or character-based. In addition, neural models tend to be locally unstable and even tiny perturbation may mislead them. In this paper, we firstly propose a novel adversarial training based lattice LSTM with a conditional random field layer (AT-lattice LSTM-CRF) for Chinese CNER. Lattice LSTM is used to capture richer information in EHR. As a powerful regularization method , AT can be used to improve the robustness of neural models by adding perturbations to the training data. Then, we conduct experiments on the proposed neural model with dataset of CCKS-2017 Task 2. The results show that the proposed model achieves a highly competitive performance (with an F1 score of 89.64%) compared to other prevalent neural models, which can be a reinforced baseline for further research in this field.

[1]  Tomas Pfister,et al.  Learning from Simulated and Unsupervised Images through Adversarial Training , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Mourad Gridach,et al.  Character-level neural network for biomedical named entity recognition , 2017, J. Biomed. Informatics.

[3]  Hua Xu,et al.  A study of machine-learning-based approaches to extract clinical entities and their assertions from discharge summaries , 2011, J. Am. Medical Informatics Assoc..

[4]  Hua Xu,et al.  Research and applications: A comprehensive study of named entity recognition in Chinese clinical text , 2014, J. Am. Medical Informatics Assoc..

[5]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[6]  Ping He,et al.  Incorporating Dictionaries into Deep Neural Networks for the Chinese Clinical Named Entity Recognition , 2018, J. Biomed. Informatics.

[7]  Chengjie Sun,et al.  LSTM-CRF for Drug-Named Entity Recognition , 2017, Entropy.

[8]  Massimo Piccardi,et al.  Recurrent neural networks with specialized word embeddings for health-domain named-entity recognition , 2017, J. Biomed. Informatics.

[9]  Qi Wang,et al.  Fast and Accurate Recognition of Chinese Clinical Named Entities with Residual Dilated Convolutions , 2018, 2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[10]  Scott T. Weiss,et al.  Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system , 2006, BMC Medical Informatics Decis. Mak..

[11]  David Bamman,et al.  Adversarial Training for Relation Extraction , 2017, EMNLP.

[12]  Xiaolin Li,et al.  GRAM-CNN: a deep learning approach with local context for named entity recognition in biomedical text , 2017, Bioinform..

[13]  Maryam Habibi,et al.  Deep learning with word embeddings improves biomedical named entity recognition , 2017, Bioinform..

[14]  Uri Shaham,et al.  Understanding adversarial training: Increasing local stability of supervised models through robust optimization , 2015, Neurocomputing.

[15]  Thomas C. Rindflesch,et al.  EDGAR: extraction of drugs, genes and relations from the biomedical literature. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.

[16]  Guoming Tang,et al.  A Survey on Edge Computing Systems and Tools , 2019, Proceedings of the IEEE.