MoGCN: Mixture of Gated Convolutional Neural Network for Named Entity Recognition of Chinese Historical Texts

Named Entity Recognition (NER) systems have been largely advanced by deep neural networks in the recent decade. However, the state-of-the-arts on NER have been less applied to Chinese historical texts due to the lack of standard corpora in Chinese historical domains and the difficulty of accessing a quality ancient corpus. This paper addresses the respective issues and proposes an efficient automatic processing solution for tackling NER of ancient Chinese data, including the implementation of data-driven tagging and an innovative end-to-end network namely “MoGCN” (Mixture of Gated Convolutional Neural Network). A corpus consisting of three genres of Chinese historical classics is generated by our tagging approach, which is experimented for uncovering the generalization ability of proposed model. The empirical analysis demonstrates that our proposed model achieves the best results with above 1.5% F1-score improvement over other sophisticated models in this dataset, where the experimental performance shows positive dependence on the quality of corpus. Furthermore, our model can perform much better on shorter entities especially for 2-charater ones, while many long-range entities can be only identified by our model based on our auxiliary attribute analysis. This work serves as a preliminary exploitation of NER for historical data, providing unique insights and reference values for similar tasks. Future work should be focused on more exploration about NER optimization on massive Chinese traditional texts with linguistic features and learning strategies.

[1]  Eric Nichols,et al.  Named Entity Recognition with Bidirectional LSTM-CNNs , 2015, TACL.

[2]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[3]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[4]  Andrew McCallum,et al.  Fast and Accurate Entity Recognition with Iterated Dilated Convolutions , 2017, EMNLP.

[5]  Chao-Lin Liu,et al.  Toward Algorithmic Discovery of Biographical Information in Local Gazetteers of Ancient China , 2015, PACLIC.

[6]  Guoxin Wang,et al.  CAN-NER: Convolutional Attention Network for Chinese Named Entity Recognition , 2019, NAACL.

[7]  Jian Xu,et al.  Recognition and Extraction of Honorifics in Chinese Diachronic Corpora , 2014, CLSW.

[8]  Satoshi Sekine,et al.  A survey of named entity recognition and classification , 2007 .

[9]  Chenliang Li,et al.  A Survey on Deep Learning for Named Entity Recognition , 2018, IEEE Transactions on Knowledge and Data Engineering.

[10]  J. Altham Naming and necessity. , 1981 .

[11]  Chao-Lin Liu,et al.  Mining local gazetteers of literary Chinese with CRF and pattern based methods for biographical information in Chinese history , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[12]  Masanori Hattori,et al.  Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition , 2016, NLPCC/ICCPOL.

[13]  Wen-Hsiang Lu,et al.  A Web-based unsupervised algorithm for learning transliteration model to improve translation of low-frequency proper names , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[14]  Chu-Ren Huang,et al.  Named Entity Recognition for Chinese Novels in the Ming-Qing Dynasties , 2016, CLSW.

[15]  Hui Chen,et al.  GRN: Gated Relation Network to Enhance Convolutional Neural Network for Named Entity Recognition , 2019, AAAI.

[16]  Hung-Yu Kao,et al.  An enhanced CRF-based system for disease name entity recognition and normalization on BioCreative V DNER Task , 2015 .

[17]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[18]  Tat-Seng Chua,et al.  Learning pattern rules for Chinese named entity extraction , 2002, AAAI/IAAI.

[19]  Guillaume Lample,et al.  Neural Architectures for Named Entity Recognition , 2016, NAACL.

[20]  Yuan Tian,et al.  Chinese Named Entity Recognition Based on B-LSTM Neural Network with Additional Features , 2017, SpaCCS.

[21]  Runnan Li,et al.  Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition , 2019, ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[22]  Jian Sun,et al.  Identity Mappings in Deep Residual Networks , 2016, ECCV.

[23]  Yann Dauphin,et al.  Language Modeling with Gated Convolutional Networks , 2016, ICML.

[24]  Yan Jia,et al.  Chinese Name Entity Recognition Using Highway-LSTM-CRF , 2018, ACAI.

[25]  William W. Cohen,et al.  Exploiting dictionaries in named entity extraction: combining semi-Markov extraction processes and data integration methods , 2004, KDD.

[26]  Yuejie Zhang,et al.  Fusion of Multiple Features for Chinese Named Entity Recognition Based on CRF Model , 2008, AIRS.

[27]  Frederick Reiss,et al.  Domain Adaptation of Rule-Based Annotators for Named-Entity Recognition Tasks , 2010, EMNLP.

[28]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[29]  Rui Liu,et al.  A hybrid approach for named entity recognition in Chinese electronic medical record , 2019, BMC Medical Informatics and Decision Making.

[30]  Yang Xiang,et al.  Chinese Named Entity Recognition with Character-Word Mixed Embedding , 2017, CIKM.

[31]  Fan Yang,et al.  Five-Stroke Based CNN-BiRNN-CRF Network for Chinese Named Entity Recognition , 2018, NLPCC.

[32]  Elin Xu Historical development of the pre-dynastic Khitan , 2005 .

[33]  Jr. G. Forney,et al.  Viterbi Algorithm , 1973, Encyclopedia of Machine Learning.

[34]  Lei Jiang,et al.  An Experimental Study of Hybrid Machine Learning Models for Extracting Named Entities , 2019 .

[35]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[36]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[37]  B. Everitt,et al.  Large sample standard errors of kappa and weighted kappa. , 1969 .

[38]  Shih-Pei Chen,et al.  From text to data : extracting posting data from Chinese local gazetteers , 2016 .

[39]  Douglas E. Appelt,et al.  FASTUS: A Finite-state Processor for Information Extraction from Real-world Text , 1993, IJCAI.

[40]  Guohong Fu,et al.  Chinese named entity recognition using lexicalized HMMs , 2005, SKDD.

[41]  Yuanbo Guo,et al.  A Self-Attention-Based Approach for Named Entity Recognition in Cybersecurity , 2019, 2019 15th International Conference on Computational Intelligence and Security (CIS).

[42]  Xiangdong Huang,et al.  A Dilated CNN Model for Image Classification , 2019, IEEE Access.

[43]  Aitao Chen,et al.  Chinese Named Entity Recognition with Conditional Probabilistic Models , 2006, SIGHAN@COLING/ACL.

[44]  Thomas A. Funkhouser,et al.  Dilated Residual Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[45]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[46]  Donald E. Knuth,et al.  Fast Pattern Matching in Strings , 1977, SIAM J. Comput..

[47]  Xing Xie,et al.  Neural Chinese Named Entity Recognition via CNN-LSTM-CRF and Joint Training with Word Segmentation , 2019, WWW.

[48]  Lei Li,et al.  Personal Attributes Extraction Based on the Combination of Trigger Words, Dictionary and Rules , 2014, CIPS-SIGHAN.

[49]  Alexander M. Rush,et al.  Character-Aware Neural Language Models , 2015, AAAI.

[50]  Yu-Chun Wang,et al.  Transliteration Extraction from Classical Chinese Buddhist Literature Using Conditional Random Fields , 2013, PACLIC.

[51]  Le Sun,et al.  Early results for Chinese named entity recognition using conditional random fields model, HMM and maximum entropy , 2005, 2005 International Conference on Natural Language Processing and Knowledge Engineering.

[52]  Eduard H. Hovy,et al.  End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF , 2016, ACL.

[53]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[54]  Yongbin Qin,et al.  A Radical-Based Method for Chinese Named Entity Recognition , 2019, ICBDT2019.

[55]  Weisi Guo,et al.  LSTM-CRF Neural Network With Gated Self Attention for Chinese NER , 2019, IEEE Access.

[56]  Wei Zhang,et al.  BiLSTM-CRF Chinese Named Entity Recognition Model with Attention Mechanism , 2019 .