Co-training an Improved Recurrent Neural Network with Probability Statistic Models for Named Entity Recognition

Named Entity Recognition (NER) is a subtask of information extraction in Natural Language Processing (NLP) field and thus being wildly studied. Currently Recurrent Neural Network (RNN) has become a popular way to do NER task, but it needs a lot of train data. The lack of labeled train data is one of the hard problems and traditional co-training strategy is a way to alleviate it. In this paper, we consider this situation and focus on doing NER with co-training using RNN and two probability statistic models i.e. Hidden Markov Model (HMM) and Conditional Random Field (CRF). We proposed a modified RNN model by redefining its activation function. Compared to traditional sigmoid function, our new function avoids saturation to some degree and makes its output scope very close to [0, 1], thus improving recognition accuracy. Our experiments are conducted ATIS benchmark. First, supervised learning using those models are compared when using different train data size. The experimental results show that it is not necessary to use whole data, even small part of train data can also get good performance. Then, we compare the results of our modified RNN with original RNN. 0.5% improvement is obtained. Last, we compare the co-training results. HMM and CRF get higher improvement than RNN after co-training. Moreover, using our modified RNN in co-training, their performances are improved further.

[1]  Shan Sung Liew,et al.  Bounded activation functions for enhanced training stability of deep neural networks on visual pattern recognition problems , 2016, Neurocomputing.

[2]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[3]  Kenli Li,et al.  CRFs based parallel biomedical named entity recognition algorithm employing MapReduce framework , 2015, Cluster Computing.

[4]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[5]  Yoshua Bengio,et al.  Investigation of recurrent-neural-network architectures and learning methods for spoken language understanding , 2013, INTERSPEECH.

[6]  Jason Weston,et al.  Natural Language Processing (Almost) from Scratch , 2011, J. Mach. Learn. Res..

[7]  Lishuang Li,et al.  Biomedical named entity recognition based on recurrent neural networks with different extended methods , 2016, Int. J. Data Min. Bioinform..

[8]  Yonghong Yan,et al.  Speeding up Deep Neural Networks in Speech Recognition with Piecewise Quantized Sigmoidal Activation Function , 2016, IEICE Trans. Inf. Syst..

[9]  Liborio Cavaleri,et al.  Prediction of the Fundamental Period of Infilled RC Frame Structures Using Artificial Neural Networks , 2015, Comput. Intell. Neurosci..

[10]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[11]  Keun Ho Ryu,et al.  An Active Co-Training Algorithm for Biomedical Named-Entity Recognition , 2012, J. Inf. Process. Syst..

[12]  Jugal K. Kalita,et al.  Named entity recognition in Assamese using CRFS and rules , 2014, 2014 International Conference on Asian Language Processing (IALP).

[13]  Heyan Huang,et al.  Named Entity Recognition Based on Bilingual Co-training , 2013, CLSW.

[14]  Darko Stefanovic,et al.  Deep Neural Networks Based Recognition of Plant Diseases by Leaf Image Classification , 2016, Comput. Intell. Neurosci..

[15]  Jing Sun,et al.  Boosting performance of gene mention tagging system by hybrid methods , 2012, J. Biomed. Informatics.

[16]  Kamal Sarkar,et al.  An HMM Based Named Entity Recognition System for Indian Languages: The JU System at ICON 2013 , 2014, ArXiv.

[17]  Zhenchao Jiang,et al.  Biomedical named entity recognition based on extended Recurrent Neural Networks , 2015, 2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[18]  Jeon Gue Park,et al.  Deep neural network using trainable activation functions , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[19]  Hae-Chang Rim,et al.  Biomedical named entity recognition using two-phase model based on SVMs , 2004, J. Biomed. Informatics.

[20]  Tao Chen,et al.  Disease named entity recognition by combining conditional random fields and bidirectional recurrent neural networks , 2016, Database J. Biol. Databases Curation.

[21]  Shamil Chollampatt,et al.  Neural Network Translation Models for Grammatical Error Correction , 2016, IJCAI.

[22]  Lishuang Li,et al.  Biomedical named entity recognition based on recurrent neural networks with different extended methods , 2016, Int. J. Data Min. Bioinform..

[23]  Eva Volná,et al.  Knowledge discovery in dynamic data using neural networks , 2015, Cluster Computing.

[24]  Wahiba Ben Abdessalem Karaa Named Entity Recognition Using Web Document Corpus , 2011, ArXiv.

[25]  Eric Nichols,et al.  Named Entity Recognition with Bidirectional LSTM-CNNs , 2015, TACL.