Neural Networks for Bacterial Named Entity Recognition

Bacterial named entity recognition is a challenging task in biomedical field. The task is typically modeled as a sequence labeling problem, and existing work mainly adopts discrete models such as CRF (Conditional Random Fields), requiring a large amount of hand-designed features with domain experience. To address this issue, this paper explores a neural network model for the task. We empirically study the effect of word embeddings and character embeddings on the task by extending a CRF baseline using neural networks. Results show the proposed neural network model achieves competitive performance, outperforming the current best discrete model. Meanwhile, the performance can be further improved by integrating neural and discrete features.

[1]  Yue Zhang,et al.  Deceptive Opinion Spam Detection Using Neural Network , 2016, COLING.

[2]  Donghong Ji,et al.  Long short-term memory RNN for biomedical named entity recognition , 2017, BMC Bioinformatics.

[3]  Zhijian Wu,et al.  Twitter Sarcasm Detection Exploiting a Context-Based Model , 2015, WISE.

[4]  Eduard H. Hovy,et al.  When Are Tree Structures Necessary for Deep Learning of Representations? , 2015, EMNLP.

[5]  Han Ren,et al.  Context-augmented convolutional neural networks for twitter sarcasm detection , 2018, Neurocomputing.

[6]  Keun Ho Ryu,et al.  A Self-training with Active Example Selection Criterion for Biomedical Named Entity Recognition , 2012, ICHIT.

[7]  Wei Li,et al.  Early results for Named Entity Recognition with Conditional Random Fields, Feature Induction and Web-Enhanced Lexicons , 2003, CoNLL.

[8]  Benjamin Van Durme,et al.  Open Domain Targeted Sentiment , 2013, EMNLP.

[9]  Yue Zhang,et al.  Improving Twitter Sentiment Classification Using Topic-Enriched Multi-Prototype Word Embeddings , 2016, AAAI.

[10]  Youngjoong Ko,et al.  A Method to Generate a Machine-Labeled Data for Biomedical Named Entity Recognition with Various Sub-Domains , 2017, DDDSM@IJCNLP.

[11]  Yue Zhang,et al.  Context-Sensitive Twitter Sentiment Classification Using Neural Network , 2016, AAAI.

[12]  Yang Jin,et al.  An entity tagger for recognizing acquired genomic variations in cancer literature , 2004, Bioinform..

[13]  Christopher D. Manning,et al.  Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks , 2015, ACL.

[14]  Dong-Hong Ji,et al.  Neural networks for deceptive opinion spam detection: An empirical study , 2017, Inf. Sci..

[15]  Xiaoyan Wang,et al.  Bacterial named entity recognition based on dictionary and conditional random field , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[16]  Quoc V. Le,et al.  Distributed Representations of Sentences and Documents , 2014, ICML.