Paper Information - A Deep Attention Network for Chinese Word Segment

Character-level sequence labeling is the most efficient way to handle the unknown-word problem in Chinese word segmentation (CWS). However, the most widely used model, Conditional Random Fields (CRF), requires a large number of manually designed features. It is therefore appropriate to combine CRF with neural networks such as the recurrent neural network (RNN), which has been adopted in many natural language processing (NLP) tasks. RNNs, however, are rather slow because each computation step depends on the previous one, and they are not good at capturing local information in the sentence. To solve these problems, we introduce into CWS a self-attention mechanism, which computes the interaction between any two positions of the sentence over the same distance. We then propose a deep neural network that combines convolutional neural networks with the self-attention mechanism. We evaluate the model on the PKU and MSR datasets. The results show that our model performs much better.
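As a rough illustration of what character-level sequence labeling means for segmentation, the sketch below converts a segmented sentence into one tag per character and back. The abstract does not name a tag set; the B/M/E/S scheme (begin, middle, end, single) is assumed here as one common choice, and the function names `words_to_tags` and `tags_to_words` are hypothetical. Because a tagger only predicts a label for each character, it needs no word dictionary, which is why this formulation can cope with unknown words.

```python
def words_to_tags(words):
    """Map a segmented sentence (list of words) to one B/M/E/S tag per character.

    Assumed scheme: S = single-character word, B = first character,
    M = inner character, E = last character of a multi-character word.
    """
    tags = []
    for word in words:
        if not word:  # skip empty strings rather than emit bogus tags
            continue
        if len(word) == 1:
            tags.append("S")
        else:
            tags.extend(["B"] + ["M"] * (len(word) - 2) + ["E"])
    return tags


def tags_to_words(chars, tags):
    """Rebuild words from characters and their (predicted) tags."""
    words, current = [], ""
    for ch, tag in zip(chars, tags):
        current += ch
        if tag in ("E", "S"):  # a word closes on E or S
            words.append(current)
            current = ""
    if current:  # tolerate a tag sequence that ends mid-word
        words.append(current)
    return words


segmented = ["我", "喜欢", "自然语言"]
tags = words_to_tags(segmented)
print(tags)
print(tags_to_words("".join(segmented), tags))
```

In a full system, a model such as the CRF or neural network described above would predict the tag sequence from the raw characters; only the conversion between tags and words is shown here.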

Ping Gong | Lanxin Li | Likun Ji
