Gated Convolutional Neural Network for Sentence Matching

Recurrent neural networks (RNNs) have shown promising results in sentence matching tasks such as paraphrase identification (PI), natural language inference (NLI), and answer selection (AS). However, the recurrent architecture prevents parallel computation within a sequence and is therefore time-consuming. To overcome this limitation, we propose a gated convolutional neural network (GCNN) for sentence matching tasks. In this model, stacked convolutions encode hierarchical, context-aware representations of a sentence, while a gating mechanism selectively controls and retains the convolutional contextual information. An attention mechanism is then used to obtain interactive matching information between the two sentences. We evaluate our model on PI and NLI tasks, and the experiments demonstrate the advantages of the proposed approach in terms of both speed and accuracy.
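As a rough illustration of the two mechanisms the abstract names, below is a minimal PyTorch sketch of a GLU-style gated convolution layer and a soft-alignment attention step. The names (GatedConvBlock, soft_align), the kernel size, and the exact gate formulation are assumptions made for illustration; the abstract alone does not specify these architectural details.

import torch
import torch.nn as nn
import torch.nn.functional as F

class GatedConvBlock(nn.Module):
    """One gated convolution (GLU-style): the convolution produces a
    content half and a gate half, and the sigmoid gate controls how much
    convolutional context is passed on. (Assumption: the paper's exact
    gate formulation may differ.)"""
    def __init__(self, dim, kernel_size=3):
        super().__init__()
        # 2 * dim output channels: one half is content, one half is the gate
        self.conv = nn.Conv1d(dim, 2 * dim, kernel_size,
                              padding=kernel_size // 2)

    def forward(self, x):  # x: (batch, dim, seq_len)
        content, gate = self.conv(x).chunk(2, dim=1)
        return content * torch.sigmoid(gate)

def soft_align(a, b):
    """Attention between two encoded sentences a, b of shape
    (batch, len, dim): for each position in a, return an
    attention-weighted summary of b, i.e. the interactive
    matching signal between the sentences."""
    scores = torch.bmm(a, b.transpose(1, 2))        # (batch, len_a, len_b)
    return torch.bmm(F.softmax(scores, dim=-1), b)  # (batch, len_a, dim)

Stacking several such gated convolution layers enlarges the receptive field layer by layer, which is presumably what the hierarchical context-aware representations in the abstract refer to.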
