Iterative Alternating Neural Attention for Machine Reading

We propose a novel neural attention architecture for machine comprehension tasks, such as answering Cloze-style queries about a document. Unlike previous models, we do not collapse the query into a single vector; instead, we deploy an iterative alternating attention mechanism that allows fine-grained exploration of both the query and the document. Our model outperforms state-of-the-art baselines on standard machine comprehension benchmarks such as the CNN news articles corpus and the Children's Book Test (CBT) dataset.
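
To make the mechanism concrete, below is a minimal PyTorch sketch of a single alternating-attention iteration, assuming bilinear scoring functions and a GRU cell for the inference state; the class, parameter, and variable names are illustrative, not taken from the paper.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class AlternatingAttentionStep(nn.Module):
    """One iteration of alternating query/document attention (a sketch)."""

    def __init__(self, enc_dim, state_dim):
        super().__init__()
        # Bilinear forms scoring token encodings against a conditioning vector
        # (assumed parameterization; the paper's exact scoring may differ).
        self.A_q = nn.Parameter(0.01 * torch.randn(enc_dim, state_dim))
        self.A_d = nn.Parameter(0.01 * torch.randn(enc_dim, state_dim + enc_dim))
        # GRU cell carrying the search state across iterations.
        self.cell = nn.GRUCell(2 * enc_dim, state_dim)

    def forward(self, state, query_enc, doc_enc):
        # state: (B, S); query_enc: (B, Q, E); doc_enc: (B, D, E)
        # 1) Attend over the query tokens, conditioned on the current state.
        q_scores = torch.einsum('bqe,es,bs->bq', query_enc, self.A_q, state)
        q_glimpse = torch.einsum('bq,bqe->be',
                                 F.softmax(q_scores, dim=1), query_enc)
        # 2) Attend over the document, conditioned on state + query glimpse.
        cond = torch.cat([state, q_glimpse], dim=-1)
        d_scores = torch.einsum('bde,ec,bc->bd', doc_enc, self.A_d, cond)
        d_attn = F.softmax(d_scores, dim=1)
        d_glimpse = torch.einsum('bd,bde->be', d_attn, doc_enc)
        # 3) Fold both glimpses back into the search state.
        state = self.cell(torch.cat([q_glimpse, d_glimpse], dim=-1), state)
        return state, d_attn
```

Running this step for a fixed number of iterations and reading the answer off the final document attention (for example, summing `d_attn` mass over each candidate's positions, in the style of attention-sum readers) yields the overall inference loop. Note that the query is never collapsed into a single static vector: the query glimpse is recomputed at every iteration, conditioned on the evolving search state.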
