Entity Tracking Improves Cloze-style Reading Comprehension

Recent work has improved modeling for reading comprehension tasks with simple approaches such as the Attention Sum Reader; however, automatic systems still trail human performance by a significant margin. Analysis suggests that many of the remaining hard instances stem from an inability to track entity references throughout a document. This work targets these hard entity-tracking cases with two extensions: (1) additional entity features, and (2) training with a multi-task tracking objective. We show that these simple modifications improve performance both independently and in combination; we outperform the previous state of the art on the LAMBADA dataset by 8 points, particularly on difficult entity examples, and effectively match the performance of more complex models on the named-entity portion of the CBT dataset.

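To make the base model concrete, the sketch below illustrates the attention-sum readout that the Attention Sum Reader family uses: the model attends from the cloze query over document tokens and scores each candidate entity by summing the attention mass over all of its mentions. This is a minimal illustration only; the shapes, function names, and use of plain NumPy are assumptions, and the paper's actual contributions (extra entity features and the multi-task tracking objective) would be added on top of an encoder that produces these token states.

```python
# Minimal sketch of an attention-sum readout (in the spirit of the
# Attention Sum Reader). Names and shapes are illustrative assumptions,
# not the authors' implementation.
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def attention_sum(doc_states, query_state, candidate_positions):
    """doc_states: (T, d) contextual encodings of document tokens.
    query_state: (d,) encoding of the cloze query.
    candidate_positions: dict mapping each candidate entity to the list
    of token positions where it is mentioned in the document.
    Returns a score per candidate: attention pointed at document tokens,
    summed over each entity's mentions."""
    scores = doc_states @ query_state                     # dot-product attention, shape (T,)
    attn = softmax(scores)                                # distribution over document tokens
    return {cand: attn[pos].sum() for cand, pos in candidate_positions.items()}

# Toy usage: 6 document tokens, 4-dim states, two hypothetical candidates.
rng = np.random.default_rng(0)
doc = rng.normal(size=(6, 4))
query = rng.normal(size=4)
probs = attention_sum(doc, query, {"@entity1": [0, 3], "@entity2": [2, 5]})
print(probs)  # the predicted answer is the argmax over candidates
```

In a full model, the entity features described in the abstract would presumably be concatenated to the token representations before encoding, and the multi-task tracking objective would be trained jointly with this cloze objective; both are sketched here only at the level of where they would plug in.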