Modeling Task Effects in Human Reading with Neural Attention

Humans read by making a sequence of fixations and saccades. They often skip words, without apparent detriment to understanding. We offer a novel explanation for skipping: readers optimize a tradeoff between performing a language-related task and fixating as few words as possible. We propose a neural architecture that combines an attention module (deciding whether to skip words) and a task module (memorizing the input). We show that our model predicts human skipping behavior, while also modeling reading times well, even though it skips 40% of the input. A key prediction of our model is that different reading tasks should result in different skipping behaviors. We confirm this prediction in an eye-tracking experiment in which participants answers questions about a text. We are able to capture these experimental results using the our model, replacing the memorization module with a task module that performs neural question answering.

[1]  Yoshinobu Kano,et al.  Predicting Word Fixations in Text with a CRF Model for Capturing General Reading Strategies among Readers , 2012 .

[2]  Danqi Chen,et al.  A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task , 2016, ACL.

[3]  J. Henderson Human gaze control during real-world scene perception , 2003, Trends in Cognitive Sciences.

[4]  Frank Keller,et al.  Data from eye-tracking corpora as evidence for theories of syntactic processing complexity , 2008, Cognition.

[5]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[6]  Slav Petrov,et al.  A Universal Part-of-Speech Tagset , 2011, LREC.

[7]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[8]  R. J. Williams,et al.  Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning , 2004, Machine Learning.

[9]  K. Rayner Eye movements in reading and information processing: 20 years of research. , 1998, Psychological bulletin.

[10]  Phil Blunsom,et al.  Teaching Machines to Read and Comprehend , 2015, NIPS.

[11]  Ralf Engbert,et al.  A dynamical model of saccade generation in reading based on spatially distributed lexical processing , 2002, Vision Research.

[12]  R. Levy Expectation-based syntactic comprehension , 2008, Cognition.

[13]  Yoshua Bengio,et al.  Understanding the difficulty of training deep feedforward neural networks , 2010, AISTATS.

[14]  Brendan J. Frey,et al.  Learning Wake-Sleep Recurrent Attention Models , 2015, NIPS.

[15]  Erik D. Reichle,et al.  The E-Z Reader model of eye-movement control in reading: Comparisons to other models , 2003, Behavioral and Brain Sciences.

[16]  Roger Levy,et al.  A Rational Model of Eye Movement Control in Reading , 2010, ACL.

[17]  John Hale,et al.  A Probabilistic Earley Parser as a Psycholinguistic Model , 2001, NAACL.

[18]  Alexander M. Rush,et al.  Character-Aware Neural Language Models , 2015, AAAI.

[19]  R. Shillcock,et al.  Eye Movements Reveal the On-Line Computation of Lexical Probabilities During Reading , 2003, Psychological science.

[20]  Erik D. Reichle,et al.  Toward a model of eye movement control in reading. , 1998, Psychological review.

[21]  Yoshua Bengio,et al.  Show, Attend and Tell: Neural Image Caption Generation with Visual Attention , 2015, ICML.

[22]  Alex Graves,et al.  Recurrent Models of Visual Attention , 2014, NIPS.

[23]  Nathaniel J. Smith,et al.  The effect of word predictability on reading time is logarithmic , 2013, Cognition.

[24]  Jiqiang Guo,et al.  Stan: A Probabilistic Programming Language. , 2017, Journal of statistical software.

[25]  Reinhold Kliegl,et al.  SWIFT: a dynamical model of saccade generation during reading. , 2005, Psychological review.

[26]  R. Shillcock,et al.  Low-level predictive inference in reading: the influence of transitional probabilities on eye movements , 2003, Vision Research.

[27]  Elizabeth R Schotter,et al.  Task effects reveal cognitive flexibility responding to frequency and predictability: Evidence from eye movements in reading and proofreading , 2014, Cognition.

[28]  A. Kennedy,et al.  Parafoveal-on-foveal effects in normal reading , 2005, Vision Research.

[29]  Mats Wirén,et al.  Syntactic Parsing , 2010, Handbook of Natural Language Processing.

[30]  Frank Keller,et al.  Modeling Human Reading with Neural Attention , 2016, EMNLP.

[31]  Peter Henderson,et al.  Reproducibility of Benchmarked Deep Reinforcement Learning Tasks for Continuous Control , 2017, ArXiv.

[32]  D. Bates,et al.  Mixed-Effects Models in S and S-PLUS , 2001 .

[33]  Anders Søgaard,et al.  With Blinkers on: Robust Prediction of Eye Movements across Readers , 2013, EMNLP.

[34]  Patrick Plummer,et al.  See before you jump: full recognition of parafoveal words precedes skips during reading. , 2013, Journal of experimental psychology. Learning, memory, and cognition.

[35]  S. Frank,et al.  Insensitivity of the Human Sentence-Processing System to Hierarchical Structure , 2011, Psychological science.

[36]  Marcel Adam Just,et al.  17 – What Your Eyes Do while Your Mind Is Reading1 , 1983 .

[37]  Arthur M. Jacobs TOWARD A MODEL OF EYE MOVEMENT CONTROL IN VISUAL SEARCH , 1987 .

[38]  Yoshua Bengio,et al.  Neural Machine Translation by Jointly Learning to Align and Translate , 2014, ICLR.

[39]  Maria Barrett,et al.  The Dundee Treebank , 2015 .

[40]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[41]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[42]  Erik D. Reichle,et al.  Models of the reading process. , 2010, Wiley interdisciplinary reviews. Cognitive science.

[43]  Erik D. Reichle,et al.  Using E-Z reader to model the effects of higher level language processing on eye movements during reading , 2009, Psychonomic bulletin & review.

[44]  Joakim Nivre,et al.  Towards a Data-Driven Model of Eye Movement Control in Reading , 2010, CMCL@ACL.

[45]  Joakim Nivre,et al.  Learning Where to Look: Modeling Eye Movements in Reading , 2009, CoNLL.

[46]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[47]  C. A. Weaver,et al.  Psychology of Reading , 2012 .