论文信息 - Team ReadMe at CMCL 2021 Shared Task: Predicting Human Reading Patterns by Traditional Oculomotor Control Models and Machine Learning

Team ReadMe at CMCL 2021 Shared Task: Predicting Human Reading Patterns by Traditional Oculomotor Control Models and Machine Learning

This system description paper describes our participation in CMCL 2021 shared task on predicting human reading patterns. Our focus in this study is making use of well-known,traditional oculomotor control models and machine learning systems. We present experiments with a traditional oculomotor control model (the EZ Reader) and two machine learning models (a linear regression model and a re-current network model), as well as combining the two different models. In all experiments we test effects of features well-known in the literature for predicting reading patterns, such as frequency, word length and predictability.Our experiments support the earlier findings that such features are useful when combined.Furthermore, we show that although machine learning models perform better in comparison to traditional models, combination of both gives a consistent improvement for predicting multiple eye tracking variables during reading.

[1] Gaël Varoquaux,et al. Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[2] Erik D. Reichle,et al. Eye movements in reading and information processing: Keith Rayner’s 40 year legacy , 2016 .

[3] Ralf Engbert,et al. Tracking the mind during reading: the influence of past, present, and future words on fixation durations. , 2006, Journal of experimental psychology. General.

[4] K. Rayner. Eye movements in reading: Models and data , 2003, Behavioral and Brain Sciences.

[5] Yohei Oseki,et al. CMCL 2021 Shared Task on Eye-Tracking Prediction , 2021, CMCL.

[6] K. Rayner. Eye movements in reading and information processing: 20 years of research. , 1998, Psychological bulletin.

[7] Erik D. Reichle,et al. The E-Z Reader model of eye-movement control in reading: Comparisons to other models , 2003, Behavioral and Brain Sciences.

[8] Milan Straka,et al. Universal Dependencies 2.5 Models for UDPipe (2019-12-06) , 2019 .

[9] Steven G. Luke,et al. The Provo Corpus: A large eye-tracking corpus with predictability norms , 2018, Behavior research methods.

[10] Reinhold Kliegl,et al. Eye movements during reading proverbs and regular sentences: the incoming word predictability effect , 2014 .

[11] Keith Rayner,et al. The gaze-contingent moving window in reading: Development and review , 2014 .

[12] Ralf Engbert,et al. Evaluating a Computational Model of Eye-Movement Control in Reading , 2013 .

[13] Ralf Engbert,et al. Length, frequency, and predictability effects of words on eye movements in reading , 2004 .

[14] Milan Straka,et al. Tokenizing, POS Tagging, Lemmatizing and Parsing UD 2.0 with UDPipe , 2017, CoNLL.

[15] Nora Hollenstein,et al. ZuCo, a simultaneous EEG and eye-tracking resource for natural sentence reading , 2018, Scientific Data.

[16] Reinhold Kliegl,et al. SWIFT: a dynamical model of saccade generation during reading. , 2005, Psychological review.

[17] Nora Hollenstein,et al. ZuCo 2.0: A Dataset of Physiological Recordings During Natural Reading and Annotation , 2020, LREC.