The Wordometer -- Estimating the Number of Words Read Using Document Image Retrieval and Mobile Eye Tracking

We introduce the Wordometer, a novel method to estimate the number of words a user reads using a mobile eye tracker and document image retrieval. We present a reading detection algorithm which works with over 91 % accuracy over 10 test subjects using 10-fold cross validation. We implement two algorithms to estimate the read words using a line break detector. A simple version gives an average error rate of 13,5 % for 9 users over 10 documents. A more sophisticated word count algorithm based on support vector regression with an RBF kernel reaches an average error rate from only 8.2 % (6.5 % if one test subject with abnormal behavior is excluded). The achieved error rates are comparable to pedometers that count our steps in our daily life. Thus, we believe the Wordometer can be used as a step counter for the information we read to make our knowledge life healthier.

[1]  Kai Kunze,et al.  Reading Activity Recognition Using an Off-the-Shelf EEG -- Detecting Reading Activities and Distinguishing Genres of Documents , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[2]  Robert J Barry,et al.  EEG Analysis of Children with Attention-Deficit/Hyperactivity Disorder and Comorbid Reading Disabilities , 2002, Journal of learning disabilities.

[3]  Gerhard Tröster,et al.  Eye Movement Analysis for Activity Recognition Using Electrooculography , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  K. Rayner Eye movements in reading and information processing: 20 years of research. , 1998, Psychological bulletin.

[5]  Hao Jiang,et al.  User-oriented document summarization through vision-based eye-tracking , 2009, IUI.

[6]  Roger Fulton,et al.  Correction for head movements in positron emission tomography using an optical motion tracking system , 2000 .

[7]  Kai Kunze,et al.  Towards inferring language expertise using eye tracking , 2013, CHI Extended Abstracts.

[8]  Stephen J. Payne,et al.  Skim reading by satisficing: evidence from eye tracking , 2011, CHI.

[9]  Kai Kunze,et al.  I know what you are reading: recognition of document types using mobile eye tracking , 2013, ISWC '13.

[10]  Andreas Dengel,et al.  A robust realtime reading-skimming classifier , 2012, ETRA.

[11]  Gerhard Tröster,et al.  Robust Recognition of Reading Activity in Transit Using Wearable Electrooculography , 2009, Pervasive.

[12]  Georg Buscher,et al.  Gaze-Based Filtering of Relevant Document Segments , 2009 .

[13]  Evelyn C. Ferstl,et al.  The extended language network: A meta‐analysis of neuroimaging studies on text comprehension , 2008, Human brain mapping.

[14]  BullingAndreas,et al.  Eye Movement Analysis for Activity Recognition Using Electrooculography , 2011 .

[15]  Ralf Engbert,et al.  Tracking the mind during reading: the influence of past, present, and future words on fixation durations. , 2006, Journal of experimental psychology. General.

[16]  I. Olkin,et al.  Using pedometers to increase physical activity and improve health: a systematic review. , 2007, JAMA.

[17]  Ernest T. Pascarella,et al.  Influences affecting the development of students' critical thinking skills , 1995 .

[18]  K. Stanovich,et al.  What Reading Does for the Mind. , 1998 .

[19]  Masakazu Iwamura,et al.  Use of Affine Invariants in Locally Likely Arrangement Hashing for Camera-Based Document Image Retrieval , 2006, Document Analysis Systems.

[20]  A. Jacobs,et al.  Coregistration of eye movements and EEG in natural reading: analyses and review. , 2011, Journal of experimental psychology. General.