Highlights as an Early Predictor of Student Comprehension and Interests

When engaging with a textbook, students are inclined to highlight key content. Although students believe that highlighting and subsequently reviewing the highlights will further their educational goals, the psychological literature provides little evidence of benefits. Nonetheless, a student's choice of text for highlighting may serve as a window into her mental state: her level of comprehension, grasp of the key ideas, reading goals, and so on. We explore this hypothesis via an experiment in which 400 participants read three sections from a college-level biology text, briefly reviewed the text, and then took a quiz on the material. During initial reading, participants were able to highlight words, phrases, and sentences, and these highlights were displayed along with the complete text during the subsequent review. Consistent with past research, the amount of highlighted material was unrelated to quiz performance. Even so, highlighting patterns may allow us to infer reader comprehension and interests. Using multiple representations of the highlighting patterns, we built probabilistic models to predict quiz performance and matrix factorization models to predict what content would be highlighted in one passage from highlights in other passages. We find that quiz score prediction accuracy reliably improves with the inclusion of highlighting data (by about 1%-2%), both for held-out students and for held-out student questions (i.e., questions selected randomly for each student), but not for held-out questions. Furthermore, an individual's highlighting pattern in one passage is informative of what she highlights elsewhere. Our long-term goal is to design digital textbooks that not only serve as conduits of information into the reader's mind but also allow us to draw inferences about the reader at a point where interventions may increase the effectiveness of the material.
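
The abstract does not say which matrix factorization was used, so the following is only a minimal sketch of the general idea, assuming a logistic (Bernoulli) factorization of a binary student-by-segment highlight matrix trained by stochastic gradient descent. The function names, the toy data, and every hyperparameter are hypothetical and are not taken from the paper. The sketch holds out one student's highlights in a "new" passage and predicts them from that student's highlights elsewhere together with the other students' patterns.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def predict(U, V, b, i, j):
    """Predicted probability that student i highlights segment j."""
    return sigmoid(b[j] + sum(u * v for u, v in zip(U[i], V[j])))


def fit_logistic_mf(observed, n_students, n_segments,
                    rank=2, lr=0.05, reg=0.01, epochs=300, seed=0):
    """Fit student factors U, segment factors V and segment biases b by SGD.

    observed: list of (student, segment, highlighted) with highlighted in {0, 1}.
    All hyperparameter defaults are illustrative assumptions.
    """
    rng = random.Random(seed)
    U = [[rng.gauss(0, 0.1) for _ in range(rank)] for _ in range(n_students)]
    V = [[rng.gauss(0, 0.1) for _ in range(rank)] for _ in range(n_segments)]
    b = [0.0] * n_segments
    triples = list(observed)
    for _ in range(epochs):
        rng.shuffle(triples)
        for i, j, y in triples:
            err = predict(U, V, b, i, j) - y  # gradient of log loss w.r.t. the logit
            for k in range(rank):
                u_ik, v_jk = U[i][k], V[j][k]
                U[i][k] -= lr * (err * v_jk + reg * u_ik)
                V[j][k] -= lr * (err * u_ik + reg * v_jk)
            b[j] -= lr * err
    return U, V, b


def log_loss(U, V, b, triples):
    """Mean negative log-likelihood over the given (student, segment, label) triples."""
    eps = 1e-12
    total = 0.0
    for i, j, y in triples:
        p = predict(U, V, b, i, j)
        total -= y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps)
    return total / len(triples)


# Toy highlight matrix (made up): rows are students, columns are text segments.
H = [
    [1, 1, 0, 0, 1, 0],
    [1, 1, 0, 0, 1, 0],
    [1, 0, 0, 0, 1, 0],
    [0, 0, 1, 1, 0, 1],
    [0, 0, 1, 1, 0, 1],
    [0, 1, 1, 1, 0, 1],
]
n_students, n_segments = len(H), len(H[0])

# Treat segments 4 and 5 as a separate passage and hide student 0's highlights there.
held_out = {(0, 4), (0, 5)}
train = [(i, j, H[i][j])
         for i in range(n_students) for j in range(n_segments)
         if (i, j) not in held_out]

# epochs=0 returns the random initialisation, which gives a baseline loss.
U0, V0, b0 = fit_logistic_mf(train, n_students, n_segments, epochs=0)
initial_loss = log_loss(U0, V0, b0, train)

U, V, b = fit_logistic_mf(train, n_students, n_segments)
final_loss = log_loss(U, V, b, train)

held_out_probs = {(i, j): predict(U, V, b, i, j) for (i, j) in sorted(held_out)}
print("training loss:", round(initial_loss, 3), "->", round(final_loss, 3))
for (i, j), p in held_out_probs.items():
    print(f"student {i}, segment {j}: P(highlight) = {p:.2f}, actual = {H[i][j]}")
```

In a real evaluation the held-out predictions would be scored against the hidden entries across many students and passages. The six-by-six matrix here only shows the mechanics.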
