Mining Texts, Learner Productions and Strategies with ReaderBench

The chapter introduces ReaderBench, a multi-lingual and flexible environment that integrates text mining technologies for assessing a wide range of learners' productions and for supporting teachers in several ways. ReaderBench offers three main functionalities in terms of text analysis: cohesion-based assessment, reading strategies identification and textual complexity evaluation. All of these have been subject to empirical validations. ReaderBench may be used throughout an entire educational scenario, starting from the initial complexity assessment of the reading materials, the assignment of texts to learners, the detection of reading strategies reflected in one's self-explanations, and comprehension evaluation fostering learner's self-regulation process.

[1]  Meredith Williams,et al.  Wittgenstein, Mind and Meaning: Towards a Social Conception of Mind , 1999 .

[2]  B. Lemaire Limites de la lemmatisation pour l'extraction de significations , 2008 .

[3]  Stefan Trausan-Matu,et al.  Cohesion-based Analysis of CSCL Conversations: Holistic and Individual Perspectives , 2013, CSCL.

[4]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[5]  Peter Wiemer-Hastings,et al.  Rules for Syntax, Vectors for Semantics , 2001 .

[6]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[7]  Sonia Mandin Modèles cognitifs computationnels de l'activité de résumer : expérimentation d'un EIAH auprès d'élèves de lycée , 2009 .

[8]  W. Kintsch,et al.  Are Good Texts Always Better? Interactions of Text Coherence, Background Knowledge, and Levels of Understanding in Learning From Text , 1996 .

[9]  Benoît Sagot,et al.  Building a free French wordnet from multilingual resources , 2008 .

[10]  Seymour Geisser,et al.  8. Predictive Inference: An Introduction , 1995 .

[11]  Danielle S. McNamara,et al.  Reversing the Reverse Cohesion Effect: Good Texts Can Be Better for Strategic, High-Knowledge Readers , 2007 .

[12]  Philippe Dessus,et al.  Contrôle et régulation de la compréhension : l'acquisition de stratégies de 8 à 11 ans , 2012 .

[13]  K. VanLehn,et al.  Scaffolding Deep Comprehension Strategies Through Point&Query, AutoTutor, and iSTART , 2005 .

[14]  Stefan Trausan-Matu,et al.  Towards an Integrated Approach for Evaluating Textual Complexity for Learning Purposes , 2012, ICWL.

[15]  Danielle S. McNamara Reading comprehension strategies : theories, interventions, and technologies , 2007 .

[16]  Mathieu Bastian,et al.  Gephi: An Open Source Software for Exploring and Manipulating Networks , 2009, ICWSM.

[17]  Mari Ostendorf,et al.  A machine learning approach to reading level assessment , 2009, Comput. Speech Lang..

[18]  Stefan Trausan-Matu,et al.  Utterances Assessment in Chat Conversations , 2010 .

[19]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[20]  Lijun Feng,et al.  A Comparison of Features for Automatic Readability Assessment , 2010, COLING.

[21]  Christopher D. Manning,et al.  Enriching the Knowledge Sources Used in a Maximum Entropy Part-of-Speech Tagger , 2000, EMNLP.

[22]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[23]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[24]  Christopher D. Manning,et al.  Multiword Expression Identification with Tree Substitution Grammars: A Parsing tour de force with French , 2011, EMNLP.

[25]  Heeyoung Lee,et al.  Deterministic Coreference Resolution Based on Entity-Centric, Precision-Ranked Rules , 2013, CL.

[26]  Arthur C. Graesser,et al.  Coh-Metrix: Analysis of text on cohesion and language , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[27]  Walter Kintsch,et al.  Comprehension: A Paradigm for Cognition , 1998 .

[28]  John C. Nesbit,et al.  Sequential Pattern Analysis of Learning Logs: Methodology and Applications , 2010 .

[29]  Heeyoung Lee,et al.  A Multi-Pass Sieve for Coreference Resolution , 2010, EMNLP.

[30]  Stefan Gerry Johann Trausan-Matu,et al.  Supporting Polyphonic Collaborative Learning , 2007 .

[31]  W. Kintsch,et al.  Strategies of discourse comprehension , 1983 .

[32]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[33]  Mykola Pechenizkiy,et al.  Handbook of Educational Data Mining , 2010 .

[34]  Joseph E. Gonzalez,et al.  GraphLab: A New Parallel Framework for Machine Learning , 2010 .

[35]  Arthur C. Graesser,et al.  Coh-Metrix: Capturing Linguistic Features of Cohesion , 2010 .

[36]  U. Brandes A faster algorithm for betweenness centrality , 2001 .

[37]  Danielle S. McNamara,et al.  iSTART: A Web-based tutor that teaches self-explanation and metacognitive reading strategies. , 2007 .

[38]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[39]  Martha Palmer,et al.  Verb Semantics and Lexical Selection , 1994, ACL.

[40]  T. Landauer,et al.  A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[41]  Graeme Hirst,et al.  Evaluating WordNet-based Measures of Lexical Semantic Relatedness , 2006, CL.

[42]  Kathleen McKeown,et al.  Improving Word Sense Disambiguation in Lexical Chaining , 2003, IJCAI.

[43]  Joseph M. Hellerstein,et al.  Distributed GraphLab: A Framework for Machine Learning in the Cloud , 2012, Proc. VLDB Endow..

[44]  Stefan Trausan-Matu,et al.  Textual Complexity and Discourse Structure in Computer-Supported Collaborative Learning , 2012, ITS.

[45]  Dan Klein,et al.  Accurate Unlexicalized Parsing , 2003, ACL.

[46]  Ryan S. Baker,et al.  The Potentials of Educational Data Mining for Researching Metacognition, Motivation and Self-Regulated Learning , 2013, EDM 2013.

[47]  J. Oakhill,et al.  Reading comprehension development from 8 to 14 years the contribution of component skills and processes , 2009 .

[48]  Danielle S. McNamara,et al.  iStart: A Web-Based Reading Strategy Intervention That Improves Students's Science Comprehension , 2004, CELDA.

[49]  Elizabeth B. Bernhardt,et al.  Learning and comprehension of text , 1988 .

[50]  Judy Sheard Basics of Statistical Analysis of Interactions Data from Web-Based Learning Environments , 2010 .

[51]  Christopher D. Manning,et al.  Incorporating Non-local Information into Information Extraction Systems by Gibbs Sampling , 2005, ACL.

[52]  Nadine Mandran,et al.  Open platform to model and capture experimental data in Technology enhanced learning systems , 2013 .

[53]  Douglas J. Hacker,et al.  Handbook of Metacognition in Education , 2009 .

[54]  Laurie E Cutting,et al.  Reader-Text Interactions: How Differential Text and Question Types Influence Cognitive Skills Needed for Reading Comprehension. , 2012, Journal of educational psychology.

[55]  Benoît Lemaire,et al.  A semantic space for modeling children's semantic memory , 2007, ArXiv.

[56]  T. Trabasso,et al.  Constructing inferences during narrative text comprehension. , 1994 .

[57]  Thomas François,et al.  Do NLP and machine learning improve traditional readability formulas? , 2012, PITR@NAACL-HLT.

[58]  Richard K. Wagner,et al.  Beyond decoding : the behavioral and biological foundations of reading comprehension , 2009 .

[59]  Joseph M. Hellerstein,et al.  GraphLab: A New Framework For Parallel Machine Learning , 2010, UAI.

[60]  Hinrich Schütze,et al.  Introduction to information retrieval , 2008 .

[61]  Traian Rebedea,et al.  A Polyphonic Model and System for Inter-animation Analysis in Chat Conversations with Multiple Participants , 2010, CICLing.

[62]  Danielle S. McNamara,et al.  Self-Explanation and Metacognition , 2009 .

[63]  D. McNamara SERT: Self-Explanation Reading Training , 2004 .

[64]  A. Garnham,et al.  On theories of belief bias in syllogistic reasoning , 1993, Cognition.

[65]  Stefan Trausan-Matu,et al.  ReaderBench, an Environment for Analyzing Text Complexity and Reading Strategies , 2013, AIED.

[66]  Dan Klein,et al.  Feature-Rich Part-of-Speech Tagging with a Cyclic Dependency Network , 2003, NAACL.

[67]  Heeyoung Lee,et al.  Stanford’s Multi-Pass Sieve Coreference Resolution System at the CoNLL-2011 Shared Task , 2011, CoNLL Shared Task.

[68]  Robert L. Donaway,et al.  A Comparison of Rankings Produced by Summarization Evaluation Measures , 2000 .

[69]  A. Hayes Introduction to Mediation, Moderation, and Conditional Process Analysis: A Regression-Based Approach , 2013 .

[70]  Kenneth R. Koedinger,et al.  A Data Repository for the EDM Community: The PSLC DataShop , 2010 .