Predicting Student Knowledge Level from Domain-Independent Function and Content Words

We explored the possibility of predicting the quality of student answers (error-ridden, vague, partially-correct, and correct) to tutor questions by examining their linguistic patterns in 50 tutoring sessions with expert human tutors As an alternative to existing computational linguistic methods that focus on domain-dependent content words (e.g., velocity, RAM, speed) in interpreting a student's response, we focused on function words (e.g., I, you, but) and domain-independent content words (e.g., think, because, guess) Proportional incidence of these word categories in over 6,000 student responses to tutor questions was automatically computed using Linguistic Inquiry and Word Count (LIWC), a computer program for analyzing text Multiple regression analyses indicated that two parameter models consisting of pronouns (e.g., I, they, those) and discrepant terms (e.g., should, could, would) were effective in predicting the conceptual quality of student responses Furthermore, the classification accuracy of discriminant functions derived from the domain-independent LIWC features competed with conventional domain-dependent assessment methods We discuss the possibility of a composite assessment algorithm that focuses on both domain-dependent and domain-independent words for dialogue-based ITSs.

[1]  William B. Stiles,et al.  Describing talk : a taxonomy of verbal response modes , 1992 .

[2]  Johanna D. Moore,et al.  Using Natural Language Processing to Analyze Tutorial Dialogue Corpora Across Domains Modalities , 2009, AIED.

[3]  James W. Pennebaker,et al.  Linguistic Inquiry and Word Count (LIWC2007) , 2007 .

[4]  J. Pennebaker,et al.  The Secret Life of Pronouns , 2003, Psychological science.

[5]  Kurt VanLehn,et al.  Minimally Invasive Tutoring of Complex Physics Problem Solving , 2002, Intelligent Tutoring Systems.

[6]  B. L. Whorf Language, Thought, and Reality: Selected Writings of Benjamin Lee Whorf , 1956 .

[7]  Stephen E. Levinson,et al.  Cognitive state classification in a spoken tutorial dialogue system , 2006, Speech Commun..

[8]  Danielle S. McNamara,et al.  Using LSA in AutoTutor: Learning Through Mixed-Initiative Dialogue in Natural Language , 2007 .

[9]  L. Boroditsky Does Language Shape Thought?: Mandarin and English Speakers' Conceptions of Time , 2001, Cognitive Psychology.

[10]  Arthur C. Graesser,et al.  Cohesion Relationships in Tutorial Dialogue as Predictors of Affective States , 2009, AIED.

[11]  J. Pennebaker Writing About Emotional Experiences as a Therapeutic Process , 1997 .

[12]  Jeffrey T. Hancock,et al.  On Lying and Being Lied To: A Linguistic Analysis of Deception in Computer-Mediated Communication , 2007 .

[13]  J. Pennebaker,et al.  PERSONALITY PROCESSES AND INDIVIDUAL DIFFERENCES Words of Wisdom: Language Use Over the Life Span , 2003 .

[14]  J. Pennebaker,et al.  Linguistic styles: language use as an individual difference. , 1999, Journal of personality and social psychology.

[15]  Marilyn A. Walker,et al.  Using Linguistic Cues for the Automatic Recognition of Personality in Conversation and Text , 2007, J. Artif. Intell. Res..

[16]  Gordon I. McCalla,et al.  The Fawlty Article Tutor , 1992, Intelligent Tutoring Systems.

[17]  Jacob Cohen,et al.  A power primer. , 1992, Psychological bulletin.

[18]  S. Kitayama,et al.  Interaction between affect and cognition in word perception. , 1990, Journal of personality and social psychology.

[19]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[20]  John R. Anderson,et al.  Cognitive Tutors: Lessons Learned , 1995 .

[21]  Trude Heift,et al.  Error Diagnosis and Error Correction in CALL. , 2003 .

[22]  Heather H. Mitchell,et al.  AutoTutor: A tutor with dialogue in natural language , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[23]  James W. Pennebaker,et al.  Language Use and Personality during Crises: Analyses of Mayor Rudolph Giuliani's Press Conferences , 2002 .