Bandit algorithms to personalize educational chatbots

To emulate the interactivity of in-person math instruction, we developed MathBot, a rule-based chatbot that explains math concepts, provides practice questions, and offers tailored feedback. We evaluated MathBot through three Amazon Mechanical Turk studies in which participants learned about arithmetic sequences. In the first study, we found that more than 40% of our participants indicated a preference for learning with MathBot over videos and written tutorials from Khan Academy. The second study measured learning gains, and found that MathBot produced comparable gains to Khan Academy videos and tutorials. We solicited feedback from users in those two studies to emulate a real-world development cycle, with some users finding the lesson too slow and others finding it too fast. We addressed these concerns in the third and main study by integrating a contextual bandit algorithm into MathBot to personalize the pace of the conversation, allowing the bandit to either insert extra practice problems or skip explanations. We randomized participants between two conditions in which actions were chosen uniformly at random (i.e., a randomized A/B experiment) or by the contextual bandit. We found that the bandit learned a similarly effective pedagogical policy to that learned by the randomized A/B experiment while incurring a lower cost of experimentation. Our findings suggest that personalized conversational agents are promising tools to complement existing online resources for math education, and that data-driven approaches such as contextual bandits are valuable tools for learning effective personalization.

[1]  Mustafa H. Hajeer,et al.  SKOPE-IT (Shareable Knowledge Objects as Portable Intelligent Tutors): overlaying natural language tutoring on an adaptive learning system for mathematics , 2018, International journal of STEM education.

[2]  Shipra Agrawal,et al.  Thompson Sampling for Contextual Bandits with Linear Payoffs , 2012, ICML.

[3]  Vincent Aleven,et al.  Scaling Up Programming by Demonstration for Intelligent Tutoring Systems Development: An Open-Access Web Site for Middle School Mathematics Learning , 2009, IEEE Transactions on Learning Technologies.

[4]  Arthur C. Graesser,et al.  AutoTutor and Family: A Review of 17 Years of Natural Language Tutoring , 2014, International Journal of Artificial Intelligence in Education.

[5]  Neil T. Heffernan,et al.  The ASSISTments Ecosystem: Building a Platform that Brings Scientists and Teachers Together for Minimally Invasive Research on Human Learning and Teaching , 2014, International Journal of Artificial Intelligence in Education.

[6]  H. Robbins,et al.  Asymptotically efficient adaptive allocation rules , 1985 .

[7]  Wei Chu,et al.  Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms , 2010, WSDM '11.

[8]  R. Snow Aptitude-treatment interaction as a framework for research on individual differences in learning. , 1989 .

[9]  Richard G. Baraniuk,et al.  A Contextual Bandits Framework for Personalized Learning Action Selection , 2016, EDM.

[10]  James A. Landay,et al.  QuizBot: A Dialogue-based Adaptive Learning System for Factual Knowledge , 2019, CHI.

[11]  Neil T. Heffernan,et al.  Addressing the assessment challenge with an online system that tutors as it assesses , 2009, User Modeling and User-Adapted Interaction.

[12]  Joseph Jay Williams,et al.  Understanding the Effect of In-Video Prompting on Learners and Instructors , 2018, CHI.

[13]  Marilyn A. Walker,et al.  Mixed Initiative in Dialogue: An Investigation into Discourse Segmentation , 1990, ACL.

[14]  Ya'akov Gal,et al.  Combining Difficulty Ranking with Multi-Armed Bandits to Sequence Educational Content , 2018, AIED.

[15]  Albert T. Corbett,et al.  Cognitive Tutor: Applied research in mathematics education , 2007, Psychonomic bulletin & review.

[16]  David Eppstein,et al.  Knowledge Spaces, Applications in Education , 2013 .

[17]  Maxine Eskénazi,et al.  A Finite-State Turn-Taking Model for Spoken Dialog Systems , 2009, NAACL.

[18]  Suresh Manandhar,et al.  A Chatbot-based Interactive Question Answering System , 2007 .

[19]  Javier Snaider,et al.  Conversational Contextual Cues: The Case of Personalization and History for Response Ranking , 2016, ArXiv.

[20]  Rui Yan,et al.  Learning to Respond with Deep Neural Networks for Retrieval-Based Human-Computer Conversation System , 2016, SIGIR.

[21]  Jennifer Chu-Carroll,et al.  Tracking initiative in collaborative dialogue interactions , 1997 .

[22]  Arthur C. Graesser,et al.  Intelligent Tutoring Systems with Conversational Dialogue , 2001, AI Mag..

[23]  Collin Lynch,et al.  Towards Closing the Loop: Bridging Machine-induced Pedagogical Policies to Learning Theories , 2017, EDM.

[24]  Arthur C. Graesser,et al.  AutoTutor: A simulation of a human tutor , 1999, Cognitive Systems Research.

[25]  Heather H. Mitchell,et al.  AutoTutor: A tutor with dialogue in natural language , 2004, Behavior research methods, instruments, & computers : a journal of the Psychonomic Society, Inc.

[26]  Michael C. Mozer,et al.  Predicting and Improving Memory Retention: Psychological Theory Matters in the Big Data Era , 2016 .

[27]  Sumit Gulwani,et al.  A Framework for Automatically Generating Interactive Instructional Scaffolding , 2015, CHI.

[28]  Kurt VanLehn,et al.  Empirically evaluating the application of reinforcement learning to the induction of effective and adaptive pedagogical strategies , 2011, User Modeling and User-Adapted Interaction.

[29]  Carolyn Penstein Rosé,et al.  The Architecture of Why2-Atlas: A Coach for Qualitative Physics Essay Writing , 2002, Intelligent Tutoring Systems.

[30]  Pierre-Yves Oudeyer,et al.  Multi-Armed Bandits for Intelligent Tutoring Systems , 2013, EDM.

[31]  Arthur C. Graesser,et al.  The impact of a technology-based mathematics after-school program using ALEKS on student's knowledge and behaviors , 2013, Comput. Educ..

[32]  Wei Chu,et al.  A contextual-bandit approach to personalized news article recommendation , 2010, WWW '10.

[33]  Brian Chin,et al.  Can Khan Academy e-Learning Video Tutorials Improve Mathematics Achievement in Sri Lanka?. , 2018 .

[34]  Neil T. Heffernan,et al.  Opening the Door to Non-programmers: Authoring Intelligent Tutor Behavior by Demonstration , 2004, Intelligent Tutoring Systems.

[35]  Vincent Aleven,et al.  Example-Tracing Tutors: Intelligent Tutor Development for Non-programmers , 2016, International Journal of Artificial Intelligence in Education.

[36]  Kurt VanLehn,et al.  Conceptual and Meta Learning During Coached Problem Solving , 1996, Intelligent Tutoring Systems.

[37]  Vincent Aleven,et al.  A New Paradigm for Intelligent Tutoring Systems: Example-Tracing Tutors , 2009, Int. J. Artif. Intell. Educ..

[38]  Kristen N. Moreno,et al.  AutoTutor Improves Deep Learning of Computer Literacy : Is it the Dialog or the Talking Head ? , 2004 .

[39]  Sebastian Hobert,et al.  Sara, the Lecturer: Improving Learning in Online Education with a Scaffolding-Based Conversational Agent , 2020, CHI.

[40]  Anbang Xu,et al.  A New Chatbot for Customer Service on Social Media , 2017, CHI.

[41]  Suresh Manandhar,et al.  Persuasive Argumentation in Human Computer Dialogue , 2006, AAAI Spring Symposium: Argumentation for Consumers of Healthcare.

[42]  Joseph P. Magliano,et al.  Collaborative dialogue patterns in naturalistic one-to-one tutoring , 1995 .

[43]  Geert-Jan Houben,et al.  Evaluating Crowdworkers as a Proxy for Online Learners in Video-Based Learning Contexts , 2018, Proc. ACM Hum. Comput. Interact..

[44]  Zoran Popovic,et al.  Learning Individual Behavior in an Educational Game: A Data-Driven Approach , 2014, EDM.

[45]  Srushti Barve,et al.  Chat-Bot For College Management System , 2021, SSRN Electronic Journal.

[46]  Arthur C. Graesser,et al.  Using LSA in AutoTutor: Learning through mixed-initiative dialogue in natural language. , 2007 .

[47]  Stephanie Seneff,et al.  TINA: A Natural Language System for Spoken Language Applications , 1992, Comput. Linguistics.

[48]  Daniel G. Bobrow,et al.  On Overview of KRL, a Knowledge Representation Language , 1976, Cogn. Sci..