Scalable POMDPs for Diagnosis and Planning in Intelligent Tutoring Systems

A promising application area for proactive assistant agents is automated tutoring and training.  Intelligent tutoring systems (ITSs) assist tutors and tutees by automating diagnosis and adaptive tutoring. These tasks are well modeled by a partially observable Markov decision process (POMDP) since it accounts for the uncertainty inherent in diagnosis. However, an important aspect of making POMDP solvers feasible for real-world problems is selecting appropriate representations for states, actions, and observations. This paper studies two scalable POMDP state and observation representations. State queues allow POMDPs to temporarily ignore less-relevant states. Observation chains represent information in independent dimensions using sequences of observations to reduce the size of the observation set. Preliminary experiments with simulated tutees suggest the experimental representations perform as well as lossless POMDPs, and can model much larger problems.

[1]  KURT VANLEHN Bayesian student modeling, user interfaces and feedback : A sensitivity analysis , 2001 .

[2]  David Hsu,et al.  SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces , 2008, Robotics: Science and Systems.

[3]  D. Norman Categorization of action slips. , 1981 .

[4]  Oliver Brock,et al.  SARSOP: Efficient Point-Based POMDP Planning by Approximating Optimally Reachable Belief Spaces , 2009 .

[5]  Zhendong Niu Bayesian student modeling, user interfaces and feedback: A sensitivity analysis , 2001 .

[6]  Stephen J. Payne,et al.  Algebra Mal‐Rules and Cognitive Accounts of Error , 1990 .

[7]  Paul J. Feltovich,et al.  Categorization and Representation of Physics Problems by Experts and Novices , 1981, Cogn. Sci..

[8]  James C. Lester,et al.  Evaluating the consequences of affective feedback in intelligent tutoring systems , 2009, 2009 3rd International Conference on Affective Computing and Intelligent Interaction and Workshops.

[9]  A. Graesser,et al.  Monitoring Affective Trajectories during Complex Learning , 2007 .

[10]  Leslie Pack Kaelbling,et al.  Planning and Acting in Partially Observable Stochastic Domains , 1998, Artif. Intell..

[11]  Arthur C. Graesser,et al.  When Are Tutorial Dialogues More Effective Than Reading? , 2007, Cogn. Sci..

[12]  Ryan Shaun Joazeiro de Baker,et al.  The Dynamics of Affective Transitions in Simulation Problem-Solving Environments , 2007, ACII.

[13]  A. Cassandra A Survey of POMDP Applications , 2003 .

[14]  What Works Clearinghouse: Procedures and Standards Handbook (Version 2.1) , 2011 .

[15]  Scotty D. Craig,et al.  Affect and learning: An exploratory look into the role of affect in learning with AutoTutor , 2004 .