Memory-Augmented Neural Networks for Knowledge Tracing from the Perspective of Learning and Forgetting

Knowledge tracing (KT) is a machine learning technique for assessing a student's level of understanding (the knowledge state) from the student's past performance in solving exercises. A KT model takes a sequence of question-answer pairs as input, iteratively updates the student's knowledge state, and returns the probability that the student will solve a given question correctly. To estimate the knowledge state accurately, a KT model should imitate the student's learning and forgetting mechanisms. Recently proposed deep learning-based KT models show higher predictive performance than traditional machine learning-based KT models, owing to the representational power of neural networks. The dynamic key-value memory network (DKVMN), a kind of memory-augmented neural network (MANN), is a state-of-the-art KT model, but it has two limitations: it does not use information from the current knowledge state when updating that state, and it overestimates the amount of forgetting. To improve the learning and forgetting mechanisms of the DKVMN, we propose a knowledge tracing model that incorporates (1) adaptive knowledge growth that depends on the current knowledge state, and (2) an additional loss term that regularizes the degree of forgetting. To measure the degree of forgetting of a KT model, we define a positive update ratio (PUR) that complements the predictive performance metric (AUC). In our experiments on four public benchmarks, the proposed approaches outperform the original DKVMN in terms of both AUC (predictive performance) and PUR (degree of forgetting).
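As a rough illustration of the KT loop described above (question-answer pairs in, an iteratively updated knowledge state, a probability out), the following is a minimal sketch of a key-value memory tracer with an erase-then-add update in the style of DKVMN. It is not the model proposed here: the class name `TinyMemoryTracer`, the helper `trace`, the layer sizes, and the random, untrained weights are all assumptions made for illustration, and neither the adaptive knowledge growth nor the forgetting regularizer is included.

```python
import numpy as np


def softmax(x):
    z = np.exp(x - x.max())
    return z / z.sum()


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


class TinyMemoryTracer:
    """Toy key-value memory tracer; weights are random and untrained (illustration only)."""

    def __init__(self, n_questions, n_slots=4, dim=8, seed=0):
        rng = np.random.default_rng(seed)
        self.n_questions = n_questions
        self.keys = rng.normal(size=(n_slots, dim))      # static concept keys
        self.values = np.zeros((n_slots, dim))           # knowledge state (value memory)
        self.q_embed = rng.normal(size=(n_questions, dim))
        self.qa_embed = rng.normal(size=(2 * n_questions, dim))
        self.W_erase = rng.normal(size=(dim, dim)) * 0.1
        self.W_add = rng.normal(size=(dim, dim)) * 0.1
        self.w_out = rng.normal(size=2 * dim) * 0.1

    def attention(self, q):
        # Correlation weight between question q and each memory slot.
        return softmax(self.keys @ self.q_embed[q])

    def predict(self, q):
        # Read the knowledge state relevant to q and map it to a probability.
        read = self.attention(q) @ self.values
        features = np.concatenate([read, self.q_embed[q]])
        return float(sigmoid(features @ self.w_out))

    def update(self, q, correct):
        # Erase-then-add write: the erase gate models forgetting,
        # the add vector models knowledge growth.
        w = self.attention(q)
        v = self.qa_embed[q + self.n_questions * int(correct)]
        erase = sigmoid(self.W_erase @ v)
        add = np.tanh(self.W_add @ v)
        self.values = self.values * (1.0 - np.outer(w, erase)) + np.outer(w, add)


def trace(model, history, next_question):
    """Feed (question, correct) pairs in order, then predict the next answer."""
    for q, correct in history:
        model.update(q, correct)
    return model.predict(next_question)


model = TinyMemoryTracer(n_questions=5)
p = trace(model, [(0, 1), (1, 0), (0, 1)], next_question=0)
print(f"P(correct on question 0) = {p:.3f}")
```

Note that in this sketch the erase gate and the add vector depend only on the question-answer embedding `v`, not on the current contents of `self.values`; this is the kind of state-independent update that the abstract identifies as a limitation of the original DKVMN.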
