Implicit Heterogeneous Features Embedding in Deep Knowledge Tracing

Deep recurrent neural networks have been successfully applied to knowledge tracing, namely, deep knowledge tracing (DKT), which aims to automatically trace students’ knowledge states by mining their exercise performance data. Two main issues exist in the current DKT models: First, the complexity of the DKT models increases the tension of psychological interpretation. Second, the input of existing DKT models is only the exercise tags representing via one-hot encoding. The correlation between the hidden knowledge components and students’ responses to the exercises heavily relies on training the DKT models. The existing rich and informative features are excluded in the training, which may yield sub-optimal performance. To utilize the information embedded in these features, researchers have proposed a manual method to pre-process the features, i.e., discretizing them based on the inner characteristics of individual features. However, the proposed method requires many feature engineering efforts and is infeasible when the selected features are huge. To tackle the above issues, we design an automatic system to embed the heterogeneous features implicitly and effectively into the original DKT model. More specifically, we apply tree-based classifiers to predict whether the student can correctly answer the exercise given the heterogeneous features, an effective way to capture how the student deviates from others in the exercise. The predicted response and the true response are then encoded into a 4-bit one-hot encoding and concatenated with the original one-hot encoding features on the exercise tags to train a long short-term memory (LSTM) model, which can output the probability that a student will answer the exercise correctly on the corresponding exercise. We conduct a thorough evaluation on two educational datasets and demonstrate the merits and observations of our proposal.

[1]  Lukás Burget,et al.  Recurrent neural network based language model , 2010, INTERSPEECH.

[2]  Michael C. Mozer,et al.  Integrating latent-factor and knowledge-tracing models to predict individual differences in learning , 2014, EDM.

[3]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[4]  Michael C. Mozer,et al.  How Deep is Knowledge Tracing? , 2016, EDM.

[5]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[6]  Changjin Xu,et al.  Dynamics in Four-Neuron Bidirectional Associative Memory Networks with Inertia and Multiple Delays , 2015, Cognitive Computation.

[7]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[8]  Lukás Burget,et al.  Extensions of recurrent neural network language model , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[9]  Kenneth R. Koedinger,et al.  An Open Repository and analysis tools for fine-grained, longitudinal learner data , 2008, EDM.

[10]  Michael W. Spratling A Hierarchical Predictive Coding Model of Object Recognition in Natural Images , 2016, Cognitive Computation.

[11]  Huan Liu,et al.  Feature selection for classification: A review , 2014 .

[12]  Laura Czerniewicz,et al.  MOOC—making and open educational practices , 2017, J. Comput. High. Educ..

[13]  S. Kotsiantis,et al.  Discretization Techniques: A recent survey , 2006 .

[14]  Dit-Yan Yeung,et al.  Dynamic Key-Value Memory Networks for Knowledge Tracing , 2016, WWW.

[15]  Aditya Johri,et al.  Next-Term Student Performance Prediction: A Recommender Systems Approach , 2016, EDM.

[16]  Kenneth R. Koedinger,et al.  Performance Factors Analysis - A New Alternative to Knowledge Tracing , 2009, AIED.

[17]  Michel C. Desmarais,et al.  Adaptive Test Design with a Naive Bayes Framework , 2008, EDM.

[18]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[19]  Roman Timofeev,et al.  Classification and Regression Trees(CART)Theory and Applications , 2004 .

[20]  Neil T. Heffernan,et al.  Comparing Knowledge Tracing and Performance Factor Analysis by Using Multiple Model Fitting Procedures , 2010, Intelligent Tutoring Systems.

[21]  Peter Brusilovsky,et al.  A Data-Driven Framework of Modeling Skill Combinations for Deeper Knowledge Tracing , 2016, EDM.

[22]  Haiqin Yang,et al.  Heterogeneous Features Integration in Deep Knowledge Tracing , 2017, ICONIP.

[23]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[24]  Ron Sun,et al.  Anatomy of the Mind: a Quick Overview , 2017, Cognitive Computation.

[25]  Igor Labutov,et al.  Calibrated Self-Assessment , 2016, EDM.

[26]  Michael R. Lyu,et al.  Boosting Response Aware Model-Based Collaborative Filtering , 2015, IEEE Transactions on Knowledge and Data Engineering.

[27]  Neil T. Heffernan,et al.  Incorporating Rich Features into Deep Knowledge Tracing , 2017, L@S.

[28]  Zachary A. Pardos,et al.  Modeling Individualization in a Bayesian Networks Implementation of Knowledge Tracing , 2010, UMAP.

[29]  Huzefa Rangwala,et al.  Next-term student grade prediction , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[30]  Gilles Louppe,et al.  Understanding variable importances in forests of randomized trees , 2013, NIPS.

[31]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[32]  Geoffrey E. Hinton,et al.  Speech recognition with deep recurrent neural networks , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[33]  John R. Anderson,et al.  Knowledge tracing: Modeling the acquisition of procedural knowledge , 2005, User Modeling and User-Adapted Interaction.

[34]  Michael R. Lyu,et al.  Online Nonlinear AUC Maximization for Imbalanced Data Sets , 2018, IEEE Transactions on Neural Networks and Learning Systems.

[35]  Thomas G. Dietterich Adaptive computation and machine learning , 1998 .

[36]  Kuan-Ta Chen,et al.  Modeling Exercise Relationships in E-Learning: A Unified Approach , 2015, EDM.

[37]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[38]  Yoshua Bengio,et al.  Gated Feedback Recurrent Neural Networks , 2015, ICML.

[39]  Francisco Herrera,et al.  A Survey of Discretization Techniques: Taxonomy and Empirical Analysis in Supervised Learning , 2013, IEEE Transactions on Knowledge and Data Engineering.

[40]  Rebecca Nugent,et al.  A Comparison of Student Skill Knowledge Estimates , 2009, EDM.

[41]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[42]  Chris Piech,et al.  Deep Knowledge Tracing On Programming Exercises , 2017, L@S.

[43]  Vincent Aleven,et al.  More Accurate Student Modeling through Contextual Estimation of Slip and Guess Probabilities in Bayesian Knowledge Tracing , 2008, Intelligent Tutoring Systems.

[44]  Erfu Yang,et al.  Visual Attention Model Based Vehicle Target Detection in Synthetic Aperture Radar Images: A Novel Approach , 2015, Cognitive Computation.

[45]  Dumitru Erhan,et al.  Show and Tell: Lessons Learned from the 2015 MSCOCO Image Captioning Challenge , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Rakesh Agrawal,et al.  Data-Driven Education: Some opportunities and Challenges , 2016, EDM.

[47]  Peter Brusilovsky,et al.  General Features in Knowledge Tracing to Model Multiple Subskills, Temporal Item Response Theory, and Expert Knowledge , 2014, EDM.

[48]  Joseph E. Beck,et al.  Going Deeper with Deep Knowledge Tracing , 2016, EDM.

[49]  John Mingers,et al.  An Empirical Comparison of Pruning Methods for Decision Tree Induction , 1989, Machine Learning.

[50]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[51]  Erik Cambria,et al.  Sentic Computing , 2015, Cognitive Computation.

[52]  Haiqin Yang,et al.  A deep learning approach for predicting the quality of online health expert question-answering services , 2016, J. Biomed. Informatics.