Multi-Relational Factorization Models for Predicting Student Performance

Predicting student performance (PSP) is an important task in educational data mining, where we can give the students early feedbacks to help them improving their study results. A good and reliable model which accurately predicts the student performance may replace the current standardized tests, thus, reducing the pressure on teaching and learning for examinations as well as saving a lot of time and effort for both teachers and students [Feng et al. 2009; Thai-Nghe et al. 2011]. Precisely, PSP is the task where we would like to know how the students learn (e.g. generally or narrowly), how quickly or slowly they adapt to new problems or if it is possible to infer the knowledge requirements to solve the problems directly from student performance data [Corbett and Anderson 1995; Feng et al. 2009], and eventually, we would like to know whether the students perform the tasks (exercises) correctly (or with some levels of certainty). The benefits of PSP have been vastly discussed in the literature [Cen et al. 2006; Feng et al. 2009; Thai-Nghe et al. 2011]. To address the problem of PSP, several works have been published, e.g. as summarized in Romero et al. [2010], but most of them relied on traditional classification/regression techniques. For example, Corbett and Anderson [1995] proposed the Knowledge Tracing (KT) model, which is usually used for tracing the students’ knowledge in applying their skills as well as for PSP; Cen et al. [2006] proposed a semi-automated method for improving a cognitive model called Learning Factors Analysis that combines a statistical model, human expertise and a combinatorial search; Yu et al. [2010] used linear support vector machines together with feature engineering and ensembling techniques for predicting student performance. This approach, however, requires intensive computer memory and much human effort on data pre-processing. Recently, researchers have proposed using recommender system techniques, e.g. k-NN collaborative filtering and matrix factorization, for PSP [Cetintas et al. 2010; Toscher and Jahrer 2010; Thai-Nghe et al. 2011]. The literature have shown that PSP can be considered as rating prediction task in recommender systems since the student, task, and performance would become user, item, and rating, respectively. The authors also shown that matrix factorization is a promising approach for PSP.

[1]  Neil T. Heffernan,et al.  Addressing the assessment challenge with an online system that tutors as it assesses , 2009, User Modeling and User-Adapted Interaction.

[2]  Lars Schmidt-Thieme,et al.  Factorization Techniques for Predicting Student Performance , 2012 .

[3]  Lars Schmidt-Thieme,et al.  Factorization Models for Forecasting Student Performance , 2011, EDM.

[4]  John R. Anderson,et al.  Knowledge tracing: Modeling the acquisition of procedural knowledge , 2005, User Modeling and User-Adapted Interaction.

[5]  Albert T. Corbett,et al.  A Bayes Net Toolkit for Student Modeling in Intelligent Tutoring Systems , 2006, Intelligent Tutoring Systems.

[6]  Michael Jahrer,et al.  Collaborative Filtering Applied to Educational Data Mining , 2010 .

[7]  Yehuda Koren,et al.  Factor in the neighbors: Scalable and accurate collaborative filtering , 2010, TKDD.

[8]  Luo Si,et al.  Predicting Correctness of Problem Solving in ITS with a Temporal Collaborative Filtering Approach , 2010, Intelligent Tutoring Systems.

[9]  Kenneth R. Koedinger,et al.  A Data Repository for the EDM Community: The PSLC DataShop , 2010 .

[10]  Kenneth R. Koedinger,et al.  Performance Factors Analysis - A New Alternative to Knowledge Tracing , 2009, AIED.

[11]  Kenneth R. Koedinger,et al.  Learning Factors Analysis - A General Method for Cognitive Model Evaluation and Improvement , 2006, Intelligent Tutoring Systems.

[12]  Volker Tresp,et al.  Relation Prediction in Multi-Relational Domains using Matrix Factorization , 2008 .

[13]  Vincent Aleven,et al.  More Accurate Student Modeling through Contextual Estimation of Slip and Guess Probabilities in Bayesian Knowledge Tracing , 2008, Intelligent Tutoring Systems.

[14]  N. Heffernan,et al.  Using HMMs and bagged decision trees to leverage rich features of user and skill from an intelligent tutoring system dataset , 2010 .

[15]  Shou-De Lin,et al.  Feature Engineering and Classifier Ensemble for KDD Cup 2010 , 2010, KDD 2010.

[16]  Geoffrey J. Gordon,et al.  Relational learning via collective matrix factorization , 2008, KDD.

[17]  Lars Schmidt-Thieme,et al.  Learning Attribute-to-Feature Mappings for Cold-Start Recommendations , 2010, 2010 IEEE International Conference on Data Mining.

[18]  Mykola Pechenizkiy,et al.  Handbook of Educational Data Mining , 2010 .