Because it learns a more distributed representation of the input space, clustering can be a powerful source of information for boosting the performance of predictive models. Semi-supervised methods based on clustering have been used to improve predictions of external test scores, but they have not yet been applied to within-tutor prediction of student responses. We use knowledge tracing, a widely adopted model for student prediction, as our predictor and demonstrate that clustering students can improve its accuracy. The intuition behind this application of clustering is that different groups of students can be better fit with separate models. High-performing students, for example, might be better modeled with a higher knowledge tracing learning rate parameter than lower-performing students. We use a bagging method that clusters students at several values of K in order to capture a variety of categorizations of students. The method then combines the resulting cluster-based predictions to produce a more accurate result than knowledge tracing without clustering.
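The procedure described above can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: students are clustered on a single feature (their mean correctness) with a simple 1-D k-means, guess and slip are fixed at 0.2 and 0.1, and the knowledge tracing prior and learning rate are fit per cluster by a coarse grid search. The names `bkt_predict`, `fit_bkt`, `kmeans_1d`, and `clustered_kt_predict` are hypothetical.

```python
from statistics import mean

GRID = [0.05, 0.1, 0.2, 0.3, 0.5, 0.7, 0.9]  # assumed candidate parameter values


def bkt_predict(responses, prior, learn, guess=0.2, slip=0.1):
    """Return P(correct) before each response, plus one for the next unseen response."""
    p_know, preds = prior, []
    for r in responses:
        preds.append(p_know * (1 - slip) + (1 - p_know) * guess)
        # Bayesian update of P(known) given the observed response
        if r:
            post = p_know * (1 - slip) / (p_know * (1 - slip) + (1 - p_know) * guess)
        else:
            post = p_know * slip / (p_know * slip + (1 - p_know) * (1 - guess))
        # Chance to learn the skill after the practice opportunity
        p_know = post + (1 - post) * learn
    preds.append(p_know * (1 - slip) + (1 - p_know) * guess)
    return preds


def fit_bkt(sequences):
    """Grid-search (prior, learn) minimizing squared prediction error (an assumed fitting method)."""
    def sse(prior, learn):
        return sum((p - r) ** 2
                   for seq in sequences
                   for p, r in zip(bkt_predict(seq, prior, learn), seq))
    return min(((p, l) for p in GRID for l in GRID), key=lambda pl: sse(*pl))


def nearest(centers, v):
    """Index of the center closest to v."""
    return min(range(len(centers)), key=lambda i: abs(centers[i] - v))


def kmeans_1d(values, k, iters=20):
    """Deterministic 1-D k-means, initialized at evenly spaced quantiles."""
    srt = sorted(values)
    centers = [srt[(2 * i + 1) * len(srt) // (2 * k)] for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in centers]
        for v in values:
            groups[nearest(centers, v)].append(v)
        centers = [mean(g) if g else c for g, c in zip(groups, centers)]
    return centers


def clustered_kt_predict(train, history, k_values=(1, 2, 3)):
    """Average next-response predictions from per-cluster models over several K."""
    feats = {s: mean(seq) for s, seq in train.items()}
    target = mean(history)
    preds = []
    for k in k_values:
        centers = kmeans_1d(list(feats.values()), k)
        c = nearest(centers, target)
        members = [train[s] for s in train if nearest(centers, feats[s]) == c]
        if not members:  # fall back to all students if the cluster is empty
            members = list(train.values())
        prior, learn = fit_bkt(members)
        preds.append(bkt_predict(history, prior, learn)[-1])
    return mean(preds)
```

Here `train` maps a student identifier to that student's list of 0/1 responses, and `history` holds the responses observed so far for the student being predicted. With K = 1 the sketch reduces to a single knowledge tracing model fit on all students, so the larger values of K are what contribute the group-specific parameters.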