The real world significance of performance prediction

In recent years, the educational data mining and user modeling communities have been aggressively introducing models for predicting student performance, both on external measures such as standardized tests and on within-tutor performance. While these models have brought statistically reliable improvements to performance prediction, the real-world significance of the differences in their errors has remained largely unexplored. In this paper we take a deeper look at what reported errors actually mean in the context of high-stakes test score prediction and student mastery prediction. We report how differences in common error and accuracy metrics on prediction tasks translate into impact on students, and we show how standard validation methods can lead to overestimated accuracies in these prediction tasks. Two years of student tutor use and corresponding state test scores are used for the analysis of test score prediction, while a simulation study is conducted to investigate the correspondence between performance prediction error and latent knowledge prediction.