Explaining Errors in Predictions of At-Risk Students in Distance Learning Education

Despite recognising the importance of transparency and understanding of predictive models, little effort has been made to investigate the errors made by these models. In this paper, we address this gap by interviewing 12 students whose results and predictions of submitting their assignment differed. Following our previous quantitative analysis of 25,000+ students, we conducted online interviews with two groups of students: those predicted to submit their assignment, yet they did not (False Negative) and those predicted not to submit, yet they did (False Positive). Interviews revealed that, in False Negatives, the non-submission of assignments was explained by personal, financial and practical reasons. Overall, the factors explaining the different outcomes were not related to any of the student data currently captured by the predictive model.

[1]  Ryan Shaun Joazeiro de Baker,et al.  Modeling and Experimental Design for MOOC Dropout Prediction: A Replication Perspective , 2019, EDM.

[2]  Kalyan Veeramachaneni,et al.  Likely to stop? Predicting Stopout in Massive Open Online Courses , 2014, ArXiv.

[3]  Bart Rienties,et al.  Empowering online teachers through predictive learning analytics , 2019, Br. J. Educ. Technol..

[4]  Christothea Herodotou,et al.  Why Predictions of At-Risk Students Are Not 100% Accurate? Showing Patterns in False Positive and False Negative Predictions , 2020 .

[5]  Xavier Ochoa,et al.  Quantitative and Qualitative Analysis of the Learning Analytics and Knowledge Conference 2018 , 2018, J. Learn. Anal..

[6]  Zdenek Zdráhal,et al.  Ouroboros: early identification of at-risk students without models based on legacy data , 2017, LAK.

[7]  Carol Calvert,et al.  Student feedback to improved retention: using a mixed-methods approach to extend specific feedback to a generalisable concept , 2018, Open Learning: The Journal of Open, Distance and e-Learning.

[8]  V. Braun,et al.  Using thematic analysis in psychology , 2006 .

[9]  Joseph B. Berger,et al.  A Modified Model of College Student Persistence: Exploring the Relationship Between Astin's Theory of Involvement and Tinto's Theory of Student Departure , 1997 .

[10]  Paul Prinsloo,et al.  Speaking the unspoken in learning analytics: troubling the defaults , 2020 .

[11]  Zdenek Zdráhal,et al.  Implementing predictive learning analytics on a large scale: the teacher's perspective , 2017, LAK.

[12]  Simon Buckingham Shum,et al.  Embracing imperfection in learning analytics , 2018, LAK.