Developing an early-warning system for spotting at-risk students by using eBook interaction logs

Early prediction systems have already been applied successfully in various educational contexts. In this study, we investigated developing an early prediction system in the context of eBook-based teaching-learning and used students’ eBook reading data to develop an early warning system for students at-risk of academic failure -students whose academic performance is low. To determine the best performing model and optimum time for possible interventions we created prediction models by using 13 prediction algorithms with the data from different weeks of the course. We also tested effects of data transformation on prediction models. 10-fold cross-validation was used for all prediction models. Accuracy and Kappa metrics were used to compare the performance of the models. Our results revealed that in a sixteen-week long course all models reached their highest performance with the data from the 15th week. On the other hand, starting from the 3rd week, the models classified low and high performing students with an accuracy of over 79%. In terms of algorithms, Random Forest (RF) outperformed other algorithms when raw data were used, however, with the transformed data J48 algorithm performed better. When categorical data were used, Naive Bayes (NB) outperformed other algorithms. Results also indicated that models with transformed data performed lower than the models created using categorical data. However, models with categorical data showed similar performance with models with raw data. The implications of the results presented in this research were also discussed with respect to the field of Learning Analytics.

[1]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[2]  Ryan Shaun Joazeiro de Baker,et al.  Analyzing Early At-Risk Factors in Higher Education e-Learning Courses , 2015, EDM.

[3]  Farshid Marbouti,et al.  Models for early prediction of at-risk students in a course using standards-based grading , 2016, Comput. Educ..

[4]  Sebastián Ventura,et al.  Data mining in course management systems: Moodle case study and tutorial , 2008, Comput. Educ..

[5]  César Hervás-Martínez,et al.  Data Mining Algorithms to Classify Students , 2008, EDM.

[6]  Jae Young Chung,et al.  Dropout early warning systems for high school students using machine learning , 2019, Children and Youth Services Review.

[7]  Abigail Selzer King,et al.  Using Signals for appropriate feedback: Perceptions and practices , 2011, Comput. Educ..

[8]  David B. Daniel,et al.  E-textbooks at what cost? Performance and use of electronic v. print texts , 2013, Comput. Educ..

[9]  Maiga Chang,et al.  Smart learning environments , 2015 .

[10]  Brendan FLANAGANa,et al.  Integration of Learning Analytics Research and Production Systems While Protecting Privacy , 2018 .

[11]  Chia-Lun Lo,et al.  Developing early warning systems to predict students' online learning performance , 2014, Comput. Hum. Behav..

[12]  Shane Dawson,et al.  Mining LMS data to develop an "early warning system" for educators: A proof of concept , 2010, Comput. Educ..

[13]  Matthew D. Pistilli,et al.  Course signals at Purdue: using learning analytics to increase student success , 2012, LAK.

[14]  Hiroaki Ogata,et al.  e-Book-based learning analytics in University education , 2015, ICCE 2015.

[15]  Reynol Junco,et al.  Predicting course outcomes with digital textbook usage data , 2015, Internet High. Educ..

[16]  Kurt Hornik,et al.  Mining Association Rules and Frequent Itemsets , 2015 .

[17]  Rafael Molina-Carmona,et al.  Improving the expressiveness of black-box models for predicting student performance , 2017, Comput. Hum. Behav..

[18]  S. Taruna,et al.  Towards the integration of multiple classifier pertaining to the Student's performance prediction , 2016 .

[19]  Baldoino Fonseca dos Santos Neto,et al.  Evaluating the effectiveness of educational data mining techniques for early prediction of students' academic failure in introductory programming courses , 2017, Comput. Hum. Behav..

[20]  Zdenek Zdráhal,et al.  Developing predictive models for early detection of at-risk students on distance learning modules , 2014, LAK Workshops.

[21]  Zachary A. Pardos,et al.  Affective states and state tests: investigating how affect throughout the school year predicts end of year learning outcomes , 2013, LAK '13.

[22]  Sebastián Ventura,et al.  Predicting students' final performance from participation in on-line discussion forums , 2013, Comput. Educ..

[23]  Maria Meehan,et al.  Contrasting prediction methods for early warning systems at undergraduate level , 2016, Internet High. Educ..