Using learning analytics to develop early-warning system for at-risk students

This study used interaction data from students in an online learning setting to investigate whether end-of-term academic performance could be predicted in the earlier weeks of the term. It was carried out with 76 second-year university students registered in a Computer Hardware course. The study aimed to answer two principal questions: first, which algorithms and features best predict students' end-of-term academic performance, as determined by comparing different classification algorithms and pre-processing techniques; and second, whether academic performance can be predicted in the earlier weeks using these features and the selected algorithm. The results indicated that the kNN algorithm predicted students who were unsuccessful at the end of term with a rate of 89%. When data obtained in weeks 3, 6, 9, 12, and 14 were analysed to test whether end-of-term performance could be predicted earlier, students who were unsuccessful at the end of term could be predicted with a rate of 74% as early as week 3. These findings are important for determining the features of early warning systems that can be developed for online learning systems, and as indicators of student success. They will also aid researchers in selecting algorithms and pre-processing techniques for the analysis of educational data.
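The week-by-week evaluation described above can be illustrated with a minimal sketch. It is not the study's pipeline: the data are synthetic, the two features (mean weekly activity and share of active weeks) are hypothetical stand-ins for the interaction features, k = 5 is an arbitrary choice, and the "rate" is assumed here to mean recall on the unsuccessful class under leave-one-out evaluation. Only the kNN classifier, the cohort size of 76, and the checkpoint weeks come from the abstract.

```python
import math
import random

WEEKS = 14
CHECKPOINTS = [3, 6, 9, 12, 14]  # checkpoint weeks named in the abstract


def make_synthetic_students(n=76, seed=0):
    """Return (weekly_logs, labels). Purely synthetic, not the study's data."""
    rng = random.Random(seed)
    logs, labels = [], []
    for _ in range(n):
        unsuccessful = rng.random() < 0.4  # assumed class balance
        mean_activity = 4.0 if unsuccessful else 9.0  # assumed effect size
        weekly = [max(0.0, rng.gauss(mean_activity, 3.0)) for _ in range(WEEKS)]
        logs.append(weekly)
        labels.append(1 if unsuccessful else 0)  # 1 = unsuccessful
    return logs, labels


def features_up_to(weekly, week):
    """Summarise only the activity observed through the given week."""
    seen = weekly[:week]
    active_weeks = sum(1 for v in seen if v > 0)
    return [sum(seen) / week, active_weeks / week]


def knn_predict(train_x, train_y, query, k=5):
    """Majority vote among the k nearest training points (Euclidean distance)."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: math.dist(p[0], query))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if votes * 2 > k else 0


def loo_recall_unsuccessful(xs, ys, k=5):
    """Leave-one-out recall for the unsuccessful class (label 1)."""
    hits = total = 0
    for i, (x, y) in enumerate(zip(xs, ys)):
        if y != 1:
            continue
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        hits += knn_predict(train_x, train_y, x, k) == 1
        total += 1
    return hits / total if total else 0.0


def evaluate_checkpoints(logs, labels, k=5):
    """Recall on unsuccessful students using only data up to each checkpoint."""
    return {
        week: loo_recall_unsuccessful([features_up_to(s, week) for s in logs], labels, k)
        for week in CHECKPOINTS
    }


if __name__ == "__main__":
    logs, labels = make_synthetic_students()
    for week, recall in evaluate_checkpoints(logs, labels).items():
        print(f"week {week:2d}: recall on unsuccessful students = {recall:.2f}")
```

The point of the sketch is the truncation in `features_up_to`: at each checkpoint the classifier sees only what had been logged by that week, which is what makes the week-3 result an early prediction. The features are left unscaled for brevity; with real interaction data on very different scales, normalising them before computing distances would usually matter for kNN.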
