OULAD MOOC Dropout and Result Prediction using Ensemble, Deep Learning and Regression Techniques

Massive Open Online Courses (MOOCs) have become increasingly popular since their start in the year 2008. Universities known worldwide for their traditional confined classroom education are also changing their practices by hosting MOOCs. These are Internet-based courses where students can learn at their own pace and follow their own schedule. Study materials and videos are provided that can be used in a blended learning program. Despite its many advantages, it suffers from problems such as high dropout and failure rates. Previous studies have mostly focused on predicting student dropout. This paper contributes to the body of research by investigating both student dropout and result prediction performance of machine learning models built based on different types of attributes such as demographic info, assessment info and interaction with the VLE. An analysis on the OULAD dataset showed that models based on student’s interaction with the VLE achieved the high performance in terms of AUC, of up to 0.91 for dropout prediction and 0.93 for result prediction in case of Gradient Boosting Machine.

[1]  Shin Kiyohara,et al.  Machine learning approaches for ELNES/XANES. , 2020, Microscopy.

[2]  Qian Zhang,et al.  Modeling and Predicting Learning Behavior in MOOCs , 2016, WSDM.

[3]  Bowei Hong,et al.  Discovering learning behavior patterns to predict dropout in MOOC , 2017, 2017 12th International Conference on Computer Science and Education (ICCSE).

[4]  Giuseppe Riccardi,et al.  Predicting students' final exam scores from their course activities , 2015, 2015 IEEE Frontiers in Education Conference (FIE).

[5]  Andy Laws,et al.  The Application of Gaussian Mixture Models for the Identification of At-Risk Learners in Massive Open Online Courses , 2018, 2018 IEEE Congress on Evolutionary Computation (CEC).

[6]  Linda Corrin,et al.  Predicting success: how learners' prior knowledge, skills and activities predict MOOC performance , 2015, LAK.

[7]  Martin Hlosta,et al.  Open University Learning Analytics dataset , 2017, Scientific Data.

[8]  Nitesh V. Chawla,et al.  MOOC Dropout Prediction: Lessons Learned from Making Pipelines Interpretable , 2017, WWW.

[9]  Wanli Xing,et al.  Dropout Prediction in MOOCs: Using Deep Learning for Personalized Intervention , 2019 .

[10]  Zdenek Zdráhal,et al.  Ouroboros: early identification of at-risk students without models based on legacy data , 2017, LAK.

[11]  Chao Li,et al.  Machine learning application in MOOCs: Dropout prediction , 2016, 2016 11th International Conference on Computer Science & Education (ICCSE).

[12]  Hua Li,et al.  Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[13]  Leonidas J. Guibas,et al.  Deep Knowledge Tracing , 2015, NIPS.

[14]  David E. Pritchard,et al.  Studying Learning in the Worldwide Classroom Research into edX's First MOOC. , 2013 .

[15]  Jihie Kim,et al.  Predicting Student Attrition in MOOCs using Sentiment Analysis and Neural Networks , 2015, AIED Workshops.

[16]  Xiu Li,et al.  Finding out Reasons for Low Completion in MOOC Environment: An Explicable Approach Using Hybrid Data Mining Methods , 2017 .

[17]  Zhihai Wang,et al.  A Time Series Classification Method for Behaviour-Based Dropout Prediction , 2018, 2018 IEEE 18th International Conference on Advanced Learning Technologies (ICALT).

[18]  Mingjie Tan,et al.  Prediction of Student Dropout in E-Learning Program Through the Use of Machine Learning Method , 2015, Int. J. Emerg. Technol. Learn..

[19]  Hendrik Heuer,et al.  Student Success Prediction and the Trade-Off between Big Data and Data Minimization , 2018, DeLFI.

[20]  E. Zaluska,et al.  Predicting attrition from Massive Open Online Courses in FutureLearn and edX , 2017 .

[21]  Zenun Kastrati,et al.  MOOC dropout prediction using machine learning techniques: Review and research challenges , 2018, 2018 IEEE Global Engineering Education Conference (EDUCON).

[22]  Chunyan Miao,et al.  Deep Model for Dropout Prediction in MOOCs , 2017, ICCSE'17.

[23]  Girish Balakrishnan,et al.  Predicting Student Retention in Massive Open Online Courses using Hidden Markov Models , 2013 .

[24]  Andy Laws,et al.  Machine learning approaches to predict learning outcomes in Massive open online courses , 2017, 2017 International Joint Conference on Neural Networks (IJCNN).