Towards Portability of Models for Predicting Students’ Final Performance in University Courses Starting from Moodle Logs

Predicting students’ academic performance is one of the older challenges faced by the educational scientific community. However, most of the research carried out in this area has focused on obtaining the best accuracy models for their specific single courses and only a few works have tried to discover under which circumstances a prediction model built on a source course can be used in other different but similar courses. Our motivation in this work is to study the portability of models obtained directly from Moodle logs of 24 university courses. The proposed method intends to check if grouping similar courses by the degree or the similar level of usage of activities provided by the Moodle logs, and if the use of numerical or categorical attributes affect in the portability of the prediction models. We have carried out two experiments by executing the well-known classification algorithm over all the datasets of the courses in order to obtain decision tree models and to test their portability to the other courses by comparing the obtained accuracy and loss of accuracy evaluation measures. The results obtained show that it is only feasible to directly transfer predictive models or apply them to different courses with an acceptable accuracy and without losing portability under some circumstances.

[1]  Gabriela Csurka,et al.  A Comprehensive Survey on Domain Adaptation for Visual Applications , 2017, Domain Adaptation in Computer Vision Applications.

[2]  Rianne Conijn,et al.  Predicting Student Performance from LMS Data: A Comparison of 17 Blended Courses Using Moodle LMS , 2017, IEEE Transactions on Learning Technologies.

[3]  Alaa Khalaf Hamoud,et al.  Predicting Student Performance in Higher Education Institutions Using Decision Tree Analysis , 2018, Int. J. Interact. Multim. Artif. Intell..

[4]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[5]  Nathalie Japkowicz,et al.  The class imbalance problem: A systematic study , 2002, Intell. Data Anal..

[6]  Jacob Whitehill,et al.  MOOC Dropout Prediction: How to Measure Accuracy? , 2017, L@S.

[7]  Philip S. Yu,et al.  Top 10 algorithms in data mining , 2007, Knowledge and Information Systems.

[8]  Shiliang Sun,et al.  A survey of multi-source domain adaptation , 2015, Inf. Fusion.

[9]  Suraiya Yeasmin,et al.  Analysis of Student Performance using Data Mining , 2014 .

[10]  Sebastián Ventura,et al.  Data mining in education , 2013, WIREs Data Mining Knowl. Discov..

[11]  Sebastián Ventura,et al.  Data mining in course management systems: Moodle case study and tutorial , 2008, Comput. Educ..

[12]  Ryan Shaun Joazeiro de Baker,et al.  Enabling End-To-End Machine Learning Replicability: A Case Study in Educational Data Mining , 2018, ArXiv.

[13]  Ryan S. Baker,et al.  Educational Data Mining and Learning Analytics , 2014 .

[14]  Ryan S. Baker,et al.  Challenges for the Future of Educational Data Mining: The Baker Learning Analytics Prizes , 2019 .

[15]  Daniel L. Schwartz,et al.  Modeling exploration strategies to predict student performance within a learning environment and beyond , 2017, LAK.

[16]  RomeroCristobal,et al.  Data mining in education , 2013 .

[17]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[18]  Una-May O'Reilly,et al.  Transfer Learning using Representation Learning in Massive Open Online Courses , 2019, LAK.

[19]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[20]  Lars Schmidt-Thieme,et al.  Improving Academic Performance Prediction by Dealing with Class Imbalance , 2009, 2009 Ninth International Conference on Intelligent Systems Design and Applications.

[21]  Dragan Gasevic,et al.  Learning analytics should not promote one size fits all: The effects of instructional conditions in predicting academic success , 2016, Internet High. Educ..

[22]  Ozren Gamulin,et al.  Comparing classification models in the final exam performance prediction , 2014, 2014 37th International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[23]  Sebastián Ventura,et al.  Web usage mining for predicting final marks of students that use Moodle courses , 2013, Comput. Appl. Eng. Educ..

[24]  Taghi M. Khoshgoftaar,et al.  A survey of transfer learning , 2016, Journal of Big Data.

[25]  Kalyan Veeramachaneni,et al.  Robust Predictive Models on MOOCs : Transferring Knowledge across Courses , 2016, EDM.

[26]  Kalyan Veeramachaneni,et al.  Transfer Learning for Predictive Models in Massive Open Online Courses , 2015, AIED.

[27]  Marlia Mohd Puteh,et al.  Blended Learning or E-Learning? , 2013, ArXiv.

[28]  J. M. LUNA,et al.  MDM tool: A data mining framework integrated into Moodle , 2017, Comput. Appl. Eng. Educ..

[29]  Qiang Yang,et al.  A Survey on Transfer Learning , 2010, IEEE Transactions on Knowledge and Data Engineering.

[30]  Peter Charles Taylor,et al.  Moodle: Using Learning Communities to Create an Open Source Course Management System , 2003 .

[31]  Mina Shirvani Boroujeni,et al.  On generalizability of MOOC models , 2016, EDM.

[32]  Dan Roth,et al.  DiAd: Domain Adaptation for Learning at Scale , 2019, LAK.