Predicting Student Academic Performance by Means of Associative Classification

The Learning Analytics community has recently paid particular attention to early predict learners’ performance. An established approach entails training classification models from past learner-related data in order to predict the exam success rate of a student well before the end of the course. Early predictions allow teachers to put in place targeted actions, e.g., supporting at-risk students to avoid exam failures or course dropouts. Although several machine learning and data mining solutions have been proposed to learn accurate predictors from past data, the interpretability and explainability of the best performing models is often limited. Therefore, in most cases, the reasons behind classifiers’ decisions remain unclear. This paper proposes an Explainable Learning Analytics solution to analyze learner-generated data acquired by our technical university, which relies on a blended learning model. It adopts classification techniques to early predict the success rate of about 5000 students who were enrolled in the first year courses of our university. It proposes to apply associative classifiers at different time points and to explore the characteristics of the models that led to assign pass or fail success rates. Thanks to their inherent interpretability, associative models can be manually explored by domain experts with the twofold aim at validating classifier outcomes through local rule-based explanations and identifying at-risk/successful student profiles by interpreting the global rule-based model. The results of an in-depth empirical evaluation demonstrate that associative models (i) perform as good as the best performing classification models, and (ii) give relevant insights into the per-student success rate assignments.

[1]  Bing Liu Classification by Association Rule Analysis , 2009, Encyclopedia of Database Systems.

[2]  Luca Cagliero,et al.  Educational video services in universities: A systematic effectiveness analysis , 2017, 2017 IEEE Frontiers in Education Conference (FIE).

[3]  Ramaswamy Palaniappan,et al.  Predicting students’ final degree classification using an extended profile , 2019, Education and Information Technologies.

[4]  Ibrahim Aljarah,et al.  Mining Educational Data to Predict Student’s academic Performance using Ensemble Methods , 2016 .

[5]  Mojisola G. Asogbon,et al.  A Multi-class Support Vector Machine Approach for Students Academic Performance Prediction , 2016 .

[6]  A. D. Carson,et al.  Predicting Student Success from the Lassi for Learning Online (LLO) , 2011 .

[7]  Daniel Spikol,et al.  Modelling collaborative problem-solving competence with transparent learning analytics: is video data enough? , 2020, LAK.

[8]  Nick Z. Zacharis,et al.  A multivariate approach to predicting student outcomes in web-enabled blended learning courses , 2015, Internet High. Educ..

[9]  Mung Chiang,et al.  Behavior-Based Grade Prediction for MOOCs Via Time Series Neural Networks , 2017, IEEE Journal of Selected Topics in Signal Processing.

[10]  Josef Guggemos,et al.  On the predictors of computational thinking and its growth at the high-school level , 2021, Comput. Educ..

[11]  E. P. Lewis In perspective. , 1972, Nursing outlook.

[12]  M. Moore Editorial: Three types of interaction , 1989 .

[13]  Neil T. Heffernan,et al.  Incorporating Rich Features into Deep Knowledge Tracing , 2017, L@S.

[14]  Miguel Ángel Conde González,et al.  Can we predict success from log data in VLEs? Classification of interactions for learning analytics and their relation with performance in VLE-supported F2F and online learning , 2014, Comput. Hum. Behav..

[15]  Joachim M. Buhmann,et al.  The Balanced Accuracy and Its Posterior Distribution , 2010, 2010 20th International Conference on Pattern Recognition.

[16]  David Boulanger,et al.  Explainable Automated Essay Scoring: Deep Learning Really Has Pedagogical Value , 2020, Frontiers in Education.

[17]  Rianne Conijn,et al.  Predicting Student Performance from LMS Data: A Comparison of 17 Blended Courses Using Moodle LMS , 2017, IEEE Transactions on Learning Technologies.

[18]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[19]  Shane Dawson,et al.  Mining LMS data to develop an "early warning system" for educators: A proof of concept , 2010, Comput. Educ..

[20]  George Siemens,et al.  Learning analytics and educational data mining: towards communication and collaboration , 2012, LAK.

[21]  Sebastián Ventura,et al.  Guest Editorial: Special Issue on Early Prediction and Supporting of Learning Performance , 2019, IEEE Trans. Learn. Technol..

[22]  Jürgen Popp,et al.  Common mistakes in cross-validating classification models , 2017 .

[23]  Elena Baralis,et al.  A Lazy Approach to Associative Classification , 2008, IEEE Transactions on Knowledge and Data Engineering.

[24]  Sarah Smith Heckman,et al.  How Widely Can Prediction Models Be Generalized? Performance Prediction in Blended Courses , 2019, IEEE Transactions on Learning Technologies.

[25]  Yaohang Li,et al.  Identifying At-Risk Students for Early Interventions—A Time-Series Clustering Approach , 2017, IEEE Transactions on Emerging Topics in Computing.

[26]  George Karypis,et al.  Feature Extraction for Next-Term Prediction of Poor Student Performance , 2019, IEEE Transactions on Learning Technologies.

[27]  Mohammed J. Zaki,et al.  Lazy Associative Classification , 2006, Sixth International Conference on Data Mining (ICDM'06).

[28]  Charu C. Aggarwal An Introduction to Data Classification , 2014, Data Classification: Algorithms and Applications.

[29]  Juho Leinonen,et al.  Predicting academic performance: a systematic literature review , 2018, ITiCSE.

[30]  Dirk T. Tempelaar,et al.  In search for the most informative data for feedback generation: Learning analytics in a data-rich context , 2015, Comput. Hum. Behav..

[31]  Chia-Lun Lo,et al.  Developing early warning systems to predict students' online learning performance , 2014, Comput. Hum. Behav..

[32]  Tassos A. Mikropoulos,et al.  Predicting Secondary School Students' Performance Utilizing a Semi-supervised Learning Approach , 2019 .

[33]  Hugh Chen,et al.  From local explanations to global understanding with explainable AI for trees , 2020, Nature Machine Intelligence.

[34]  Gabriella Casalino,et al.  Explainable Artificial Intelligence for Human-Centric Data Analysis in Virtual Learning Environments , 2019, HELMeTO.

[35]  Sebastián Ventura,et al.  Evaluating associative classification algorithms for Big Data , 2019, Big Data Analytics.

[36]  Luca Cagliero,et al.  Experimental Validation of a Massive Educational Service in a Blended Learning Environment , 2017, 2017 IEEE 41st Annual Computer Software and Applications Conference (COMPSAC).

[37]  Filip Karlo Dosilovic,et al.  Explainable artificial intelligence: A survey , 2018, 2018 41st International Convention on Information and Communication Technology, Electronics and Microelectronics (MIPRO).

[38]  Brett E. Shelton,et al.  Improving Predictive Modeling for At-Risk Student Identification: A Multistage Approach , 2019, IEEE Transactions on Learning Technologies.

[39]  Olugbenga Adejo,et al.  Predicting student academic performance using multi-model heterogeneous ensemble approach , 2017 .

[40]  Vicente Matellán Olivera,et al.  Predicting academic success through students’ interaction with Version Control Systems , 2019, Open Comput. Sci..

[41]  Sunday Olusanya Olatunji,et al.  Student performance prediction using Support Vector Machine and K-Nearest Neighbor , 2017, 2017 IEEE 30th Canadian Conference on Electrical and Computer Engineering (CCECE).

[42]  Pang-Ning Tan,et al.  Interestingness Measures for Association Patterns : A Perspective , 2000, KDD 2000.

[43]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[44]  Wynne Hsu,et al.  Integrating Classification and Association Rule Mining , 1998, KDD.

[45]  Carlos Guestrin,et al.  "Why Should I Trust You?": Explaining the Predictions of Any Classifier , 2016, ArXiv.

[46]  Anjeela D. Jokhan,et al.  Early warning system as a predictor for student performance in higher education blended courses , 2019 .

[47]  Marek Hatala,et al.  Learning at distance: Effects of interaction traces on academic achievement , 2015, Comput. Educ..