Supervised Learning Algorithms in Educational Data Mining: A Systematic Review

The academic institutions always looking for tools that improve their performance and enhance individuals outcomes. Due to the huge ability of data mining to explore hidden patterns and trends in the data, many researchers paid attention to Educational Data Mining (EDM) in the last decade. This field explores different types of data using different algorithms to extract knowledge that supports decision-making and academic sector development. The researchers in the field of EDM have proposed and adopted different algorithms in various directions. In this review, we have explored the published papers between 2010-2020 in the libraries (IEEE, ACM, Science Direct, and Springer) in the field of EDM are to answer review questions. We aimed to find the most used algorithm by researchers in the field of supervised machine learning in the period of 2010-2020. Additionally, we explored the most direction in the EDM and the interest of the researchers. During our research and analysis, many limitations have been examined and in addition to answering the review questions, some future works have been presented.

[1]  Alaa Khalaf Hamoud,et al.  Student Performance Prediction Model based on Supervised Machine Learning Algorithms , 2020 .

[2]  Miltiadis D. Lytras,et al.  Predicting Student Performance using Advanced Learning Analytics , 2017, WWW.

[3]  Chaman Verma,et al.  Age Group Predictive Models for the Real Time Prediction of the University Students using Machine Learning: Preliminary Results , 2019, 2019 IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT).

[4]  Saeed Shiry Ghidary,et al.  Prediction of student course selection in online higher education institutes using neural network , 2013, Comput. Educ..

[5]  Alaa Khalaf Hamoud Selection of Best Decision Tree Algorithm for Prediction and Classification of Students’ Action , 2016 .

[6]  Baldoino Fonseca dos Santos Neto,et al.  Evaluating the effectiveness of educational data mining techniques for early prediction of students' academic failure in introductory programming courses , 2017, Comput. Hum. Behav..

[7]  Hassan Zeineddine,et al.  Enhancing prediction of student success: Automated machine learning approach , 2021, Comput. Electr. Eng..

[8]  Sonia Berman,et al.  Using Bayesian Networks and Machine Learning to Predict Computer Science Success , 2018 .

[9]  Mustafa Çevik,et al.  Prediction of academic achievements of vocational and technical high school (VTS) students in science courses through artificial neural networks (comparison of Turkey and Malaysia) , 2019, Education and Information Technologies.

[10]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[11]  A M O'Connor,et al.  Conducting Systematic Reviews of Intervention Questions I: Writing the Review Protocol, Formulating the Question and Searching the Literature , 2014, Zoonoses and public health.

[12]  Y. So,et al.  A Tutorial on Logistic Regression , 1996 .

[13]  Nong Ye,et al.  Data Mining: Theories, Algorithms, and Examples , 2013 .

[14]  Maja Matetic,et al.  Mining student data to assess the impact of moodle activities and prior knowledge on programming course success , 2015, CompSysTech '15.

[15]  Alaa Khalaf Hamoud,et al.  Building Data Warehouse for Diseases Registry: First step for Clinical Data Warehouse. , 2013 .

[16]  Tsuyoshi Usagawa,et al.  Mining Educational Data to Predict Academic Dropouts: a Case Study in Blended Learning Course , 2018, TENCON 2018 - 2018 IEEE Region 10 Conference.

[17]  Khaled Shaalan,et al.  Mining in Educational Data: Review and Future Directions , 2020, AICV.

[18]  Rui Guo,et al.  Participation-based student final performance prediction model through interpretable Genetic Programming: Integrating learning analytics, educational data mining and theory , 2015, Comput. Hum. Behav..

[19]  Addin Osman,et al.  Using Data Mining Techniques to Guide Academic Programs Design and Assessment , 2019 .

[20]  Farshid Marbouti,et al.  Models for early prediction of at-risk students in a course using standards-based grading , 2016, Comput. Educ..

[21]  Ravinder Ahuja,et al.  Predicting the probability of student's degree completion by using different data mining techniques , 2017, 2017 Fourth International Conference on Image Information Processing (ICIIP).

[22]  Lars Schmidt-Thieme,et al.  Recommender system for predicting student performance , 2010, RecSysTEL@RecSys.

[23]  Majid Zaman,et al.  An Intelligent Prediction System for Educational Data Mining Based on Ensemble and Filtering approaches , 2020 .

[24]  Syed Abbas Ali,et al.  Analyzing undergraduate students' performance using educational data mining , 2017, Comput. Educ..

[25]  Shane Dawson,et al.  Predicting academic performance by considering student heterogeneity , 2018, Knowl. Based Syst..

[26]  Manjusha Pandey,et al.  Analyzing Student Performance in Engineering Placement Using Data Mining , 2019 .

[27]  D. Gough,et al.  An Introduction to Systematic Reviews , 2017 .

[28]  Jimmy Armas-Aguirre,et al.  Predictive model to reduce the dropout rate of university students in Perú: Bayesian Networks vs. Decision Trees , 2020, 2020 15th Iberian Conference on Information Systems and Technologies (CISTI).

[29]  Moti Zwilling,et al.  Student data mining solution-knowledge management system related to higher education institutions , 2014, Expert Syst. Appl..

[30]  Aqeel Majeed Humadi,et al.  Students’ Success Prediction based on Bayes Algorithms , 2017 .

[31]  Afnan Algobail,et al.  Predicting Students’ Performance in University Courses: A Case Study and Tool in KSU Mathematics Department☆ , 2016 .

[32]  V. Shanmugarajeshwari,et al.  Analysis of students' performance evaluation using classification techniques , 2016, 2016 International Conference on Computing Technologies and Intelligent Data Engineering (ICCTIDE'16).

[33]  Aqeel Majeed Humadi,et al.  Online Real Time Fuzzy Inference System Based Human Health Monitoring and Medical Decision Making , 2017 .

[34]  Norlida Buniyamin,et al.  Educational data mining for prediction and classification of engineering students achievement , 2015, 2015 IEEE 7th International Conference on Engineering Education (ICEED).

[35]  Alaa Khalaf Hamoud,et al.  Using OLAP with Diseases Registry Warehouse for Clinical Decision Support , 2014 .

[36]  Rahul Patil,et al.  Prediction System for Student Performance Using Data Mining Classification , 2018, 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA).

[37]  Hanife Goker,et al.  Improving an Early Warning System to Prediction of Student Examination  Achievement , 2014, 2014 13th International Conference on Machine Learning and Applications.

[38]  Priyanka Sharma,et al.  Performance prediction of students using distributed data mining , 2015, 2015 International Conference on Innovations in Information, Embedded and Communication Systems (ICIIECS).

[39]  Pearl Brereton,et al.  Performing systematic literature reviews in software engineering , 2006, ICSE.

[40]  Charu C. Aggarwal,et al.  Data Mining: The Textbook , 2015 .

[41]  Jitti Niramitranon,et al.  Student performance prediction model for early-identification of at-risk students in traditional classroom settings , 2018, MEDES.

[42]  Cristóbal Romero,et al.  Educational data mining and learning analytics: An updated survey , 2020, WIREs Data Mining Knowl. Discov..

[43]  M. Petticrew,et al.  Systematic Reviews in the Social Sciences: A Practical Guide , 2005 .

[44]  Ryan S. Baker,et al.  The State of Educational Data Mining in 2009: A Review and Future Visions. , 2009, EDM 2009.

[45]  Alaa Khalaf Hamoud,et al.  CLINICAL DATA WAREHOUSE: A REVIEW , 2018, Iraqi Journal for Computers and Informatics.

[46]  Vinayak Hegde,et al.  Prediction of students performance using Educational Data Mining , 2016, 2016 International Conference on Data Mining and Advanced Computing (SAPIENCE).

[47]  Janice D. Gobert,et al.  Using educational data mining to assess students’ skills at designing and conducting experiments within a complex systems microworld , 2015 .

[48]  Fabio A. González,et al.  A Model to Predict Low Academic Performance at a Specific Enrollment Using Data Mining , 2015, IEEE Revista Iberoamericana de Tecnologias del Aprendizaje.

[49]  Jürgen Börstler,et al.  Educational Data Mining and Learning Analytics in Programming: Literature Review and Case Studies , 2015, ITiCSE-WGR.

[50]  Oliver Kramer,et al.  Dimensionality Reduction with Unsupervised Nearest Neighbors , 2013, Intelligent Systems Reference Library.

[51]  Jeffrey S. Rosenthal,et al.  Predicting University Students’ Academic Success and Major Using Random Forests , 2018, Research in Higher Education.

[52]  Hind Almayan,et al.  Improving accuracy of students' final grade prediction model using PSO , 2016, 2016 6th International Conference on Information Communication and Management (ICICM).

[53]  Francisco José García-Peñalvo,et al.  Predicting Student Failure in an Introductory Programming Course with Multiple Back-Propagation , 2019, TEEM.

[54]  Boran Sekeroglu,et al.  Student Performance Prediction and Classification Using Machine Learning Algorithms , 2019, Proceedings of the 2019 8th International Conference on Educational and Information Technology.

[55]  Alaa Khalaf Hamoud,et al.  The Effect of Not Using Internet of Things in Critical life Situations in the Health Field and the Effect on Iraqi Profitability: Empirical Study in Basra , 2019, Journal of Southwest Jiaotong University.

[56]  Ching-Chieh Kiu,et al.  Data Mining Analysis on Student’s Academic Performance through Exploration of Student’s Background and Social Activities , 2018, 2018 Fourth International Conference on Advances in Computing, Communication & Automation (ICACCA).

[57]  Wahyu Indrawan,et al.  Data mining for predicting students' learning result , 2017, 2017 4th International Conference on Computer Applications and Information Processing Technology (CAIPT).

[58]  Vrushali Mhetre,et al.  Classification based data mining algorithms to predict slow, average and fast learners in educational system using WEKA , 2017, 2017 International Conference on Computing Methodologies and Communication (ICCMC).

[59]  Angelos Charitopoulos,et al.  On the Use of Soft Computing Methods in Educational Data Mining and Learning Analytics Research: a Review of Years 2010–2018 , 2020, International Journal of Artificial Intelligence in Education.

[60]  Ceasar Ian P. Benablo,et al.  Higher Education Student's Academic Performance Analysis through Predictive Analytics , 2018, ICSCA.

[61]  Hüseyin Gürüler,et al.  A new student performance analysing system using knowledge discovery in higher educational databases , 2010, Comput. Educ..

[62]  Sérgio Manuel Serra da Cruz,et al.  Towards automatic prediction of student performance in STEM undergraduate degree programs , 2015, SAC.

[63]  Alaa Khalaf Hamoud,et al.  A REVIEW ON INTERNET OF THINGS ARCHITECTURE FOR BIG DATA PROCESSING , 2020 .

[64]  M. C. Nicoletti,et al.  A data mining approach for forecasting students' performance , 2018, 2018 13th Iberian Conference on Information Systems and Technologies (CISTI).

[65]  Raoul Kwuimi,et al.  Educational Data Mining to Improve Learner Performance in Gauteng Primary Schools , 2018, 2018 International Conference on Advances in Big Data, Computing and Data Communication Systems (icABCD).

[66]  Marija Bezbradica,et al.  Predicting Students’ Academic Performance and Main Behavioral Features Using Data Mining Techniques , 2019, Communications in Computer and Information Science.

[67]  Nongnuch Ketui,et al.  Using Classification Data Mining Techniques for Students Performance Prediction , 2019, 2019 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT-NCON).

[68]  Li Yang,et al.  Predicting Students Performance in Educational Data Mining , 2015, 2015 International Symposium on Educational Technology (ISET).

[69]  Subitha Sivakumar,et al.  Predictive Modeling of Students Performance Through the Enhanced Decision Tree , 2018 .

[70]  Andre B. de Carvalho,et al.  Supervised Learning in the Context of Educational Data Mining to Avoid University Students Dropout , 2019, 2019 IEEE 19th International Conference on Advanced Learning Technologies (ICALT).

[71]  Alaa Khalaf Hamoud Applying Association Rules and Decision Tree Algorithms with Tumor Diagnosis Data , 2017 .

[72]  V. Hurbungs,et al.  A Machine Learning Model to Predict the Performance of University Students , 2018 .

[73]  P. G. Sunitha Hiremath,et al.  Student academic performance and social behavior predictor using data mining techniques , 2017, 2017 International Conference on Computing, Communication and Automation (ICCCA).

[74]  Saud Altaf,et al.  Student Performance Prediction using Multi-Layers Artificial Neural Networks: A Case Study on Educational Data Mining , 2019, ICISDM.

[75]  A. Sabitha,et al.  CLASSIFYING STUDENTS ’ ANSWERS USING CLUSTERING ALGORITHMS BASED ON PRINCIPLE COMPONENT ANALYSIS , 2018 .

[76]  Vicente García-Díaz,et al.  Supporting academic decision making at higher educational institutions using machine learning-based algorithms , 2018, Soft Computing.

[77]  Wu Zhang,et al.  Using machine learning to predict student difficulties from learning session data , 2018, Artificial Intelligence Review.

[78]  Nazar Zaki,et al.  Using Educational Data Mining Techniques to Predict Student Performance , 2019, 2019 International Conference on Electrical and Computing Technologies and Applications (ICECTA).

[79]  Elizabeth A. Cudney,et al.  Predicting Student Retention Using Support Vector Machines , 2019, Procedia Manufacturing.

[80]  Justine Cassell,et al.  Connecting the Dots: Predicting Student Grade Sequences from Bursty MOOC Interactions over Time , 2015, L@S.

[81]  Harris Cooper,et al.  Effects of Full-Day Kindergarten on Academic Achievement and Social Development , 2010 .

[82]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[83]  Camilo Castellanos,et al.  Applying Data Mining Techniques to Predict Student Dropout: A Case Study , 2018, 2018 IEEE 1st Colombian Conference on Applications in Computational Intelligence (ColCACI).

[84]  E. Mohammadi,et al.  Barriers and facilitators related to the implementation of a physiological track and trigger system: A systematic review of the qualitative evidence , 2017, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[85]  William Stafford Noble,et al.  Support vector machine , 2013 .

[86]  Raymond Lister,et al.  Students' Syntactic Mistakes in Writing Seven Different Types of SQL Queries and its Application to Predicting Students' Success , 2016, SIGCSE.

[87]  Anne-Sophie Hoffait,et al.  Early detection of university students with potential difficulties , 2017, Decis. Support Syst..

[88]  Rommel N. Carvalho,et al.  Educational data mining: Predictive analysis of academic performance of public school students in the capital of Brazil , 2019, Journal of Business Research.

[89]  Wichai Puarungroj,et al.  Application of Data Mining Techniques for Predicting Student Success in English Exit Exam , 2018, IMCOM.

[90]  Md. Rabiul Islam,et al.  Predict Student's Academic Performance and Evaluate the Impact of Different Attributes on the Performance Using Data Mining Techniques , 2017, 2017 2nd International Conference on Electrical & Electronic Engineering (ICEEE).

[91]  Shinichi Oeda,et al.  Visualization of Programming Skill Structure by Log-Data Analysis with Decision Tree , 2019, KES.

[92]  Sushruta Mishra,et al.  Enhancing the capabilities of Student Result Prediction System , 2016, ICTCS.

[93]  Alaa Khalaf Hamoud,et al.  A Review of Various Steganography Techniques in Cloud Computing , 2019, University of Thi-Qar Journal of Science.

[94]  T. Chellatamilan,et al.  Effect of mining educational data to improve adaptation of learning in e-learning system , 2011 .

[95]  Kamran Shaukat,et al.  Student's performance in the context of data mining , 2016, 2016 19th International Multi-Topic Conference (INMIC).

[96]  José Antonio Pow-Sang,et al.  A systematic review of usability techniques in agile methodologies , 2014, EATIS '14.

[97]  Reshma Gulwani,et al.  Predictive analytics for E learning system , 2017, 2017 International Conference on Inventive Systems and Control (ICISC).

[98]  Marian Cristian Mihaescu,et al.  Review on publicly available datasets for educational data mining , 2021, Wiley Interdiscip. Rev. Data Min. Knowl. Discov..

[99]  Theo Stijnen,et al.  A Model to Predict Student Failure in the First Year of the Undergraduate Medical Curriculum , 2017 .

[100]  Sebastián Ventura,et al.  Data mining in education , 2013, WIREs Data Mining Knowl. Discov..

[101]  Mohamed Ezz,et al.  Adaptive recommendation system using machine learning algorithms for predicting student’s best academic program , 2019, Education and Information Technologies.

[102]  Manpreet Singh,et al.  Classification and Prediction Based Data Mining Algorithms to Predict Slow Learners in Education Sector , 2015 .

[103]  D. S. B. Fonseca,et al.  A data mining approach to predict undergraduate students' performance , 2018, 2018 13th Iberian Conference on Information Systems and Technologies (CISTI).

[104]  Maria Meehan,et al.  Contrasting prediction methods for early warning systems at undergraduate level , 2016, Internet High. Educ..

[105]  S. Sathiya Keerthi,et al.  Parallel sequential minimal optimization for the training of support vector machines , 2006, IEEE Trans. Neural Networks.

[106]  Sebastián Ventura,et al.  Predicting students' final performance from participation in on-line discussion forums , 2013, Comput. Educ..

[107]  Rashmi Agrawal,et al.  Analysis of Educational Data Mining using Classification , 2019, 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon).

[108]  Alexander A. Hernandez,et al.  Modelling Student Performance Using Data Mining Techniques: Inputs for Academic Program Development , 2019, Proceedings of the 2019 5th International Conference on Computing and Data Engineering - ICCDE' 19.

[109]  Juan Alfonso Lara,et al.  Data mining for modeling students' performance: A tutoring action plan to prevent academic dropout , 2017, Comput. Electr. Eng..

[110]  Reymon Rotikan,et al.  Students' Academic Performance Prediction using Data Mining , 2018, 2018 Third International Conference on Informatics and Computing (ICIC).

[111]  Singh Umesh Kumar,et al.  Data mining: Prediction for performance improvement of graduate students using classification , 2012, 2012 Ninth International Conference on Wireless and Optical Communications Networks (WOCN).

[112]  Hiroaki Ogata,et al.  A neural network approach for students' performance prediction , 2017, LAK.

[113]  Pornthep Rojanavasu,et al.  Educational Data Analytics using Association Rule Mining and Classification , 2019, 2019 Joint International Conference on Digital Arts, Media and Technology with ECTI Northern Section Conference on Electrical, Electronics, Computer and Telecommunications Engineering (ECTI DAMT-NCON).

[114]  Youqiang Guo,et al.  Improving Prediction of Student Performance based on Multiple Feature Selection Approaches , 2017 .

[115]  Abhishek Singhal,et al.  PREDICTION & WARNING: a method to improve student's performance , 2014, SOEN.

[116]  Alaa Khalaf Hamoud,et al.  Design and Implementing Cancer Data Warehouse to Support Clinical Decisions , 2016 .

[117]  Sujing Wang,et al.  Analyze and Predict Student Dropout from Online Programs , 2018, ICCDA 2018.

[118]  Thelma D. Palaoag,et al.  Predicting Student's Board Examination Performance using Classification Algorithms , 2018, ICSCA.

[119]  William G. Griswold,et al.  A Robust Machine Learning Technique to Predict Low-performing Students , 2019, ACM Trans. Comput. Educ..

[120]  Steven M. Corns,et al.  Multi-Objective Evolutionary Neural Network to Predict Graduation Success at the United States Military Academy , 2018 .

[121]  Jayanthi Ranjan,et al.  Effective educational process: a data‐mining approach , 2007 .

[122]  M. Bennett,et al.  Preadmission Predictors of Student Success in a Baccalaureate of Science in Nursing Program , 2016 .

[123]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[124]  Aqeel Majeed Humadi,et al.  Liver Hepatitis Diagnosing based on Fuzzy Inference System , 2019 .

[125]  Erdal Irmak,et al.  The Estimation of Students' Academic Success by Data Mining Methods , 2013, 2013 12th International Conference on Machine Learning and Applications.

[126]  Jalal Nouri,et al.  Identifying Factors for Master Thesis Completion and Non-completion Through Learning Analytics and Machine Learning , 2019, EC-TEL.

[127]  Muna S. Al-Razgan,et al.  Predicting Critical Courses Affecting Students Performance: A Case Study , 2016 .

[128]  Qing Zhou,et al.  Predicting the students with mental health risk by using Internet access logs , 2018, 2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC).

[129]  Mariel Musso,et al.  Predicting key educational outcomes in academic trajectories: a machine-learning approach , 2020, Higher Education.

[130]  Tomasz Imielinski,et al.  Mining association rules between sets of items in large databases , 1993, SIGMOD Conference.

[131]  Ignacio Bosch,et al.  Machine Learning Prediction Approach to Enhance Congestion Control in 5G IoT Environment , 2019, Electronics.

[132]  Cristóbal Romero,et al.  Predicting academic performance of university students from multi-sources data in blended learning , 2019, DATA.