Enhanced Harris Hawks optimization as a feature selection for the prediction of student performance

Predicting student performance can help educational organizations such as universities, community colleges, schools, and training centers improve their overall results. Big data can be extracted from the internal systems of these organizations, such as exam records, statistics about virtual courses, and e-learning systems, but finding meaningful knowledge in the extracted data is a challenging task. In this paper, we propose a modified version of the Harris Hawks Optimization (HHO) algorithm that controls population diversity to overcome premature convergence and avoid becoming trapped in a local optimum. The proposed approach is employed as a feature selection algorithm to discover the most valuable features for the student performance prediction problem. A dynamic controller monitors the performance of HHO and controls the population diversity, using the k-nearest neighbors (kNN) algorithm as a clustering approach. Once all solutions belong to one cluster, an injection process is employed to redistribute the solutions over the search space. A set of machine learning classifiers, such as kNN, layered recurrent neural network (LRNN), Naive Bayes, and artificial neural network, is used to evaluate the overall prediction system. A real dataset obtained from the UCI Machine Learning Repository is adopted in this paper. The obtained results show the importance of predicting students' performance at an early stage to avoid student failure and improve the overall performance of the educational organization. Moreover, the reported results show that the combination of the enhanced HHO and LRNN can outperform the other classifiers, with an accuracy of 92%, which we attribute to LRNN being a deep learning algorithm that is able to learn from both previous and current input values.
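The diversity check and injection step described above can be illustrated with a minimal sketch. It is not the paper's implementation: the abstract states that kNN is used to decide when all solutions fall into one cluster, whereas this sketch swaps in a simpler stand-in (mean pairwise Hamming distance between binary feature masks). The function names, the `threshold` value, and the choice to keep only the single best solution are all assumptions made for illustration.

```python
import random


def mean_hamming(population):
    """Average pairwise Hamming distance between binary masks, scaled to [0, 1]."""
    n = len(population)
    if n < 2:
        return 0.0
    dims = len(population[0])
    total = 0
    pairs = 0
    for i in range(n):
        for j in range(i + 1, n):
            total += sum(a != b for a, b in zip(population[i], population[j]))
            pairs += 1
    return total / (pairs * dims)


def inject_if_collapsed(population, fitness, threshold=0.05, rng=None):
    """Re-randomise the population when its diversity falls below `threshold`.

    Hypothetical stand-in for the paper's kNN-based controller.
    Assumes lower fitness is better (e.g. classification error).
    Returns (new_population, injected_flag).
    """
    rng = rng or random.Random()
    if mean_hamming(population) >= threshold:
        # Population is still diverse enough: return an unchanged copy.
        return [list(s) for s in population], False

    # Keep the best solution and redistribute the rest over the search space.
    best = min(range(len(population)), key=lambda i: fitness[i])
    dims = len(population[0])
    new_population = []
    for i, solution in enumerate(population):
        if i == best:
            new_population.append(list(solution))
        else:
            new_population.append([rng.randint(0, 1) for _ in range(dims)])
    return new_population, True
```

In a full feature selection loop, a check of this kind would run after each HHO iteration, with each binary mask selecting the dataset columns passed to the classifier that scores the solution.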
