Early Prediction of Students' Grade Point Averages at Graduation: A Data Mining Approach.

Problem Statement: There has recently been interest in educational databases containing a variety of valuable but sometimes hidden data that can be used to help less successful students to improve their academic performance. The extraction of hidden information from these databases often implements aspects of the educational data mining (EDM) theory, which aims to study available data in order to shed light on more valuable, hidden information. Data clustering, classification, and regression methods such as K-means clustering, neural networks (NN), extreme learning machine (ELM), and support vector machines (SVM) can be used for to predict aspects of the educational data. EDM outputs can ultimately identify which students will need additional help to improve their grade point averages (GPAs) at graduation. Purpose of Study: This study aims to implement several prediction techniques in data mining to assist educational institutions with predicting their students’ GPAs at graduation. If students are predicted to have low GPAs at graduation, then extra efforts can be made to improve their academic performance and, in turn, GPAs. Methods: NN, SVM, and ELM algorithms are applied to data of computer education and instructional technology students to predict their GPAs at graduation. Findings and Results: A comparative analysis of the results indicates that the SVM technique yielded more accurate predictions at a rate of 97.98%. By contrast, the ELM method yielded the second most accurate prediction rate (94.92%) evaluated based on the criterion of correlation coefficient. NN reported the least accurate prediction rate (93.76%). Conclusions and Recommendations: The use of data mining methodologies has recently expanded for a variety of educational purposes. The

[1]  Abdulkadir Sengur,et al.  Online modulation recognition of analog communication signals using neural network , 2007 .

[2]  Jennifer L. Kobrin,et al.  Students with Discrepant High School GPA and SAT Scores , 2003 .

[3]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.

[4]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[5]  Sotiris B. Kotsiantis,et al.  A combinational incremental ensemble of classifiers as a technique for predicting students' performance in distance education , 2010, Knowl. Based Syst..

[6]  Marjan Laal,et al.  Knowledge management in higher education , 2011, WCIT.

[7]  Mehmet Esen,et al.  Forecasting of a ground-coupled heat pump performance using neural networks with statistical data weighting pre-processing , 2008 .

[8]  Saurabh Pal,et al.  Data Mining: A prediction for performance improvement using classification , 2012, ArXiv.

[9]  Neil T. Heffernan,et al.  Can an Intelligent Tutoring System Predict Math Proficiency as Well as a Standarized Test? , 2008, EDM.

[10]  Zlatko J. Kovacic,et al.  Early Prediction of Student Success: Mining Students Enrolment Data , 2010 .

[11]  Abdulkadir Sengür,et al.  Modelling of a new solar air heater through least-squares support vector machines , 2009, Expert Syst. Appl..

[12]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[13]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[14]  W. F. Punch,et al.  Predicting student performance: an application of data mining methods with an educational Web-based system , 2003, 33rd Annual Frontiers in Education, 2003. FIE 2003..

[15]  Lisa Gjedde,et al.  Research, Reflections and Innovations in Integrating ICT in Education Examining online learning processes based on log files analysis: A case , 2022 .

[16]  Senol Zafer Erdogan,et al.  A DATA MINING APPLICATION IN A STUDENT DATABASE , 2005 .

[17]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[18]  Erman Yukselturk,et al.  Predicting Dropout Student: An Application of Data Mining Methods in an Online Education Program , 2014 .

[19]  Mustafa Inalli,et al.  Modeling a ground-coupled heat pump system by a support vector machine , 2008 .

[20]  Okan Bulut,et al.  Application of Computerized Adaptive Testing to Entrance Examination for Graduate Studies in Turkey , 2012 .

[21]  Dursun Delen,et al.  Predicting and analyzing secondary education placement-test scores: A data mining approach , 2012, Expert Syst. Appl..

[22]  R. Bhaskaran,et al.  A Study on Feature Selection Techniques in Educational Data Mining , 2009, ArXiv.

[23]  Qasem A. Al-Radaideh,et al.  Mining Student Data Using Decision Trees , 2006 .

[24]  D. Pintar,et al.  The use of data mining in education environment , 2007, 2007 9th International Conference on Telecommunications.

[25]  Jing Luan,et al.  Data Mining and Knowledge Management in Higher Education -Potential Applications. , 2002 .

[26]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[27]  K. Shyamala,et al.  Data Mining Model for a Better Higher Educational System , 2006 .

[28]  Jennifer L. Kobrin,et al.  Students with Discrepant High School GPA and SAT® I Scores. Research Notes. RN-15. , 2002 .

[29]  Saurabh Pal,et al.  Mining Educational Data to Analyze Students' Performance , 2012, ArXiv.