An empirical analysis of classification techniques for predicting academic performance

A multiclass classification refers to the classification of the instance into more than two classes. In real life many classification problem requires decisions among a set of contending classes. Multiclass classification and prediction is suitable for hand written digit recognition, hand written character recognition, speech recognition and body parts recognition etc. This paper compares five classification algorithms namely Decision Tree, Naïve Bayes, Naïve Bayes Tree, K-Nearest Neighbor and Bayesian Network algorithms for predicting students' grade particularly for engineering students. This is a four class prediction problem. Student's marks are classified into four classes A, B, C and F respectively. Initially complete data set is used to build the classifiers then Bootstrap method is used to improve the accuracy of the each classifier. Bootstrap method is a resample function available in WEKA tool kit. The excellent results of this function can be seen through IBK, Decision Tree and Bayes Net algorithm. However the overall results of all four algorithms are good but the results of individual classes for Naïve Bayes and NB Tree is not sufficient enough for the individual class prediction particularly for this study. This paper also presents a comparative study of the previous work related to student's performance predictions.

[1]  Vivek Kumar Sharma,et al.  A Decision Tree Algorithm Pertaining to the Student Performance Analysis and Prediction , 2013 .

[2]  R. R.Kabra,et al.  Performance Prediction of Engineering Students using Decision Trees , 2011 .

[3]  Boumedyen A.N. Shannaq,et al.  Student Relationship in Higher Education Using Data Mining Techniques , 2010 .

[4]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[5]  Petra Perner,et al.  Data Mining - Concepts and Techniques , 2002, Künstliche Intell..

[6]  P. Haddawy,et al.  A decision support system for evaluating international student applications , 2007, 2007 37th Annual Frontiers In Education Conference - Global Engineering: Knowledge Without Borders, Opportunities Without Passports.

[7]  Sotiris B. Kotsiantis Use of machine learning techniques for educational proposes: a decision support system for forecasting students’ grades , 2011, Artificial Intelligence Review.

[8]  Marta E. Zorrilla,et al.  Comparing classification methods for predicting distance students' performance , 2011, WAPA.

[9]  Sunita B Aher,et al.  Data Mining in Educational System using WEKA , 2011 .

[10]  Nguyen Thai Nghe,et al.  A comparative analysis of techniques for predicting academic performance , 2007, 2007 37th Annual Frontiers In Education Conference - Global Engineering: Knowledge Without Borders, Opportunities Without Passports.

[11]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[12]  Abdul Razak Hamdan,et al.  Classification and Prediction of Academic Talent Using Data Mining Techniques , 2010, KES.

[13]  R. S. Bichkar,et al.  Performance Prediction of Engineering Students using Decision Trees , 2011 .

[14]  Qasem A. Al-Radaideh,et al.  Mining Student Data Using Decision Trees , 2006 .

[15]  VialardiCésar,et al.  A data mining approach to guide students through the enrollment process based on academic performance , 2011 .

[16]  Cali M. Davis,et al.  Data Mining Applications in Higher Education , 2007 .

[17]  Margaret H. Dunham,et al.  Data Mining: Introductory and Advanced Topics , 2002 .

[18]  Saurabh Pal,et al.  Mining Educational Data to Analyze Students' Performance , 2012, ArXiv.

[19]  Alvaro Ortigosa,et al.  A data mining approach to guide students through the enrollment process based on academic performance , 2011, User Modeling and User-Adapted Interaction.

[20]  S. Anupama Kumar EFFICIENCY OF DECISION TREES IN PREDICTING STUDENT'S ACADEMIC PERFORMANCE , 2011 .