Early Prediction of Students Performance using Machine Learning Techniques

Anal Acharya, Department of Computer Science, St Xavier’s College, Kolkata, India. Devadatta Sinha, Department of Computer Science and Engineering, University of Calcutta, Kolkata, India. ABSTRACT In recent years Educational Data Mining (EDM) has emerged as a new field of research due to the development of several statistical approaches to explore data in educational context. One such application of EDM is early prediction of student results. This is necessary in higher education for identifying the “weak” students so that some form of remediation may be organized for them. In this paper a set of attributes are first defined for a group of students majoring in Computer Science in some undergraduate colleges in Kolkata. Since the numbers of attributes are reasonably high, feature selection algorithms are applied on the data set to reduce the number of features. Five classes of Machine Learning Algorithm (MLA) are then applied on this data set and it was found that the best results were obtained with the decision tree class of algorithms. It was also found that the prediction results obtained with this model are comparable with other previously developed models.

[1]  Sotiris B. Kotsiantis,et al.  Supervised Machine Learning: A Review of Classification Techniques , 2007, Informatica.

[2]  Anal Acharya,et al.  Application of Feature Selection Methods in Educational Data Mining , 2014 .

[3]  Sebastián Ventura,et al.  Educational data mining: A survey from 1995 to 2005 , 2007, Expert Syst. Appl..

[4]  Kate Smith-Miles,et al.  On learning algorithm selection for classification , 2006, Appl. Soft Comput..

[5]  Y. Zhao,et al.  Comparison of decision tree methods for finding active objects , 2007, 0708.4274.

[6]  Huan Liu,et al.  Feature Selection for Classification , 1997, Intell. Data Anal..

[7]  Jerzy Stefanowski,et al.  An Experimental Study of Methods Combining Multiple Classifiers-Diversified both by Feature Selection and Bootstrap Sampling , 2005 .

[8]  Zlatko J. Kovacic,et al.  Early Prediction of Student Success: Mining Students Enrolment Data , 2010 .

[9]  Àngela Nebot,et al.  Applying Data Mining Techniques to e-Learning Problems , 2007 .

[10]  W. F. Punch,et al.  Predicting student performance: an application of data mining methods with an educational Web-based system , 2003, 33rd Annual Frontiers in Education, 2003. FIE 2003..

[11]  William F. Punch,et al.  PREDICTING STUDENT PERFORMANCE: AN APPLICATION OF DATA MINING METHODS WITH THE EDUCATIONAL WEB-BASED SYSTEM LON-CAPA , 2003 .

[12]  Ioannis E. Livieris,et al.  Predicting students' performance using artificial neural networks , 2012 .

[13]  Mohamed El Zeweidy,et al.  A Comparative Analysis of Techniques for Predicting Academic Performance , 2013 .

[14]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[15]  V. O. Oladokun,et al.  Predicting Students' Academic Performance using Artificial Neural Network: A Case Study of an Engineering Course. , 2008 .

[16]  Sotiris B. Kotsiantis,et al.  PREDICTING STUDENTS' PERFORMANCE IN DISTANCE LEARNING USING MACHINE LEARNING TECHNIQUES , 2004, Appl. Artif. Intell..

[17]  Stamos T. Karamouzis,et al.  An Artificial Neural Network for Predicting Student Graduation Outcomes , 2008 .

[18]  R. Bhaskaran,et al.  A CHAID Based Performance Prediction Model in Educational Data Mining , 2010, ArXiv.

[19]  Peter Reutemann,et al.  WEKA Manual for Version 3-6-10 , 2008 .

[20]  Pedro M. Domingos A few useful things to know about machine learning , 2012, Commun. ACM.