Predictive Models for Undergraduate Student Retention Using Machine Learning Algorithms

In this paper, we have presented some results of undergraduate student retention using machine learning and wavelet decomposition algorithms for classifying the student data. We have also made some improvements to the classification algorithms such as Decision tree, Support Vector Machines (SVM), and neural networks supported by Weka software toolkit. The experiments revealed that the main factors that influence student retention in the Historically Black Colleges and Universities (HBCU) are the cumulative grade point average (GPA) and total credit hours (TCH) taken. The target functions derived from the bare minimum decision tree and SVM algorithms were further revised to create a two-layer neural network and a regression to predict the retention. These new models improved the classification accuracy. Furthermore, we utilized wavelet decomposition and achieved better results.

[1]  Vijayalakshmi,et al.  Implication Of Classification Techniques In Predicting Student’s Recital , 2011 .

[2]  Ivan W. Selesnick,et al.  Wavelet Transforms | A Quick Study , 2007 .

[3]  Manohar Mareboyana,et al.  Machine Learning Algorithms and Predictive Models for Undergraduate Student Retention , 2013 .

[4]  Rachelle S. Heller,et al.  African-american males in computer science---examining the pipeline for clogs , 2008 .

[5]  R Alkhasawneh,et al.  Modeling student retention in science and engineering disciplines using neural networks , 2011, 2011 IEEE Global Engineering Education Conference (EDUCON).

[6]  Tim Menzies,et al.  Learning patterns of university student retention , 2011, Expert Syst. Appl..

[7]  Eibe Frank,et al.  Pruning Decision Trees and Lists , 2000 .

[8]  Donato Malerba,et al.  A Comparative Analysis of Methods for Pruning Decision Trees , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Alan Seidman,et al.  College Student Retention: Formula for Student Success. , 2005 .

[10]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[11]  Dorina Kabakchieva,et al.  Student Performance Prediction by Using Data Mining Classification Algorithms , 2012 .

[12]  Teknik Informatika,et al.  PREDICTION OF STUDENT ACADEMIC PERFORMANCE BY AN APPLICATION OF DATA MINING TECHNIQUES , 2011 .

[13]  Shieu-Hong Lin Data mining for student retention management , 2012 .

[14]  Linda Serra Hagedorn,et al.  How to Define Retention: A New Look at an Old Problem. , 2006 .

[15]  Dipti D. Patil,et al.  Evaluation of Decision Tree Pruning Algorithms for Complexity and Classification Accuracy , 2010 .

[16]  Samuel DiGangi,et al.  A Data Mining Approach for Identifying Predictors of Student Retention from Sophomore to Junior Year , 2021, Journal of Data Science.