Building Student Course Performance Prediction Model Based on Deep Learning

The rate of deferred graduation at Taiwan's universities is estimated at 16%, which affects the scheduling of school resources. If students' academic performance can be anticipated and guidance given to those unlikely to pass the threshold, the waste of school resources can be reduced effectively. In this research, student data and course results from recent years are used as training data to construct student performance prediction models. The K-Means algorithm is used to cluster all courses from the freshman to the senior year. Related courses are grouped in the same cluster, which makes it more likely that similar features are found and improves prediction accuracy. This study then constructs an independent neural network for each course in each academic year. Each model is pre-trained with a denoising autoencoder, and the resulting structure and weights are taken as the initial values of the neural network model. Each neural network is treated as a base predictor. All predictors are integrated into an ensemble predictor, weighted by year, to predict the current students' course performance. As students finish their courses at the end of each semester, the prediction model is tracked and updated through online learning to improve its accuracy.
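The year-weighted ensemble step can be illustrated with a minimal sketch. The abstract does not state how the per-year weights are chosen or updated, so everything below is an assumption: the class name `YearWeightedEnsemble`, the `decay` parameter, the constant stand-in predictors, and the multiplicative-weights update (used here only as one plausible online rule, not the authors' method). Grades are assumed to be scaled to [0, 1].

```python
import math
from dataclasses import dataclass, field
from typing import Callable, Dict, Sequence

# A base predictor maps a student's feature vector to a predicted grade in [0, 1].
Predictor = Callable[[Sequence[float]], float]


@dataclass
class YearWeightedEnsemble:
    """Hypothetical ensemble: one base predictor per academic year, combined by weighted average."""
    predictors: Dict[int, Predictor]
    weights: Dict[int, float] = field(default_factory=dict)
    decay: float = 5.0  # assumed hyperparameter: how strongly squared error shrinks a weight

    def __post_init__(self) -> None:
        # Start from uniform weights when none are supplied.
        if not self.weights:
            self.weights = {year: 1.0 for year in self.predictors}
        self._normalize()

    def _normalize(self) -> None:
        total = sum(self.weights.values())
        if total == 0.0:
            # Guard against underflow: fall back to uniform weights.
            self.weights = {year: 1.0 for year in self.predictors}
            total = float(len(self.weights))
        self.weights = {year: w / total for year, w in self.weights.items()}

    def predict(self, features: Sequence[float]) -> float:
        # Weighted average of the base predictions.
        return sum(self.weights[year] * p(features) for year, p in self.predictors.items())

    def update(self, features: Sequence[float], actual_grade: float) -> None:
        # Assumed online rule (multiplicative weights): predictors with a
        # larger squared error on the observed grade lose relative weight.
        for year, p in self.predictors.items():
            error = (p(features) - actual_grade) ** 2
            self.weights[year] *= math.exp(-self.decay * error)
        self._normalize()


# Stand-in base predictors; in the paper each would be a pre-trained neural network.
ensemble = YearWeightedEnsemble(
    predictors={
        2016: lambda x: 0.60,
        2017: lambda x: 0.70,
        2018: lambda x: 0.80,
    }
)
student = [0.0]  # placeholder feature vector
before = ensemble.predict(student)
ensemble.update(student, actual_grade=0.80)  # grade observed at semester end
after = ensemble.predict(student)
```

In this sketch only the ensemble weights are adapted online; retraining the base networks themselves when new semester results arrive would be a separate step.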
