Predicting students marks in Hellenic Open University

The ability to provide assistance for a student at the appropriate level is invaluable in the learning process. Not only does it aids the student's learning process but also prevents problems, such as student frustration and floundering. Students' key demographic characteristics and their marks in a small number of written assignments can constitute the training set for a regression method in order to predict the student's performance. The scope of this work compares some of the state of the art regression algorithms in the application domain of predicting students' marks. A number of experiments have been conducted with six algorithms, which were trained using datasets provided by the Hellenic Open University. Finally, a prototype version of software support tool for tutors has been constructed implementing the M5rules algorithm, which proved to be the most appropriate among the tested algorithms.

[1]  John C. Platt Using Analytic QP and Sparseness to Speed Training of Support Vector Machines , 1998, NIPS.

[2]  Sotiris B. Kotsiantis,et al.  Preventing Student Dropout in Distance Learning Using Machine Learning Techniques , 2003, KES.

[3]  Ian H. Witten,et al.  Induction of model trees for predicting continuous classes , 1996 .

[4]  N. Draper,et al.  Applied Regression Analysis , 1966 .

[5]  Yoshua Bengio,et al.  Inference for the Generalization Error , 1999, Machine Learning.

[6]  Ian Witten,et al.  Data Mining , 2000 .

[7]  J. Fox Applied Regression Analysis, Linear Models, and Related Methods , 1997 .

[8]  Sotiris B. Kotsiantis,et al.  PREDICTING STUDENTS' PERFORMANCE IN DISTANCE LEARNING USING MACHINE LEARNING TECHNIQUES , 2004, Appl. Artif. Intell..

[9]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[10]  S. Sathiya Keerthi,et al.  Improvements to the SMO algorithm for SVM regression , 2000, IEEE Trans. Neural Networks Learn. Syst..

[11]  Rajeev Sharma,et al.  Advances in Neural Information Processing Systems 11 , 1999 .

[12]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[13]  Piet Kommers,et al.  Proceedings of the Sixth International Conference on Advanced Learning Technologies , 2006 .

[14]  Panayiotis E. Pintelas,et al.  A survey on student dropout rates and dropout causes concerning the students in the Course of Informatics of the Hellenic Open University , 2002, Comput. Educ..

[15]  Marko Robnik-Sikonja,et al.  An adaptation of Relief for attribute estimation in regression , 1997, ICML.

[16]  Andrew W. Moore,et al.  Locally Weighted Learning , 1997, Artificial Intelligence Review.