A Comparative Analysis of Techniques for Predicting Student Performance

The problem of student final grade prediction in a particular course has recently been addressed using data mining techniques. In this paper, we present two different approaches solving this task. Both approaches are validated on 138 courses which were offered to students of the Faculty of Informatics of Masaryk University between the years of 2010 and 2013. The first approach is based on classification and regression algorithms that search for patterns in study-related data and also data about students' social behavior. We prove that students’ social behavior characteristics improve prediction for a quarter of courses. The second approach is based on collaborative filtering techniques. We predict the final grades based on previous achievements of similar students. The results show that both approaches reached similar average results and can be beneficially utilized for student final grade prediction. The first approach reaches significantly better results for courses with a small number of students. In contrary, the second approach achieves significantly better results for mathematical courses. We also identified groups of courses for which we are not able to predict the grades reliably. Finally, we are able to correctly identify half of all failures (that constitute less than a quarter of all grades) and predict the final grades only with the error of one degree in the grade scale.

[1]  Lubomír Popelínský,et al.  The influence of social data on student success prediction , 2014, IDEAS.

[2]  Carlos Márquez-Vera,et al.  Predicting School Failure Using Data Mining , 2011, EDM.

[3]  Hana Byd Are Collaborative Filtering Methods Suitable for Student Performance Prediction , 2015 .

[4]  Nguyen Thai Nghe,et al.  A comparative analysis of techniques for predicting academic performance , 2007, 2007 37th Annual Frontiers In Education Conference - Global Engineering: Knowledge Without Borders, Opportunities Without Passports.

[5]  Sebastián Ventura,et al.  Predicting students' final performance from participation in on-line discussion forums , 2013, Comput. Educ..

[6]  Myra Spiliopoulou,et al.  Hoeffding-CF: Neighbourhood-Based Recommendations on Reliably Similar Users , 2014, UMAP.

[7]  Hendrik Drachsler,et al.  Recommender Systems in Technology Enhanced Learning , 2011, Recommender Systems Handbook.

[8]  S. Wasserman,et al.  Models and Methods in Social Network Analysis: Structural Analysis in the Social Sciences , 2005 .

[9]  Vladimir Batagelj,et al.  Exploratory Social Network Analysis with Pajek , 2005 .

[10]  Rui Abreu,et al.  A Comparative Study of Classification and Regression Algorithms for Modelling Students' Academic Performance. , 2015, EDM 2015.

[11]  Bethany S. Dohleman Exploratory social network analysis with Pajek , 2006 .

[12]  Mohamed El Zeweidy,et al.  A Comparative Analysis of Techniques for Predicting Academic Performance , 2013 .

[13]  Irena Koprinska,et al.  Students at Risk: Detection and Remediation , 2015, EDM.

[14]  Nadine Meskens,et al.  Predicting Academic Performance by Data Mining Methods , 2007 .

[15]  Radek Pelánek,et al.  Student Models for Prior Knowledge Estimation , 2015, EDM.

[16]  Katrien Verbert,et al.  Recommender Systems for Technology Enhanced Learning , 2014, Springer New York.

[17]  Fionn Murtagh,et al.  Algorithms for hierarchical clustering: an overview , 2012, WIREs Data Mining Knowl. Discov..