The Influence of First Year Behaviour in the Progressions of University Students

Advanced clustering techniques are used on educational data concerning various cohorts of university students. First, K-means analysis is used to classify students according to the results of the self assessment test and the first year performance. Then, the analysis concentrates on the subset of the data involving the cohorts of students for which the behavior during the first, second and third year of University is known. The results of the second and third year are analyzed and the students are re-assigned to the clusters obtained during the analysis of the first year. In this way, for each student we are able to obtain the sequence of traversed clusters during three years, based on the results achieved during the first. For the data set under analysis, this analysis highlights three groups of students strongly affected by the results of the first year: high achieving students who start high and maintain their performance over the time, medium-high achieving students throughout the entire course of study and, low achieving students unable to improve their performance who often abandon their studies. This kind of study can be used by the involved laurea degree to detect critical issues and undertake improvement strategies.

[1]  Joachim M. Buhmann,et al.  Predicting Graduate-level Performance from Undergraduate Achievements , 2011, EDM.

[2]  Alex J. Bowers Analyzing the Longitudinal K-12 Grading Histories of Entire Cohorts of Students: Grades, Data Driven Decision Making, Dropping out and Hierarchical Cluster Analysis. , 2010 .

[3]  Sebastián Ventura,et al.  A Survey on Pre-Processing Educational Data , 2014 .

[4]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[5]  Ryan Shaun Joazeiro de Baker,et al.  Educational Data Mining: An Advance for Intelligent Systems in Education , 2014, IEEE Intelligent Systems.

[6]  Kamelia Stefanova,et al.  Analyzing University Data for Determining Student Profiles and Predicting Performance , 2011, EDM.

[7]  Renzo Sprugnoli,et al.  Data mining models for student careers , 2015, Expert Syst. Appl..

[8]  M. Cecilia Verri,et al.  University Student Progressions and First Year Behaviour , 2017, CSEDU.

[9]  Moti Zwilling,et al.  Student data mining solution-knowledge management system related to higher education institutions , 2014, Expert Syst. Appl..

[10]  Joachim M. Buhmann,et al.  A model-based approach to predicting graduate-level performance using indicators of undergraduate-level performance , 2015, EDM 2015.

[11]  Sebastián Ventura,et al.  Data mining in education , 2013, WIREs Data Mining Knowl. Discov..

[12]  Alejandro Peña-Ayala Review: Educational data mining: A survey and a data mining-based analysis of recent works , 2014 .