Predicting Student Academic Performance at Degree Level: A Case Study

Universities gather large volumes of data with reference to their students in electronic form. The advances in the data mining field make it possible to mine these educational data and find information that allow for innovative ways of supporting both teachers and students. This paper presents a case study on predicting performance of students at the end of a university degree at an early stage of the degree program, in order to help universities not only to focus more on bright students but also to initially identify students with low academic achievement and find ways to support them. The data of four academic cohorts comprising 347 undergraduate students have been mined with different classifiers. The results show that it is possible to predict the graduation performance in 4th year at university using only pre-university marks and marks of 1st and 2nd year courses, no socio-economic or demographic features, with a reasonable accuracy. Furthermore courses that are indicators of particularly good or poor performance have been identified.

[1]  Mykola Pechenizkiy,et al.  Handbook of Educational Data Mining , 2010 .

[2]  Nitesh V. Chawla,et al.  Engagement vs performance: using electronic portfolios to predict first semester engineering student retention , 2014, LAK.

[3]  Kamelia Stefanova,et al.  Analyzing University Data for Determining Student Profiles and Predicting Performance , 2011, EDM.

[4]  Edith Galy,et al.  The Effect of Using E-Learning Tools in Online and Campus-based Classrooms on Student Performance , 2011, J. Inf. Technol. Educ. Res..

[5]  Sebastián Ventura,et al.  Classification via clustering for predicting final marks starting from the student participation in Forums , 2012, EDM.

[6]  I. Kazanidis,et al.  E-Learning Platform Usage Analysis , 2011 .

[7]  P. Golding,et al.  Predicting Academic Performance , 2006, Proceedings. Frontiers in Education. 36th Annual Conference.

[8]  Bing Liu,et al.  Web Data Mining: Exploring Hyperlinks, Contents, and Usage Data , 2006, Data-Centric Systems and Applications.

[9]  Z. Kovacic,et al.  Predicting student success by mining enrolment data. , 2012 .

[10]  Ji Hyea Han,et al.  Data Mining : Concepts and Techniques 2 nd Edition Solution Manual , 2005 .

[11]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[12]  Dursun Delen,et al.  A comparative analysis of machine learning techniques for student retention management , 2010, Decis. Support Syst..

[13]  Nguyen Thai Nghe,et al.  A comparative analysis of techniques for predicting academic performance , 2007, 2007 37th Annual Frontiers In Education Conference - Global Engineering: Knowledge Without Borders, Opportunities Without Passports.

[14]  Mykola Pechenizkiy,et al.  Predicting Students Drop Out: A Case Study , 2009, EDM.

[15]  Raheela Asif,et al.  MINING STUDENT’S ADMISSION DATA AND PREDICTING STUDENT’S PERFORMANCE USING DECISION TREES , 2012 .

[16]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[17]  D. P. Acharjya,et al.  Prediction of Missing Associations Using Rough Computing and Bayesian Classification , 2012 .

[18]  Zdenek Zdráhal,et al.  Improving retention: predicting at-risk students by analysing clicking behaviour in a virtual learning environment , 2013, LAK '13.

[19]  Shaobo Huang,et al.  Predicting student academic performance in an engineering dynamics course: A comparison of four types of predictive mathematical models , 2013, Comput. Educ..

[20]  Joachim M. Buhmann,et al.  Predicting Graduate-level Performance from Undergraduate Achievements , 2011, EDM.

[21]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[22]  Dorina Kabakchieva,et al.  Predicting Student Performance by Using Data Mining Methods for Classification , 2013 .

[23]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[24]  DelenDursun A comparative analysis of machine learning techniques for student retention management , 2010, DSS 2010.

[25]  Kalina Yacef,et al.  Measuring Correlation of Strong Symmetric Association Rules in Educational Data , 2010 .

[26]  Paul Golding,et al.  Predicting Academic Performance in the School of Computing & Information Technology (SCIT) , 2005, Proceedings Frontiers in Education 35th Annual Conference.

[27]  Zachary A. Pardos,et al.  The Effect of Model Granularity on Student Performance Prediction Using Bayesian Networks , 2007, User Modeling.