Using the Belbin method and models for predicting the academic performance of engineering students

This paper describes the process of generating a predictive model of students’ academic performance in different engineering subjects at Universidad Católica del Norte (UCN). It aims to analyze the importance of variables influencing the final average grade of the UCN students in projects related to different subjects, focusing on the dimensions resulting from the Belbin test. The main objective of this work is to provide evidence of the real impact of the Belbin test outcomes on the final performance of a team of students, using as a metric of variable importance the one provided by a Random Forest model, supplied by the scikit‐learn library. As a result, the final classifier presented an accuracy of 80%, and one of the most influential variables according to this model was Covered Roles 2, which represents the number of roles covered in each group. Future research lines are proposed to validate these outcomes, mostly concerned with the acquisition of more data across several future semesters.

[1]  R. Berenson,et al.  Emotional Intelligence as a Predictor for Success in Online Learning , 2008 .

[2]  Michèle Basseville,et al.  Detection of Abrupt Changes: Theory and Applications. , 1995 .

[3]  Guillermo Mendez,et al.  Factors Associated With Persistence in Science and Engineering Majors: An Exploratory Study Using Classification Trees and Random Forests , 2008 .

[4]  Geoffrey I. Webb,et al.  Encyclopedia of Machine Learning , 2011, Encyclopedia of Machine Learning.

[5]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[6]  Wahidah Husain,et al.  A Review on Predicting Student's Performance Using Data Mining Techniques , 2015 .

[7]  Paul Baepler,et al.  Academic Analytics and Data Mining in Higher Education , 2010 .

[8]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Vernon C. Smith,et al.  Predictive Modeling to Forecast Student Outcomes and Drive Effective Interventions in Online Community College Courses , 2012 .

[10]  M. Lawera Predictive inference : an introduction , 1995 .

[11]  Moti Zwilling,et al.  Student data mining solution-knowledge management system related to higher education institutions , 2014, Expert Syst. Appl..

[12]  Senén Barro,et al.  Do we need hundreds of classifiers to solve real world classification problems? , 2014, J. Mach. Learn. Res..

[13]  Víctor Manuel Flores Fonseca,et al.  Applying Active Methodologies for Teaching Software Engineering in Computer Engineering , 2017, Rev. Iberoam. de Tecnol. del Aprendiz..

[14]  Pedro José Lara Bercial,et al.  Project Based Engineering School: una escuela conectada con un mundo profesional sostenible , 2013 .

[15]  John P. Campbell,et al.  Analytics in Higher Education: Establishing a Common Language , 2012 .

[16]  GUÍA PARA EL TRABAJO EN EQUIPO , 2005 .

[17]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[18]  V. Ramesh,et al.  Predicting Student Performance: A Statistical and Data Mining Approach , 2013 .

[19]  Marcia L. Laskey,et al.  Self-Regulated Learning, Metacognition, and Soft Skills: The 21st Century Learner. , 2010 .

[20]  Andrew J. DuBrin,et al.  Human Relations Interpersonal, Job-Oriented Skills , 1978 .

[21]  Dharminder Kumar,et al.  Mining Students' Data for Prediction Performance , 2014, 2014 Fourth International Conference on Advanced Computing & Communication Technologies.

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[23]  J. Kidd Management Teams: Why They Succeed or Fail , 1982 .

[24]  G. E. Rogers,et al.  Soft Skills in the Technology Education Classroom: What Do Students Need?. , 2008 .

[25]  Rashedur M. Rahman,et al.  Improving accuracy of students’ final grade prediction model using optimal equal width binning and synthetic minority over-sampling technique , 2015, Decis. Anal..