Multi-task MIML learning for pre-course student performance prediction

In higher education, the initial studying period of each course plays a crucial role for students, and seriously influences the subsequent learning activities. However, given the large size of a course’s students at universities, it has become impossible for teachers to keep track of the performance of individual students. In this circumstance, an academic early warning system is desirable, which automatically detects students with difficulties in learning (i.e., at-risk students) prior to a course starting. However, previous studies are not well suited to this purpose for two reasons: 1) they have mainly concentrated on e-learning platforms, e.g., massive open online courses (MOOCs), and relied on the data about students’ online activities, which is hardly accessed in traditional teaching scenarios; and 2) they have only made performance prediction when a course is in progress or even close to the end. In this paper, for traditional classroom-teaching scenarios, we investigate the task of pre-course student performance prediction, which refers to detecting at-risk students for each course before its commencement. To better represent a student sample and utilize the correlations among courses, we cast the problem as a multi-instance multi-label (MIML) problem. Besides, given the problem of data scarcity, we propose a novel multi-task learning method, i.e., MIML-Circle, to predict the performance of students from different specialties in a unified framework. Extensive experiments are conducted on five real-world datasets, and the results demonstrate the superiority of our approach over the state-of-the-art methods.

[1]  Michalis Nik Xenos Prediction and assessment of student behaviour in open and distance education in computers using Bayesian networks , 2004, Comput. Educ..

[2]  Rachel Baker,et al.  The different relationships between engagement and outcomes across participant subgroups in Massive Open Online Courses , 2018, Comput. Educ..

[3]  David D. Clarke,et al.  Identifying barriers to help-seeking: a qualitative analysis of students' preparedness to seek help from tutors , 1998 .

[4]  Sebastián Ventura,et al.  Educational Data Mining: A Review of the State of the Art , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[5]  Jiebo Luo,et al.  Learning multi-label scene classification , 2004, Pattern Recognit..

[6]  Gongping Yang,et al.  Pre-course student performance prediction with multi-instance multi-label learning , 2018, Science China Information Sciences.

[7]  Zhi-Hua Zhou,et al.  Fast Multi-Instance Multi-Label Learning , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Yu Zhang,et al.  A Survey on Multi-Task Learning , 2017, IEEE Transactions on Knowledge and Data Engineering.

[9]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[10]  Alvin Y. Wang,et al.  Predictors of Performance in the Virtual Classroom: Identifying and Helping At-Risk Cyber-Students , 2002 .

[11]  Zachary A. Pardos,et al.  Clustering Students to Generate an Ensemble to Improve Standard Test Score Predictions , 2011, AIED.

[12]  Friedhelm Schwenker,et al.  Ensemble Methods: Foundations and Algorithms [Book Review] , 2013, IEEE Computational Intelligence Magazine.

[13]  Farshid Marbouti,et al.  Models for early prediction of at-risk students in a course using standards-based grading , 2016, Comput. Educ..

[14]  Zhi-Hua Zhou,et al.  Solving multi-instance problems with classifier ensemble based on constructive clustering , 2007, Knowledge and Information Systems.

[15]  Alvin Y. Wang,et al.  A Discourse Analysis of Online Classroom Chats: Predictors of Cyber-Student Performance , 2001 .

[16]  Chia-Lun Lo,et al.  Developing early warning systems to predict students' online learning performance , 2014, Comput. Hum. Behav..

[17]  Zhi-Hua Zhou,et al.  Towards Discovering What Patterns Trigger What Labels , 2012, AAAI.

[18]  Aditya Johri,et al.  Predicting Performance on MOOC Assessments using Multi-Regression Models , 2016, EDM.

[19]  Zhi-Hua Zhou,et al.  Ensemble multi-instance multi-label learning approach for video annotation task , 2011, ACM Multimedia.

[20]  Erkan Er Identifying At-Risk Students Using Machine Learning Techniques: A Case Study with IS 100 , 2012 .

[21]  Jie Xu,et al.  Predicting Grades , 2015, IEEE Transactions on Signal Processing.

[22]  Sotiris B. Kotsiantis,et al.  Preventing Student Dropout in Distance Learning Using Machine Learning Techniques , 2003, KES.

[23]  Sebastián Ventura,et al.  Multiple instance learning for classifying students in learning management systems , 2011, Expert Syst. Appl..

[24]  Sebastián Ventura,et al.  Classification via clustering for predicting final marks starting from the student participation in Forums , 2012, EDM.

[25]  Yuan Jiang,et al.  Complex Object Classification: A Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport , 2018, KDD.

[26]  Ji Feng,et al.  Deep MIML Network , 2017, AAAI.

[27]  Anal Acharya,et al.  Early Prediction of Students Performance using Machine Learning Techniques , 2014 .

[28]  Shane Dawson,et al.  Mining LMS data to develop an "early warning system" for educators: A proof of concept , 2010, Comput. Educ..

[29]  Sotiris B. Kotsiantis,et al.  PREDICTING STUDENTS' PERFORMANCE IN DISTANCE LEARNING USING MACHINE LEARNING TECHNIQUES , 2004, Appl. Artif. Intell..

[30]  Aditya Johri,et al.  Next-Term Student Performance Prediction: A Recommender Systems Approach , 2016, EDM.

[31]  Thomas Hofmann,et al.  Multi-Instance Multi-Label Learning with Application to Scene Classification , 2007 .

[32]  Hanan Ayad,et al.  Student success system: risk analytics and data visualization using ensembles of predictive models , 2012, LAK.

[33]  Wang Shaobo,et al.  Classifier Circle Method for Multi-Label Learning , 2015 .

[34]  Zhi-Hua Zhou,et al.  M3MIML: A Maximum Margin Method for Multi-instance Multi-label Learning , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[35]  Tom Gedeon,et al.  Explaining student grades predicted by a neural network , 1993, Proceedings of 1993 International Conference on Neural Networks (IJCNN-93-Nagoya, Japan).

[36]  Min-Ling Zhang,et al.  A k-Nearest Neighbor Based Multi-Instance Multi-Label Learning Algorithm , 2010, 2010 22nd IEEE International Conference on Tools with Artificial Intelligence.