CLMS-Net: dropout prediction in MOOCs with deep learning

Massive Open Online Courses (MOOCs) have played an increasingly crucial role in education, but the high dropout rate problem is great serious. Predicting whether students will dropout has attracted considerable attention. Current methods mainly depend on handcrafted features. The process is laborious and not scalability, and even difficult to guarantee the final prediction effect. In this paper, we propose a deep neural network model, which is a combination of Convolutional Neural Network, Long Short-Term Memory network and Support Vector Machine. Our model has an effective feature extraction strategy, which automatically extract features from the raw data, and takes into account the impact of the sequential relationship of student behavior and class imbalance on dropout and, most importantly, reinforce the performance of dropout prediction. Extensive experiments on a public dataset have shown that the proposed model can achieve better results comparable to feature engineering based methods and other neural network methods.

[1]  Ju Jia Zou,et al.  Texture image classification with discriminative neural networks , 2016, Computational Visual Media.

[2]  Leo Breiman,et al.  Using Iterated Bagging to Debias Regressions , 2001, Machine Learning.

[3]  Harry Zhang,et al.  The Optimality of Naive Bayes , 2004, FLAIRS.

[4]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[5]  Zachary A. Pardos,et al.  Modelling Student Behavior using Granular Large Scale Action Data from a MOOC , 2016, ArXiv.

[6]  Niels Pinkwart,et al.  Predicting MOOC Dropout over Weeks Using Machine Learning Methods , 2014, EMNLP 2014.

[7]  Chunyan Miao,et al.  Deep Model for Dropout Prediction in MOOCs , 2017, ICCSE'17.

[8]  Tong Zhang,et al.  Effective Use of Word Order for Text Categorization with Convolutional Neural Networks , 2014, NAACL.

[9]  Shirley Williams,et al.  MOOCs: A systematic study of the published literature 2008-2012 , 2013 .

[10]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[11]  Kalyan Veeramachaneni,et al.  Likely to stop? Predicting Stopout in Massive Open Online Courses , 2014, ArXiv.

[12]  Patrick Jermann,et al.  Your click decides your fate: Inferring Information Processing and Attrition Behavior from MOOC Video Clickstream Interactions , 2014, Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs.

[13]  Hua Li,et al.  Dropout prediction in MOOCs using behavior features and multi-view semi-supervised learning , 2016, 2016 International Joint Conference on Neural Networks (IJCNN).

[14]  Phil Blunsom,et al.  A Convolutional Neural Network for Modelling Sentences , 2014, ACL.

[15]  Martin Ebner,et al.  MOOCs Completion Rates and Possible Methods to Improve Retention - A Literature Review , 2014 .

[16]  K. Robert Lai,et al.  Dimensional Sentiment Analysis Using a Regional CNN-LSTM Model , 2016, ACL.

[17]  Jacob Whitehill,et al.  Delving Deeper into MOOC Student Dropout Prediction , 2017, ArXiv.

[18]  Zhang Xiong,et al.  Time Series Model for Predicting Dropout in Massive Open Online Courses , 2018, AIED.

[19]  Fei-Fei Li,et al.  Large-Scale Video Classification with Convolutional Neural Networks , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Patricia Charlton,et al.  Conference on Artificial Intelligence in Education , 2009 .

[21]  Dit-Yan Yeung,et al.  Temporal Models for Predicting Student Dropout in Massive Open Online Courses , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[22]  Jacob Whitehill,et al.  MOOC Dropout Prediction: How to Measure Accuracy? , 2017, L@S.

[23]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[24]  Lise Getoor,et al.  Learning Latent Engagement Patterns of Students in Online Courses , 2014, AAAI.

[25]  Li Chen,et al.  A Nonlinear State Space Model for Identifying At-Risk Students in Open Online Courses , 2016, EDM.

[26]  Zachary A. Pardos,et al.  Predictive Modelling of Student Behavior Using Granular Large-Scale Action Data , 2017 .

[27]  Sreerama K. Murthy,et al.  Automatic Construction of Decision Trees from Data: A Multi-Disciplinary Survey , 1998, Data Mining and Knowledge Discovery.

[28]  Wenguang Chen,et al.  What Decides the Dropout in MOOCs? , 2017, DASFAA Workshops.

[29]  Suma Bhat,et al.  Predicting Attrition Along the Way: The UIUC Model , 2014, EMNLP 2014.

[30]  Jian Yang,et al.  Big Data Application in Education: Dropout Prediction in Edx MOOCs , 2016, 2016 IEEE Second International Conference on Multimedia Big Data (BigMM).