Behavior-Based Grade Prediction for MOOCs Via Time Series Neural Networks

We present a novel method for predicting the evolution of a student's grade in massive open online courses (MOOCs). Performance prediction is particularly challenging in MOOC settings due to per-student assessment response sparsity and the need for personalized models. Our method overcomes these challenges by incorporating another, richer form of data collected from each student—lecture video-watching clickstreams—into the machine learning feature set, and using that to train a time series neural network that learns from both prior performance and clickstream data. Through evaluation on two MOOC datasets, we find that our algorithm outperforms a baseline of average past performance by more than 60% on average, and a lasso regression baseline by more than 15%. Moreover, the gains are higher when the student has answered fewer questions, underscoring their ability to provide instructors with early detection of struggling and/or advanced students. We also show that despite these gains, when taken alone, none of the behavioral features are particularly correlated with performance, emphasizing the need to consider their combined effect and nonlinear predictors. Finally, we discuss how course instructors can use these predictive learning analytics to stage student interventions.

[1]  Richard G. Baraniuk,et al.  Time-varying learning and content analytics via sparse factor analysis , 2013, KDD.

[2]  H. Vincent Poor,et al.  Social learning networks: Efficiency optimization for MOOC forums , 2016, IEEE INFOCOM 2016 - The 35th Annual IEEE International Conference on Computer Communications.

[3]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[4]  H. Vincent Poor,et al.  Mining MOOC Clickstreams: Video-Watching Behavior vs. In-Video Quiz Performance , 2016, IEEE Transactions on Signal Processing.

[5]  Aditya Johri,et al.  Predicting Performance on MOOC Assessments using Multi-Regression Models , 2016, EDM.

[6]  Michael C. Mozer,et al.  How Deep is Knowledge Tracing? , 2016, EDM.

[7]  E. Michael Azoff,et al.  Neural Network Time Series: Forecasting of Financial Markets , 1994 .

[8]  René F. Kizilcec,et al.  Motivation as a Lens to Understand Online Learners , 2015, ACM Trans. Comput. Hum. Interact..

[9]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[10]  Qing Chen,et al.  PeakVizor: Visual Analytics of Peaks in Video Clickstreams from Massive Open Online Courses , 2016, IEEE Transactions on Visualization and Computer Graphics.

[11]  Sangtae Ha,et al.  Individualization for Education at Scale: MIIC Design and Preliminary Evaluation , 2015, IEEE Transactions on Learning Technologies.

[12]  Richard G. Baraniuk,et al.  Tag-Aware Ordinal Sparse Factor Analysis for Learning and Content Analytics , 2014, EDM.

[13]  Patrick Jermann,et al.  Your click decides your fate: Inferring Information Processing and Attrition Behavior from MOOC Video Clickstream Interactions , 2014, Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs.

[14]  Mung Chiang,et al.  Social learning networks: A brief survey , 2014, 2014 48th Annual Conference on Information Sciences and Systems (CISS).

[15]  Mihaela van der Schaar,et al.  eTutor: Online learning for personalized education , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Patrick C. Shih,et al.  Understanding Student Motivation, Behaviors and Perceptions in MOOCs , 2015, CSCW.

[17]  Jie Xu,et al.  Predicting Grades , 2015, IEEE Transactions on Signal Processing.

[18]  Mung Chiang,et al.  MOOC performance prediction via clickstream data and social learning networks , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[19]  Zhenming Liu,et al.  Learning about Social Learning in MOOCs: From Statistical Analysis to Generative Model , 2013, IEEE Transactions on Learning Technologies.

[20]  James Henderson A Neural Network Parser that Handles Sparse Data , 2000, IWPT.

[21]  Jure Leskovec,et al.  Engaging with massive online courses , 2014, WWW.

[22]  Sebastián Ventura,et al.  Predicting students' final performance from participation in on-line discussion forums , 2013, Comput. Educ..

[23]  Yoav Bergner,et al.  Model-Based Collaborative Filtering Analysis of Student Response Data: Machine-Learning Item Response Theory , 2012, EDM.

[24]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[25]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[26]  Qian Zhang,et al.  Modeling and Predicting Learning Behavior in MOOCs , 2016, WSDM.

[27]  Niels Pinkwart,et al.  Predicting MOOC Dropout over Weeks Using Machine Learning Methods , 2014, EMNLP 2014.

[28]  Krzysztof Z. Gajos,et al.  Understanding in-video dropouts and interaction peaks in online lecture videos Citation , 2014 .