Mining MOOC Clickstreams: On the Relationship Between Learner Video-Watching Behavior and Performance

We study student behavior and performance in two Massive Open Online Courses (MOOCs). In doing so, we present two frameworks by which video-watching clickstreams can be represented: one based on the sequence of events created, and another on the sequence of positions visited. With the event-based framework, we extract recurring subsequences of student behavior, which contain fundamental characteris- tics such as reflecting (i.e., repeatedly playing and pausing) and revising (i.e., plays and skip backs). We find that some of these behaviors are significantly associated with whether a user will be Correct on First Attempt (CFA) or not in answering quiz questions. With the position-based framework, we then devise models for performance. In evaluating these through CFA prediction, we find that three of them can substantially improve prediction quality in terms of accuracy and F1, which underlines the ability to relate behavior to performance. Since our prediction considers videos individually, these benefits also suggest that our models are useful in situations where there is limited training data, e.g., for early detection or in short courses.

[1]  Zhenming Liu,et al.  Learning about Social Learning in MOOCs: From Statistical Analysis to Generative Model , 2013, IEEE Transactions on Learning Technologies.

[2]  Armando Fox,et al.  Monitoring MOOCs: which information sources do instructors value? , 2014, L@S.

[3]  Sangtae Ha,et al.  Individualization for Education at Scale: MIIC Design and Preliminary Evaluation , 2015, IEEE Transactions on Learning Technologies.

[4]  M. Tamer Özsu,et al.  A Web page prediction model based on click-stream tree representation of user behavior , 2003, KDD '03.

[5]  Patrick Jermann,et al.  Your click decides your fate: Inferring Information Processing and Attrition Behavior from MOOC Video Clickstream Interactions , 2014, Proceedings of the EMNLP 2014 Workshop on Analysis of Large Scale Social Interaction in MOOCs.

[6]  Michael F. Schatz,et al.  Student Use of a Single Lecture Video in a Flipped Introductory Mechanics Course , 2015 .

[7]  Mung Chiang,et al.  Social learning networks: A brief survey , 2014, 2014 48th Annual Conference on Information Sciences and Systems (CISS).

[8]  Maliha S. Nash,et al.  Handbook of Parametric and Nonparametric Statistical Procedures , 2001, Technometrics.

[9]  Jure Leskovec,et al.  Engaging with massive online courses , 2014, WWW.

[10]  Mung Chiang,et al.  MOOC performance prediction via clickstream data and social learning networks , 2015, 2015 IEEE Conference on Computer Communications (INFOCOM).

[11]  Michael Jahrer,et al.  Collaborative Filtering Applied to Educational Data Mining , 2010 .

[12]  Gianluca Antonini,et al.  On nested palindromes in clickstream data , 2012, KDD.

[13]  Gang Wang,et al.  Northeastern University , 2021, IEEE Pulse.

[14]  Justin Cheng,et al.  Tools for predicting drop-off in large online classes , 2013, CSCW '13.

[15]  Chris Piech,et al.  Deconstructing disengagement: analyzing learner subpopulations in massive open online courses , 2013, LAK '13.

[16]  Yoav Bergner,et al.  Model-Based Collaborative Filtering Analysis of Student Response Data: Machine-Learning Item Response Theory , 2012, EDM.

[17]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[18]  N. Heffernan,et al.  Using HMMs and bagged decision trees to leverage rich features of user and skill from an intelligent tutoring system dataset , 2010 .

[19]  Richard G. Baraniuk,et al.  Time-varying learning and content analytics via sparse factor analysis , 2013, KDD.

[20]  Patrick Jermann,et al.  Capturing "attrition intensifying" structural traits from didactic interaction sequences of MOOC learners , 2014, EMNLP 2014.

[21]  Krzysztof Z. Gajos,et al.  Understanding in-video dropouts and interaction peaks inonline lecture videos , 2014, L@S.