MOOCad: Visual Analysis of Anomalous Learning Activities in Massive Open Online Courses

The research on Massive Open Online Course (MOOC) has mushroomed worldwide due to the technical revolution and its unprecedented enrollments. Existing work mainly focuses on performance prediction, content recommendation, and learning behavior summarization. However, finding anomalous learning activities in MOOC data has posed special challenges and requires providing a clear definition of anomalous behavior, analyzing the multifaceted learning sequence data, and interpreting anomalies at different scales. In this paper, we present a novel visual analytics system, MOOCad, for exploring anomalous learning patterns and their clustering in MOOC data. The system integrates an anomaly detection algorithm to cluster learning sequences of MOOC learners into staged-based groups. Moreover, it allows interactive anomaly detection between and within groups on the basis of semantic and interpretable group-wise data summaries. We demonstrate the effectiveness of MOOCad via an in-depth interview with a MOOC lecturer with real-world course data. CCS Concepts • Human-centered computing → Visual analytics; • Applied computing → E-learning;

[1]  Mark Warschauer,et al.  Predicting MOOC performance with Week 1 Behavior , 2014, EDM.

[2]  Jianyong Wang,et al.  Efficiently Mining Closed Subsequences with Gap Constraints , 2008, SDM.

[3]  Zhi-Hua Zhou,et al.  Isolation Forest , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[4]  Yixin Chen,et al.  ECGLens: Interactive Visual Exploration of Large Scale ECG Data for Arrhythmia Detection , 2018, CHI.

[5]  John R. Anderson,et al.  Cognitive Tutors: Lessons Learned , 1995 .

[6]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[7]  Hongxing He,et al.  Outlier Detection Using Replicator Neural Networks , 2002, DaWaK.

[8]  Andrew Mercer,et al.  Uncertainty-Aware Multidimensional Ensemble Data Visualization and Exploration , 2015, IEEE Transactions on Visualization and Computer Graphics.

[9]  Huilong Duan,et al.  Summarizing clinical pathways from event logs , 2013, J. Biomed. Informatics.

[10]  Yu-Ru Lin,et al.  Voila: Visual Anomaly Detection and Monitoring with Streaming Spatiotemporal Data , 2018, IEEE Transactions on Visualization and Computer Graphics.

[11]  Leonidas J. Guibas,et al.  Syntactic and Functional Variability of a Million Code Submissions in a Machine Learning MOOC , 2013, AIED Workshops.

[12]  Ke Xu,et al.  EventThread: Visual Summarization and Stage Analysis of Event Sequence Data , 2018, IEEE Transactions on Visualization and Computer Graphics.

[13]  Qing Chen,et al.  DropoutSeer: Visualizing learning patterns in Massive Open Online Courses for dropout reasoning and prediction , 2016, 2016 IEEE Conference on Visual Analytics Science and Technology (VAST).

[14]  Yun Wang,et al.  EnsembleLens: Ensemble-based Visual Exploration of Anomaly Detection Algorithms with Multidimensional Data , 2019, IEEE Transactions on Visualization and Computer Graphics.

[15]  David Gotz,et al.  DecisionFlow: Visual Analytics for High-Dimensional Temporal Event Sequence Data , 2014, IEEE Transactions on Visualization and Computer Graphics.

[16]  Krzysztof Z. Gajos,et al.  Understanding in-video dropouts and interaction peaks inonline lecture videos , 2014, L@S.

[17]  Mojtaba Salehi,et al.  Hybrid recommendation approach for learning material based on sequential pattern of the accessed material and the learner's preference tree , 2013, Knowl. Based Syst..

[18]  Michel C. Desmarais,et al.  A review of recent advances in learner and skill modeling in intelligent learning environments , 2012, User Modeling and User-Adapted Interaction.

[19]  Jian Zhao,et al.  Egocentric Analysis of Dynamic Networks with EgoLines , 2016, CHI.

[20]  Hongyuan Zha,et al.  Visual Progression Analysis of Event Sequence Data , 2019, IEEE Transactions on Visualization and Computer Graphics.

[21]  Peter Brusilovsky,et al.  Developing Adaptive Educational Hypermedia Systems: From Design Models to Authoring Tools , 2003 .

[22]  Jian Zhao,et al.  MOOCex: Exploring Educational Video via Recommendation , 2018, ICMR.

[23]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[24]  Jason P. Davis,et al.  Learning Sequences: Their Existence, Effect, and Evolution , 2012 .

[25]  Franck Dernoncourt,et al.  MoocViz: A Large Scale, Open Access, Collaborative, Data Analytics Platform for MOOCs , 2013 .

[26]  Linda Corrin,et al.  Visualizing patterns of student engagement and performance in MOOCs , 2014, LAK.

[27]  Philip J. Guo,et al.  How video production affects student engagement: an empirical study of MOOC videos , 2014, L@S.

[28]  Patrick Jermann,et al.  Capturing "attrition intensifying" structural traits from didactic interaction sequences of MOOC learners , 2014, EMNLP 2014.

[29]  Yale Song,et al.  #FluxFlow: Visual Analysis of Anomalous Information Spreading on Social Media , 2014, IEEE Transactions on Visualization and Computer Graphics.

[30]  Qing Chen,et al.  VisMOOC: Visualizing video clickstream data from massive open online courses , 2015, 2014 IEEE Conference on Visual Analytics Science and Technology (VAST).

[31]  Tzone-I Wang,et al.  A mining-based approach on discovering courses pattern for constructing suitable learning path , 2010, Expert Syst. Appl..

[32]  Jingrui He,et al.  RCLens: Interactive Rare Category Exploration and Identification , 2018, IEEE Transactions on Visualization and Computer Graphics.

[33]  Alfred Kobsa User Modeling and User-Adapted Interaction , 2005, User Modeling and User-Adapted Interaction.

[34]  Olivier Thonnard,et al.  Understanding User Behaviour through Action Sequences: From the Usual to the Unusual , 2019, IEEE Transactions on Visualization and Computer Graphics.