Visual Analysis of MOOC Forums with iForum

Discussion forums of Massive Open Online Courses (MOOC) provide great opportunities for students to interact with instructional staff as well as other students. Exploration of MOOC forum data can offer valuable insights for these staff to enhance the course and prepare the next release. However, it is challenging due to the large, complicated, and heterogeneous nature of relevant datasets, which contain multiple dynamically interacting objects such as users, posts, and threads, each one including multiple attributes. In this paper, we present a design study for developing an interactive visual analytics system, called iForum, that allows for effectively discovering and understanding temporal patterns in MOOC forums. The design study was conducted with three domain experts in an iterative manner over one year, including a MOOC instructor and two official teaching assistants. iForum offers a set of novel visualization designs for presenting the three interleaving aspects of MOOC forums (i.e., posts, users, and threads) at three different scales. To demonstrate the effectiveness and usefulness of iForum, we describe a case study involving field experts, in which they use iForum to investigate real MOOC forum data for a course on JAVA programming.

[1]  Anirban Dasgupta,et al.  Superposter behavior in MOOC forums , 2014, L@S.

[2]  Bernard Kerr Thread Arcs: an email thread visualization , 2003, IEEE Symposium on Information Visualization 2003 (IEEE Cat. No.03TH8714).

[3]  Daniel A. Keim,et al.  ForAVis: explorative user forum analysis , 2011, WIMS '11.

[4]  Danny Holten,et al.  Hierarchical Edge Bundles: Visualization of Adjacency Relations in Hierarchical Data , 2006, IEEE Transactions on Visualization and Computer Graphics.

[5]  Carolyn Penstein Rosé,et al.  Sentiment Analysis in MOOC Discussion Forums: What does it tell us? , 2014, EDM.

[6]  Armando Fox,et al.  Monitoring MOOCs: which information sources do instructors value? , 2014, L@S.

[7]  Sanjay Ghemawat,et al.  MapReduce: Simplified Data Processing on Large Clusters , 2004, OSDI.

[8]  Alan J. Dix,et al.  A Taxonomy of Clutter Reduction for Information Visualisation , 2007, IEEE Transactions on Visualization and Computer Graphics.

[9]  Sung-Hee Kim,et al.  VisOHC: Designing Visual Analytics for Online Health Communities , 2016, IEEE Transactions on Visualization and Computer Graphics.

[10]  John G. Keating,et al.  A visualisation tool to aid exploration of students' interactions in asynchronous online communication , 2012, Comput. Educ..

[11]  Alyssa Friend Wise,et al.  Identifying Content-Related Threads in MOOC Discussion Forums , 2015, L@S.

[12]  Dan Goldwasser,et al.  Predicting Instructor’s Intervention in MOOC forums , 2014, ACL.

[13]  Jaime Arguello,et al.  Predicting Speech Acts in MOOC Forum Posts , 2015, ICWSM.

[14]  Zhenming Liu,et al.  Learning about Social Learning in MOOCs: From Statistical Analysis to Generative Model , 2013, IEEE Transactions on Learning Technologies.

[15]  Martin Wattenberg,et al.  Conversation thumbnails for large-scale discussions , 2003, CHI Extended Abstracts.

[16]  Bernard J. Jansen,et al.  An Analysis of MOOC Discussion Forum Interactions from the Most Active Users , 2015, SBP.

[17]  Andreas Kaltenbrunner,et al.  Exploring Asynchronous Online Discussions through Hierarchical Visualisation , 2009, 2009 13th International Conference Information Visualisation.

[18]  Lise Getoor,et al.  Understanding MOOC Discussion Forums using Seeded LDA , 2014, BEA@ACL.

[19]  Paula S. Newman,et al.  Exploring discussion lists: steps and directions , 2002, JCDL '02.

[20]  Carman Neustaedter,et al.  Understanding sequence and reply relationships within email conversations: a mixed-model visualization , 2003, CHI '03.

[21]  Hal Daumé,et al.  Incorporating Lexical Priors into Topic Models , 2012, EACL.

[22]  Omprakash Gnawali,et al.  Language independent analysis and classification of discussion threads in Coursera MOOC forums , 2014, Proceedings of the 2014 IEEE 15th International Conference on Information Reuse and Integration (IEEE IRI 2014).

[23]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[24]  Giuseppe Carenini,et al.  ConVis: A Visual Text Analytic System for Exploring Blog Conversations , 2014, Comput. Graph. Forum.

[25]  Warren Sack,et al.  Discourse diagrams: interface design for very large-scale conversations , 2000, Proceedings of the 33rd Annual Hawaii International Conference on System Sciences.

[26]  Coye Cheshire,et al.  Not Too Long to Read: The tldr Interface for Exploring and Navigating Large-Scale Discussion Spaces , 2010, 2010 43rd Hawaii International Conference on System Sciences.

[27]  Christopher Potts,et al.  Recursive Deep Models for Semantic Compositionality Over a Sentiment Treebank , 2013, EMNLP.

[28]  Eni Mustafaraj,et al.  The Visible and Invisible in a MOOC Discussion Forum , 2015, L@S.

[29]  Hongan Wang,et al.  Visualization of large hierarchical data by circle packing , 2006, CHI.

[30]  Martin Wattenberg,et al.  Flash forums and forumReader: navigating a new kind of large-scale online discussion , 2004, CSCW.

[31]  Ewan Klein,et al.  Natural Language Processing with Python , 2009 .

[32]  Carolyn Penstein Rosé,et al.  “ Turn on , Tune in , Drop out ” : Anticipating student dropouts in Massive Open Online Courses , 2013 .

[33]  M. Braga,et al.  Exploratory Data Analysis , 2018, Encyclopedia of Social Network Analysis and Mining. 2nd Ed..