Analysis of Student Behaviour in Habitable Worlds Using Continuous Representation Visualization

We introduce a novel approach to visualizing temporal clickstream behaviour in the context of a degree-satisfying online course, Habitable Worlds, offered through Arizona State University. The current practice for visualizing behaviour within a digital learning environment has been to utilize state space graphs and other plots of descriptive statistics on resource transitions. While these forms can be visually engaging, they rely on conditional frequency tabulations which lack contextual depth and require assumptions about the patterns being sought. Skip-grams and other representation learning techniques position elements into a vector space which can capture a wide scope of regularities in the data. These regularities can then be projected onto a two-dimensional perceptual space using dimensionality reduction techniques designed to retain relationships information encoded in the learned representations. While these visualization techniques have been used before in the broader machine learning community to better understand the makeup of a neural network hidden layer or the relationship between word vectors, we apply them to online behavioral learner data and go a step further; exploring the impact of the parameters of the model on producing tangible, non-trivial observations of behaviour that are illuminating and suggestive of pedagogical improvement to the course designers and instructors. The methodology introduced in this paper led to an improved understanding of passing and non-passing student behavior in the course and is widely applicable to other datasets of clickstream activity where investigators and stakeholders wish to organically surface principal behavioral patterns.

[1]  Zachary A. Pardos,et al.  Imputing KCs with Representations of Problem Content and Context , 2017, UMAP.

[2]  Danielle S. McNamara,et al.  Combining click-stream data with NLP tools to better understand MOOC completion , 2016, LAK.

[3]  S. Semken,et al.  Habitable Worlds: Delivering on the Promises of Online Education. , 2018, Astrobiology.

[4]  Pascal Vincent,et al.  Representation Learning: A Review and New Perspectives , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Zachary A. Pardos,et al.  Connectionist recommendation in the wild: on the utility and scrutability of neural networks for personalized course guidance , 2018, User Modeling and User-Adapted Interaction.

[6]  Zachary A. Pardos,et al.  The 2010 KDD Cup Competition Dataset: Engaging the machine learning community in predictive learning analytics , 2016, J. Learn. Anal..

[7]  Padhraic Smyth,et al.  Detecting changes in student behavior from clickstream data , 2017, LAK.

[8]  Ido Roll,et al.  A Visual Approach towards Knowledge Engineering and Understanding How Students Learn in Complex Environments , 2017, L@S.

[9]  Zachary A. Pardos,et al.  Predictive Modelling of Student Behavior Using Granular Large-Scale Action Data , 2017 .

[10]  Zachary A. Pardos,et al.  Distributed Representation of Misconceptions , 2018, ICLS.

[11]  Jimmy J. Lin,et al.  Visual analytics of MOOCs at maryland , 2014, L@S.

[12]  Kalyan Veeramachaneni,et al.  Transfer Learning for Predictive Models in Massive Open Online Courses , 2015, AIED.

[13]  David Williamson Shaffer,et al.  Epistemic Network Analysis: A Worked Example of Theory-Based Learning Analytics , 2017 .

[14]  George Siemens Connectivism: A Learning Theory for the Digital Age , 2004 .

[15]  Joseph Jay Williams,et al.  HarvardX and MITx: Two Years of Open Online Courses Fall 2012-Summer 2014 , 2015 .

[16]  Dragan Gasevic,et al.  Learning analytics to unveil learning strategies in a flipped classroom , 2017, Internet High. Educ..

[17]  Yoav Bergner,et al.  Visualization and Confirmatory Clustering of Sequence Data from a Simulation-Based Assessment Task , 2014, EDM.

[18]  Alex Paramythis,et al.  Activity sequence modelling and dynamic clustering for personalized e-learning , 2011, User Modeling and User-Adapted Interaction.

[19]  David W. Shaffer,et al.  Epistemic frames for epistemic games , 2006, Comput. Educ..

[20]  David E. Pritchard,et al.  Studying Learning in the Worldwide Classroom Research into edX's First MOOC. , 2013 .

[21]  Geoffrey Zweig,et al.  Linguistic Regularities in Continuous Space Word Representations , 2013, NAACL.

[22]  Dragan Gasevic,et al.  Detecting Learning Strategies with Analytics: Links with Self-reported Measures and Academic Performance , 2017, J. Learn. Anal..

[23]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[24]  Yoav Bergner,et al.  Who does what in a massive open online course? , 2014, Commun. ACM.

[25]  Sidney D'Mello,et al.  Data mining and education. , 2015, Wiley interdisciplinary reviews. Cognitive science.

[26]  Olga Caprotti Shapes of Educational Data in an Online Calculus Course , 2017 .

[27]  Zachary A. Pardos,et al.  The School of Information and its relationship to computer science at UC Berkeley , 2017 .

[28]  Katy Börner,et al.  MOOC visual analytics: Empowering students, teachers, researchers, and platform developers of massively open online courses , 2017, J. Assoc. Inf. Sci. Technol..

[29]  Vincent Aleven,et al.  Towards Sensor-Free Affect Detection in Cognitive Tutor Algebra. , 2012, EDM 2012.

[30]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[31]  Dayne Freitag,et al.  Machine Learning for Information Extraction in Informal Domains , 2000, Machine Learning.

[32]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[33]  Jan Geryk Using Visual Analytics Tool for Improving Data Comprehension , 2015, EDM.

[34]  Garron Hillaire,et al.  Prototyping Visual Learning Analytics Guided by an Educational Theory Informed Goal , 2016, J. Learn. Anal..

[35]  Daniel R. Figueiredo,et al.  struc2vec: Learning Node Representations from Structural Identity , 2017, KDD.

[36]  Emden R. Gansner,et al.  Graphviz - Open Source Graph Drawing Tools , 2001, GD.

[37]  Geert-Jan Houben,et al.  Gauging MOOC Learners' Adherence to the Designed Learning Path , 2016, EDM.

[38]  Vincent Aleven,et al.  Sensor-free automated detection of affect in a Cognitive Tutor for Algebra , 2012, EDM.