Mining theory-based patterns from Big data: Identifying self-regulated learning strategies in Massive Open Online Courses

Abstract Big data in education offers unprecedented opportunities to support learners and advance research in the learning sciences. Analysis of observed behaviour using computational methods can uncover patterns that reflect theoretically established processes, such as those involved in self-regulated learning (SRL). This research addresses the question of how to integrate this bottom-up approach of mining behavioural patterns with the traditional top-down approach of using validated self-reporting instruments. Using process mining, we extracted interaction sequences from fine-grained behavioural traces for 3458 learners across three Massive Open Online Courses. We identified six distinct interaction sequence patterns. We matched each interaction sequence pattern with one or more theory-based SRL strategies and identified three clusters of learners. First, Comprehensive Learners, who follow the sequential structure of the course materials, which sets them up for gaining a deeper understanding of the content. Second, Targeting Learners, who strategically engage with specific course content that will help them pass the assessments. Third, Sampling Learners, who exhibit more erratic and less goal-oriented behaviour, report lower SRL, and underperform relative to both Comprehensive and Targeting Learners. Challenges that arise in the process of extracting theory-based patterns from observed behaviour are discussed, including analytic issues and limitations of available trace data from learning platforms.

[1]  David M. Shannon,et al.  Students’ characteristics, self-regulated learning, technology self-efficacy, and course outcomes in online learning , 2013 .

[2]  Jorge Munoz-Gama,et al.  Exploring differences in how learners navigate in MOOCs based on self-regulated learning and learning styles: A process mining approach , 2016, 2016 XLII Latin American Computing Conference (CLEI).

[3]  R. Azevedo Theoretical, conceptual, methodological, and instructional issues in research on metacognition and self-regulated learning: A discussion , 2009 .

[4]  M. Bannert Promoting Self-Regulated Learning Through Prompts , 2009 .

[5]  Justin Reich,et al.  The Life Between Big Data Log Events , 2016 .

[6]  Cheryl I. Johnson,et al.  A Testing Effect with Multimedia Learning , 2009 .

[7]  Eric Rojas Cordoba,et al.  Deciphering event logs in SharePoint Server: A methodology based on process mining , 2014, CLEI.

[8]  Dragan Gasevic,et al.  Learning analytics to unveil learning strategies in a flipped classroom , 2017, Internet High. Educ..

[9]  Anastasia Kitsantas,et al.  Reliability and Validity of Self-Efficacy for Learning Form (SELF) Scores of College Students , 2007 .

[10]  Allison Littlejohn,et al.  Learning in MOOCs: Motivations and self-regulated learning in MOOCs , 2016, Internet High. Educ..

[11]  Carlo P. Magno Validating the Academic Self-Regulated Learning Scale with the Motivated Strategies For Learning Questionnaire (MSLQ) and Learning and Study Strategies Inventory (LASSI) , 2011 .

[12]  Richard Cb Slack,et al.  Measuring the Impact , 1998 .

[13]  Philip H. Winne,et al.  Issues in researching self-regulated learning as patterns of events , 2014 .

[14]  George Siemens,et al.  Editorial: Datasets for Learning Analytics , 2016 .

[15]  Roger Azevedo,et al.  The Temporal and Dynamic Nature of Self-Regulatory Processes During Independent and Externally Assisted Hypermedia Learning , 2011 .

[16]  Carlos Delgado Kloos,et al.  Understanding Learners’ Motivation and Learning Strategies in MOOCs , 2017 .

[17]  José Carlos Núñez,et al.  Self-regulated profiles and academic achievement. , 2008, Psicothema.

[18]  Dragan Gasevic,et al.  Detecting Learning Strategies with Analytics: Links with Self-reported Measures and Academic Performance , 2017, J. Learn. Anal..

[19]  Chris Piech,et al.  Deconstructing disengagement: analyzing learner subpopulations in massive open online courses , 2013, LAK '13.

[20]  R. Azevedo,et al.  Self-Regulated Learning Processes Vary as a Function of Epistemic Beliefs and Contexts: Mixed Method Evidence from Eye Tracking and Concurrent and Retrospective Reports , 2016 .

[21]  Wil M. P. van der Aalst,et al.  Wanna improve process mining results? , 2013, 2013 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[22]  K. Hew,et al.  Students’ and instructors’ use of massive open online courses (MOOCs): Motivations and challenges , 2014 .

[23]  Paul R. Pintrich,et al.  Motivated Strategies for Learning Questionnaire , 2012 .

[24]  I. Jivet,et al.  The Learning Tracker: A Learner Dashboard that Encourages Self-regulation in MOOC Learners , 2016 .

[25]  R. Azevedo Defining and Measuring Engagement and Learning in Science: Conceptual, Theoretical, Methodological, and Analytical Issues , 2015 .

[26]  Claire E. Weinstein,et al.  Self-regulation and learning strategies , 2011 .

[27]  B. Zimmerman Attaining self-regulation: A social cognitive perspective. , 2000 .

[28]  Roger Azevedo,et al.  Teaching and Learning in Technology-Rich Environments , 2006 .

[29]  Geert-Jan Houben,et al.  Follow the successful crowd: raising MOOC completion rates through social comparison at scale , 2017, LAK.

[30]  M. Boekaerts Self-Regulated Learning. , 1999 .

[31]  Wil M. P. van der Aalst,et al.  Process Mining - Discovery, Conformance and Enhancement of Business Processes , 2011 .

[32]  Geert-Jan Houben,et al.  Retrieval Practice and Study Planning in MOOCs: Exploring Classroom-Based Self-regulated Learning Strategies at Scale , 2016, EC-TEL.

[33]  M. Boekaerts SELF-REGULATED LEARNING: A NEW CONCEPT EMBRACED BY RESEARCHERS, POLICY MAKERS, EDUCATORS, TEACHERS, AND STUDENTS , 1997 .

[34]  Gráinne Conole,et al.  Designing effective MOOCs , 2015 .

[35]  Christopher Brooks,et al.  Diverse Big Data and Randomized Field Experiments in MOOCs , 2017 .

[36]  D. Shen,et al.  Self-regulation in online learning , 2013 .

[37]  Joachim Wirth Promoting Self-Regulated Learning Through Prompts , 2009 .

[38]  Marek Hatala,et al.  Measuring the impact of technological scaffolding interventions on micro-level processes of self-regulated workplace learning , 2016, Comput. Hum. Behav..

[39]  Mar Pérez-Sanagustín,et al.  Self-regulated learning strategies predict learner behavior and goal attainment in Massive Open Online Courses , 2017, Comput. Educ..

[40]  Cristóbal Romero,et al.  A survey on educational process mining , 2018, WIREs Data Mining Knowl. Discov..

[41]  P. Pintrich A Manual for the Use of the Motivated Strategies for Learning Questionnaire (MSLQ). , 1991 .

[42]  Ramakrishnan Srikant,et al.  Mining sequential patterns , 1995, Proceedings of the Eleventh International Conference on Data Engineering.

[43]  Maria Bannert,et al.  Discovering the Effects of Metacognitive Prompts on the Sequential Structure of SRL-Processes Using Process Mining Techniques , 2015, J. Learn. Anal..

[44]  Katharina Reinecke,et al.  Demographic differences in how students navigate through MOOCs , 2014, L@S.

[45]  R. Peterson A Meta-analysis of Cronbach's Coefficient Alpha , 1994 .

[46]  Marek Hatala,et al.  Penetrating the black box of time-on-task estimation , 2015, LAK.

[47]  P. Pintrich The role of goal orientation in self-regulated learning. , 2000 .

[48]  Ernesto Panadero,et al.  A Review of Self-regulated Learning: Six Models and Four Directions for Research , 2017, Front. Psychol..

[49]  René F. Kizilcec,et al.  Motivation as a Lens to Understand Online Learners , 2015, ACM Trans. Comput. Hum. Interact..

[50]  George Siemens,et al.  Learning analytics and educational data mining: towards communication and collaboration , 2012, LAK.

[51]  Alex Paramythis,et al.  Activity sequence modelling and dynamic clustering for personalized e-learning , 2011, User Modeling and User-Adapted Interaction.

[52]  W. Lan,et al.  Online self-regulatory learning behaviors as a mediator in the relationship between online course perceptions with achievement , 2008 .

[53]  Philip H. Winne,et al.  Improving Measurements of Self-Regulated Learning , 2010 .

[54]  J. Biggs What the student does: teaching for enhanced learning , 1999 .

[55]  Jaclyn Broadbent,et al.  Comparing online and blended learner's self-regulated learning strategies and academic performance , 2017, Internet High. Educ..

[56]  Fielding I. Winters,et al.  Self-Regulation of Learning within Computer-based Learning Environments: A Critical Analysis , 2008 .

[57]  Ido Roll,et al.  Understanding, evaluating, and supporting self-regulated learning using learning analytics , 2015, J. Learn. Anal..

[58]  Christian W. Günther,et al.  Disco: Discover Your Processes , 2012, BPM.

[59]  Rebecca Eynon,et al.  The rise of Big Data: what does it mean for education, technology, and media research? , 2013 .

[60]  B. Zimmerman Developing self-fulfilling cycles of academic regulation: An analysis of exemplary instructional models. , 1998 .

[61]  P. Reimann,et al.  Process mining techniques for analysing patterns and strategies in students’ self-regulated learning , 2013, Metacognition and Learning.

[62]  Kristy J. Lauver,et al.  Do psychosocial and study skill factors predict college outcomes? A meta-analysis. , 2004, Psychological bulletin.

[63]  Fatos Xhafa,et al.  A Review on Massive E-Learning (MOOC) Design, Delivery and Assessment , 2013, 2013 Eighth International Conference on P2P, Parallel, Grid, Cloud and Internet Computing.

[64]  J. Broadbent,et al.  Self-regulated learning strategies & academic achievement in online higher education learning environments: A systematic review , 2015, Internet High. Educ..

[65]  Wil M. P. van der Aalst,et al.  Fuzzy Mining - Adaptive Process Simplification Based on Multi-perspective Metrics , 2007, BPM.

[66]  Francisco J. García-Peñalvo,et al.  Evolution of the Conversation and Knowledge Acquisition in Social Networks Related to a MOOC Course , 2015, HCI.

[67]  Judy Kay,et al.  Clustering and Sequential Pattern Mining of Online Collaborative Learning Data , 2009, IEEE Transactions on Knowledge and Data Engineering.

[68]  Steve Cooper,et al.  Reflections on Stanford's MOOCs , 2013, CACM.

[69]  Jorge Munoz-Gama,et al.  Process mining in healthcare: A literature review , 2016, J. Biomed. Informatics.

[70]  Wil M. P. van der Aalst,et al.  Learning Analytics on Coursera Event Data: A Process Mining Approach , 2015, SIMPDA.

[71]  Bernhard Schmitz,et al.  Assessing self-regulated learning in higher education: a systematic literature review of self-report instruments , 2016 .

[72]  Ryan S. Baker,et al.  Educational Data Mining and Learning Analytics , 2014 .

[73]  J. Downing,et al.  Learning strategies, learning anxiety and knowledge acquisition. , 2000, British journal of psychology.

[74]  René F. Kizilcec,et al.  Eight-minute self-regulation intervention raises educational attainment at scale in individualist but not collectivist cultures , 2017, Proceedings of the National Academy of Sciences.

[75]  Zhihui Du,et al.  Modeling the learning behaviors of massive open online courses , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[76]  Geraldine Clarebout,et al.  Students' tool-use within a web enhanced course: Explanatory mechanisms of students' tool-use pattern , 2013, Comput. Hum. Behav..

[77]  Geert-Jan Houben,et al.  Beyond the MOOC platform: gaining insights about learners from the social web , 2016, WebSci.

[78]  Barry J. Zimmerman,et al.  Self-Regulated Learning: Theories, Measures, and Outcomes , 2015 .

[79]  P. Pintrich A Conceptual Framework for Assessing Motivation and Self-Regulated Learning in College Students , 2004 .

[80]  John G. Borkowski,et al.  Metacognition: Theory or chapter heading? , 1996 .

[81]  Philip H. Winne,et al.  Examining trace data to explore self-regulated learning , 2007 .

[82]  Cathy van Dyck,et al.  Exploration strategies, performance, and error consequences when learning a complex computer task , 2001 .

[83]  D. Leutner,et al.  Self-Regulated Learning as a Competence Implications of Theoretical Models for Assessment Methods , 2008 .

[84]  Mar Pérez-Sanagustín,et al.  Recommending Self-Regulated Learning Strategies Does Not Improve Performance in a MOOC , 2016, L@S.

[85]  Allyson F. Hadwin,et al.  Enhancing the Quality of Learning: Promoting Learning Skills in Undergraduate Students , 2012 .

[86]  Maria Puzziferro Online Technologies Self-Efficacy and Self-Regulated Learning as Predictors of Final Grade and Satisfaction in College-Level Online Courses , 2008 .

[87]  Ricardo Seguel,et al.  Process Mining Manifesto , 2011, Business Process Management Workshops.

[88]  Donatella Persico,et al.  Self-Regulated Learning , 2011 .

[89]  R. Azevedo,et al.  Does training on self-regulated learning facilitate students' learning with hypermedia? , 2004 .

[90]  van der Wmp Wil Aalst,et al.  Exploring students’ learning behaviour in MOOCs using process mining techniques , 2015 .

[91]  Allison Littlejohn,et al.  Designing MOOCs for professional learners: Tools and patterns to encourage self-regulated learning , 2015 .

[92]  Karen Edge,et al.  Educational assessment, evaluation and accountability , 2012 .

[93]  Dianne Jamieson-Noel,et al.  Self-regulating studying by objectives for learning: Students’ reports compared to a model , 2003 .

[94]  R. Snow Aptitude-treatment interaction as a framework for research on individual differences in learning. , 1989 .

[95]  A. D. Carson,et al.  Predicting Student Success from the Lassi for Learning Online (LLO) , 2011 .

[96]  A. C. Butler,et al.  The critical role of retrieval practice in long-term retention , 2011, Trends in Cognitive Sciences.

[97]  C. Abraham,et al.  Psychological correlates of university students' academic performance: a systematic review and meta-analysis. , 2012, Psychological bulletin.

[98]  van der Wmp Wil Aalst,et al.  Uncovering learning patterns in a MOOC through conformance alignments , 2015 .

[99]  Linda Corrin,et al.  What data and analytics can and do say about effective learning , 2017, npj Science of Learning.

[100]  Yu-Chu Yeh,et al.  Aptitude-treatment interaction , 2012 .

[101]  David E. Pritchard,et al.  Studying Learning in the Worldwide Classroom Research into edX's First MOOC. , 2013 .

[102]  Osmar R. Zaïane,et al.  Data Mining and Learning Analytics: Applications in Educational Research , 2016 .

[103]  Stuart A. Karabenick,et al.  Understanding and facilitating self‐regulated help seeking , 2011 .

[104]  Michael Frese,et al.  Exploration strategies, performance, and error consequences when learning a complex computer task , 2001, Behav. Inf. Technol..

[105]  Birgit Schyns,et al.  A Short Version of the Occupational Self-Efficacy Scale: Structural and Construct Validity Across Five Countries , 2008 .

[106]  Marek Hatala,et al.  A process mining approach to linking the study of aptitude and event facets of self-regulated learning , 2015, LAK.

[107]  Linda Corrin,et al.  Using learning analytics to explore help-seeking learner profiles in MOOCs , 2017, LAK.

[108]  B. Zimmerman,et al.  Development of a Structured Interview for Assessing Student Use of Self-Regulated Learning Strategies , 1986 .

[109]  Philip H. Winne,et al.  Studying as self-regulated learning. , 1998 .

[110]  M. E. Gredler,et al.  Prior Achievement, Aptitude, and Use of Learning Strategies as Predictors of College Student Achievement , 2002 .

[111]  Anouschka van Leeuwen,et al.  Validation of the self-regulated online learning questionnaire , 2016, Journal of Computing in Higher Education.

[112]  Geert-Jan Houben,et al.  Gauging MOOC Learners' Adherence to the Designed Learning Path , 2016, EDM.

[113]  George Siemens,et al.  Where is research on massive open online courses headed? A data analysis of the MOOC research initiative , 2014 .

[114]  D. Schunk Self-Regulated Learning: The Educational Legacy of Paul R. Pintrich , 2005 .

[115]  Cristóbal Romero,et al.  EDUCATIONAL PROCESS MINING , 2016 .

[116]  Philip H. Winne,et al.  Learning Strategies, Study Skills, and Self-Regulated Learning in Postsecondary Education , 2013 .

[117]  Gautam Biswas,et al.  Measuring Self-Regulated Learning Skills through Social Interactions in a teachable Agent Environment , 2010, Res. Pract. Technol. Enhanc. Learn..

[118]  Daniel C. Moos,et al.  Self-Regulation of Learning with Computer-Based Learning Environments , 2011 .

[119]  Philip H. Winne,et al.  How Software Technologies Can Improve Research on Learning and Bolster School Reform , 2006 .

[120]  K. Schmidt Conceptual Framework for , 2002 .

[121]  Sander J. J. Leemans,et al.  PM ^2 : A Process Mining Project Methodology , 2015, CAiSE.