Causal Analysis of Learning Performance Based on Bayesian Network and Mutual Information

Over the past few years, online learning has grown rapidly in popularity owing to potentially unlimited enrollment, the absence of geographical limitations, and the free accessibility of many courses. However, learners are prone to poor performance because of the unconstrained learning environment, lack of academic pressure, and low interactivity. Personalized interventions designed with learners' background and learning behavior factors in mind may improve their performance. Causal analysis strictly distinguishes cause factors from outcome factors and plays an essential role in designing guiding interventions. The goal of this paper is to construct a Bayesian network for causal analysis and then provide personalized interventions for different learners to improve learning. The paper first constructs a Bayesian network from background and learning behavior factors by combining expert knowledge with a structure learning algorithm. The important factors in the constructed network are then selected using entropy-based mutual information. Finally, we identify learners with poor performance through inference on the network and propose personalized interventions, which may support successful applications in education. Experimental results verify the effectiveness of the proposed method and demonstrate the impact of the factors on learning performance.
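
To make the factor-selection step concrete, the following is a minimal sketch of entropy-based mutual information, using the standard identity I(X;Y) = H(X) + H(Y) - H(X,Y) for discrete variables. It is not the paper's implementation: the factor names (`video_time`, `forum_posts`), the outcome labels, and the toy records are all hypothetical and serve only to illustrate how factors could be ranked by how much information they share with learning performance.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a sequence of discrete values."""
    n = len(values)
    counts = Counter(values)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def mutual_information(xs, ys):
    """I(X;Y) = H(X) + H(Y) - H(X,Y), estimated from paired samples."""
    joint = list(zip(xs, ys))
    return entropy(xs) + entropy(ys) - entropy(joint)


# Hypothetical, already-discretized records for eight learners.
video_time = ["high", "high", "low", "low", "high", "low", "high", "low"]
forum_posts = ["yes", "no", "yes", "no", "yes", "no", "yes", "no"]
performance = ["pass", "pass", "fail", "fail", "pass", "fail", "pass", "fail"]

# Score each candidate factor against the outcome and rank them.
scores = {
    "video_time": mutual_information(video_time, performance),
    "forum_posts": mutual_information(forum_posts, performance),
}
ranked = sorted(scores, key=scores.get, reverse=True)

for name in ranked:
    print(f"{name}: {scores[name]:.3f} bits")
```

In this toy data, `video_time` determines the outcome exactly, so it shares the full one bit of outcome entropy and ranks first, while `forum_posts` shares only a fraction of a bit. Mutual information on its own measures statistical dependence, not causal direction; in the approach described above, direction comes from the Bayesian network structure.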
