Needle in a haystack: Identifying learner posts that require urgent response in MOOC discussion forums

Abstract

Although massive open online courses (MOOCs) have been successful in attracting large numbers of learners, they have been less successful in retaining those learners through to course completion. One critical point of failure in many courses, especially those that use discussion forums as a means of collaborative learning, is the sheer volume of messages exchanged on the forums. From the instructors' perspective this volume often creates chaos, and many questions remain unanswered. Lack of attention and response to urgent messages, those that learners need resolved in order to move forward, becomes a major challenge in this environment. This paper proposes a model to identify “urgent” posts that need immediate attention from instructors. In our analysis, we investigate different feature sets and data mining techniques, and report the best-performing features and classification techniques for identifying messages that need urgent attention. The results demonstrate that a limited number of linguistic features, combined with select metadata, can be used to build a moderately to substantially reliable classification model that identifies urgent posts in MOOC forums regardless of course content. The work has potential application across a range of platforms that provide large-scale courses: it can help instructors navigate discussion forums efficiently and prioritize responses, so that timely intervention can support learning and may reduce dropout rates.
