Predicting Instructor’s Intervention in MOOC forums

Instructor intervention in student discussion forums is a vital component in Massive Open Online Courses (MOOCs), where personalized interaction is limited. This paper introduces the problem of predicting instructor interventions in MOOC forums. We propose several prediction models designed to capture unique aspects of MOOCs, combining course information, forum structure and posts content. Our models abstract contents of individual posts of threads using latent categories, learned jointly with the binary intervention prediction problem. Experiments over data from two Coursera MOOCs demonstrate that incorporating the structure of threads into the learning problem leads to better predictive performance.

[1]  Natasa Milic-Frayling,et al.  Improving the classification of newsgroup messages through social network analysis , 2007, CIKM '07.

[2]  Michael Gamon,et al.  Predicting Responses to Microblog Posts , 2012, NAACL.

[3]  Erik Aumayr,et al.  Reconstruction of Threaded Conversations in Online Discussion Forums , 2011, ICWSM.

[4]  Jure Leskovec,et al.  Engaging with massive online courses , 2014, WWW.

[5]  Jon M. Kleinberg,et al.  Computational Perspectives on Social Phenomena at Global Scales , 2013, IJCAI.

[6]  Li Wang,et al.  The Utility of Discourse Structure in Forum Thread Retrieval , 2013, AIRS.

[7]  Cornelia Caragea,et al.  Predicting Subjectivity Orientation of Online Forum Threads , 2013, CICLing.

[8]  Lise Getoor,et al.  Modeling Learner Engagement in MOOCs using Probabilistic Soft Logic , 2013 .

[9]  Ming Zhou,et al.  Extracting Chatbot Knowledge from Online Discussion Forums , 2007, IJCAI.

[10]  Thorsten Joachims,et al.  Learning structural SVMs with latent variables , 2009, ICML '09.

[11]  Zhenghao Chen,et al.  Tuned Models of Peer Assessment in MOOCs , 2013, EDM.

[12]  Leonidas J. Guibas,et al.  Syntactic and Functional Variability of a Million Code Submissions in a Machine Learning MOOC , 2013, AIED Workshops.

[13]  Mao Ye,et al.  From user comments to on-line conversations , 2012, KDD.

[14]  Munmun De Choudhury,et al.  What makes conversations interesting?: themes, participants and consequences of conversations in online social media , 2009, WWW '09.

[15]  Paulo Blikstein,et al.  Modeling how students learn to program , 2012, SIGCSE '12.

[16]  Jon Kleinberg,et al.  Differences in the mechanics of information diffusion across topics: idioms, political hashtags, and complex contagion on twitter , 2011, WWW.

[17]  Michael Collins,et al.  Discriminative Training Methods for Hidden Markov Models: Theory and Experiments with Perceptron Algorithms , 2002, EMNLP.

[18]  Ravi Kumar,et al.  Dynamics of conversations , 2010, KDD.

[19]  Gary Geunbae Lee,et al.  Semi-supervised Speech Act Recognition in Emails and Forums , 2009, EMNLP.

[20]  Chris Piech,et al.  Deconstructing disengagement: analyzing learner subpopulations in massive open online courses , 2013, LAK '13.

[21]  Vicenç Gómez,et al.  Statistical analysis of the social network and discussion threads in slashdot , 2008, WWW.

[22]  Kristina Lerman,et al.  Information Contagion: An Empirical Study of the Spread of News on Digg and Twitter Social Networks , 2010, ICWSM.

[23]  Leonidas J. Guibas,et al.  Codewebs: scalable homework search for massive open online programming courses , 2014, WWW.

[24]  M. de Rijke,et al.  Predicting the volume of comments on online news stories , 2009, CIKM.

[25]  Duncan J. Watts,et al.  Everyone's an influencer: quantifying influence on twitter , 2011, WSDM '11.

[26]  Karthik Visweswariah,et al.  Semi-Supervised Answer Extraction from Discussion Forums , 2013, IJCNLP.

[27]  Karen Rose,et al.  What is Twitter , 2009 .

[28]  Noah A. Smith,et al.  What's Worthy of Comment? Content and Comment Volume in Political Blogs , 2010, ICWSM.

[29]  Carolyn Penstein Rosé,et al.  A Feature Based Approach to Leveraging Context for Classifying Newsgroup Style Discussion Segments , 2007, ACL.

[30]  Jon M. Kleinberg,et al.  Tracing information flow on a global scale using Internet chain-letter data , 2008, Proceedings of the National Academy of Sciences.

[31]  Matthew O. Jackson,et al.  Seeing only the successes: The power of selection bias in explaining the structure of observed Internet diusions , 2010 .

[32]  Ramesh Nallapati,et al.  Joint question clustering and relevance prediction for open domain non-factoid question answering , 2014, WWW.

[33]  Ming-Wei Chang,et al.  Discriminative Learning over Constrained Latent Representations , 2010, NAACL.

[34]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[35]  ChengXiang Zhai,et al.  Learning online discussion structures by conditional random fields , 2011, SIGIR.

[36]  Rizal Setya Perdana What is Twitter , 2013 .

[37]  Dan Goldwasser,et al.  “I Object!” Modeling Latent Pragmatic Effects in Courtroom Dialogues , 2014, EACL.

[38]  Jon M. Kleinberg,et al.  Characterizing and curating conversation threads: expansion, focus, volume, re-entry , 2013, WSDM.

[39]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.