Predicting Continued Participation in Online Health Forums

Online health forums provide advice and emotional solace to their users from a social network of people who have faced similar conditions. Continued participation of users is thus critical to their success. In this paper, we develop machine learning models for predicting whether or not a user will continue to participate in an online health forum. The prediction models are trained and tested over a large dataset collected from the support group based social networking site dailystrength.org. We find that our models can predict continued participation with over 83% accuracy after as little as 1 month observing the user’s activities, and that performance increases rapidly up to 1 year of observation. We also show that features such as the time since a user’s last activity are consistently predictive regardless of the length of the observation period, while other features, such as the number of times a user replies to others, decrease in predictiveness as the observation period grows.

[1]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[2]  Robert E. Kraut,et al.  Predicting Continued Participation in Newsgroups , 2006, J. Comput. Mediat. Commun..

[3]  Ashutosh Tiwari,et al.  Computer assisted customer churn management: State-of-the-art and future trends , 2007, Comput. Oper. Res..

[4]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[5]  Sougata Mukherjea,et al.  Social ties and their relevance to churn in mobile telecom networks , 2008, EDBT '08.

[6]  Jure Leskovec,et al.  No country for old members: user lifecycle and linguistic change in online communities , 2013, WWW.

[7]  Vagelis Hristidis,et al.  Pharmaceutical drugs chatter on Online Social Networks , 2014, J. Biomed. Informatics.

[8]  Cliff Lampe,et al.  Follow the (slash) dot: effects of feedback on new members in an online community , 2005, GROUP.

[9]  Peter Pirolli,et al.  Why You Are More Engaged: Factors Influencing Twitter Engagement in Occupy Wall Street , 2012, ICWSM.

[10]  Eric Johnson,et al.  Churn Reduction in the Wireless Industry , 1999, NIPS.

[11]  Carolyn Penstein Rosé,et al.  Talk to me: foundations for successful individual-group interactions in online communities , 2006, CHI.

[12]  Graciela Gonzalez-Hernandez,et al.  Utilizing social media data for pharmacovigilance: A review , 2015, J. Biomed. Informatics.

[13]  Jeffrey Nichols,et al.  Why Are You More Engaged? Predicting Social Engagement from Word Use , 2014, ArXiv.

[14]  Maurice Tchuente,et al.  Churn Prediction in a Real Online Social Network Using Local CommunIty Analysis , 2012, 2012 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining.