Inferring Smoking Status from User Generated Content in an Online Cessation Community

Introduction User generated content (UGC) is a valuable but underutilized source of information about individuals who participate in online cessation interventions. This study represents a first effort to passively detect smoking status among members of an online cessation program using UGC. Methods Secondary data analysis was performed on data from 826 participants in a web-based smoking cessation randomized trial that included an online community. Domain experts from the online community reviewed each post and comment written by participants and attempted to infer the author's smoking status at the time it was written. Inferences from UGC were validated by comparison with self-reported 30-day point prevalence abstinence (PPA). Following validation, the impact of this method was evaluated across all individuals and time points in the study period. Results Of the 826 participants in the analytic sample, 719 had written at least one post from which content inference was possible. Among participants for whom unambiguous smoking status was inferred during the 30 days preceding their 3-month follow-up survey, concordance with self-report was almost perfect (kappa = 0.94). Posts indicating abstinence tended to be written shortly after enrollment (median = 14 days). Conclusions Passive inference of smoking status from UGC in online cessation communities is possible and highly reliable for smokers who actively produce content. These results lay the groundwork for further development of observational research tools and intervention innovations. Implications A proof-of-concept methodology for inferring smoking status from user generated content in online cessation communities is presented and validated. Content inference of smoking status makes a key cessation variable available for use in observational designs. This method provides a powerful tool for researchers interested in online cessation interventions and establishes a foundation for larger scale application via machine learning.

[1]  Suzanne Pingree,et al.  Effects of Insightful Disclosure Within Computer Mediated Support Groups on Women With Breast Cancer , 2006, Health communication.

[2]  J. R. Landis,et al.  The measurement of observer agreement for categorical data. , 1977, Biometrics.

[3]  P. Mabry,et al.  Boosting population quits through evidence-based cessation treatment and policy. , 2010, American journal of preventive medicine.

[4]  Jennifer L Pearson,et al.  A Multirelational Social Network Analysis of an Online Health Community for Smoking Cessation , 2016, Journal of medical Internet research.

[5]  Vincent Baujard,et al.  A qualitative analysis of an internet discussion forum for recent ex-smokers. , 2006, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[6]  David B Abrams,et al.  Improving Adherence to Smoking Cessation Treatment: Intervention Effects in a Web-Based Randomized Trial , 2016, Nicotine & tobacco research : official journal of the Society for Research on Nicotine and Tobacco.

[7]  R. Niaura,et al.  Baseline Characteristics and Generalizability of Participants in an Internet Smoking Cessation Randomized Trial , 2016, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[8]  Joseph A Cafazzo,et al.  Beyond the Randomized Controlled Trial: A Review of Alternatives in mHealth Clinical Trial Methods , 2016, JMIR mHealth and uHealth.

[9]  Trevor Cohen,et al.  Content-driven analysis of an online community for smoking cessation: integration of qualitative techniques, automated text analysis, and affiliation networks. , 2015, American journal of public health.

[10]  Richard J. Cook,et al.  Kappa and Its Dependence on Marginal Rates , 2005 .

[11]  Nathan K. Cobb,et al.  Improving adherence to web-based cessation programs: a randomized controlled trial study protocol , 2013, Trials.

[12]  Amanda L. Graham,et al.  Systematic review and meta-analysis of Internet interventions for smoking cessation among adults , 2016, Substance abuse and rehabilitation.

[13]  Sven Van Poucke,et al.  Are Randomized Controlled Trials the (G)old Standard? From Clinical Intelligence to Prescriptive Analytics , 2016, Journal of medical Internet research.

[14]  Daniel Parent,et al.  Online Social and Professional Support for Smokers Trying to Quit: An Exploration of First Time Posts From 2562 Members , 2010, Journal of medical Internet research.

[15]  Kristen Campbell Eichhorn,et al.  Soliciting and Providing Social Support Over the Internet: An Investigation of Online Eating Disorder Support Groups , 2008, J. Comput. Mediat. Commun..

[16]  N. Coulson,et al.  Social support in cyberspace: a content analysis of communication within a Huntington's disease online support group. , 2007, Patient education and counseling.

[17]  M. Fiore,et al.  Treating tobacco use and dependence: 2008 update U.S. Public Health Service Clinical Practice Guideline executive summary. , 2008, Respiratory care.

[18]  Jillian T Henderson,et al.  Behavioral Counseling and Pharmacotherapy Interventions for Tobacco Cessation in Adults, Including Pregnant Women: A Review of Reviews for the U.S. Preventive Services Task Force , 2015, Annals of Internal Medicine.

[19]  Jan P Vandenbroucke,et al.  Observational Research, Randomised Trials, and Two Views of Medical Science , 2008, PLoS medicine.

[20]  Rolf Wynn,et al.  Language of motivation and emotion in an internet support group for smoking cessation: explorative use of automated content analysis to measure regulatory focus , 2014, Psychology research and behavior management.

[21]  A. Sheikh,et al.  Internet-based interventions for smoking cessation. , 2010, The Cochrane database of systematic reviews.

[22]  Jae-Wook Song,et al.  Observational Studies: Cohort and Case-Control Studies , 2010, Plastic and reconstructive surgery.

[23]  Ming Liu,et al.  An analysis of social support exchanges in online HIV/AIDS self-help groups , 2009, Comput. Hum. Behav..

[24]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[25]  B. Rimer,et al.  How Cancer Survivors Provide Support on Cancer-Related Internet Mailing Lists , 2007, Journal of medical Internet research.

[26]  Trevor van Mierlo,et al.  An online support group for problem drinkers: AlcoholHelpCenter.net. , 2008, Patient education and counseling.

[27]  D. Vallone,et al.  Evaluation of EX: a national mass media smoking cessation campaign. , 2011, American journal of public health.

[28]  Andy McEwen,et al.  Online support for smoking cessation: a systematic review of the literature. , 2009, Addiction.

[29]  T. Thompson,et al.  Use and effectiveness of quitlines versus Web‐based tobacco cessation interventions among 4 state tobacco control programs , 2016, Cancer.