Finding the Patient’s Voice Using Big Data: Analysis of Users’ Health-Related Concerns in the ChaCha Question-and-Answer Service (2009–2012)

Background: The development of effective health care and public health interventions requires a comprehensive understanding of the perceptions, concerns, and stated needs of health care consumers and the public at large. Big datasets from social media and question-and-answer services provide insight into the public’s health concerns and priorities without the financial, temporal, and spatial encumbrances of more traditional community-engagement methods and may prove a useful starting point for public-engagement health research (infodemiology).

Objective: The objective of our study was to describe user characteristics and health-related queries of the ChaCha question-and-answer platform, and discuss how these data may be used to better understand the perceptions, concerns, and stated needs of health care consumers and the public at large.

Methods: We conducted a retrospective automated textual analysis of anonymous user-generated queries submitted to ChaCha between January 2009 and November 2012. A total of 2.004 billion queries were read, of which 3.50% (70,083,796/2,004,243,249) were missing 1 or more data fields, leaving 1.934 billion complete lines of data for these analyses.

Results: Males and females submitted roughly equal numbers of health queries, but content differed by sex. Questions from females predominantly focused on pregnancy, menstruation, and vaginal health. Questions from males predominantly focused on body image, drug use, and sexuality. Adolescents aged 12–19 years submitted more queries than any other age group. Their queries were largely centered on sexual and reproductive health, and pregnancy in particular.

Conclusions: The private nature of the ChaCha service provided a perfect environment for maximum frankness among users, especially among adolescents posing sensitive health questions. Adolescents’ sexual health queries reveal knowledge gaps with serious, lifelong consequences. The nature of questions to the service provides opportunities for rapid understanding of health concerns and may lead to development of more effective tailored interventions.
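The record counts reported in the Methods can be checked with a few lines of arithmetic. The sketch below is only an illustration: the variable names are ours, not the authors', and the two input counts are taken directly from the abstract.

```python
# Counts as reported in the Methods paragraph of the abstract.
total_queries = 2_004_243_249      # all queries read
incomplete_queries = 70_083_796    # queries missing 1 or more data fields

# Queries with every data field present, i.e. the analysis set.
complete_queries = total_queries - incomplete_queries

# Share of queries excluded for missing fields, as a percentage.
incomplete_pct = 100 * incomplete_queries / total_queries

print(f"complete queries: {complete_queries:,}")
print(f"excluded for missing fields: {incomplete_pct:.2f}%")
```

The difference is 1,934,159,453 complete records, which matches the reported 1.934 billion, and the excluded share rounds to the reported 3.50%.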
