HealthTrust: A Social Network Approach for Retrieving Online Health Videos

Background Social media are becoming mainstream in the health domain. Despite the large volume of accurate and trustworthy health information available on social media platforms, finding good-quality health information can be difficult. Misleading health information can often be popular (eg, antivaccination videos) and therefore highly rated by general search engines. We believe that community wisdom about the quality of health information can be harnessed to help create tools for retrieving good-quality social media content. Objectives To explore approaches for extracting metrics about authoritativeness in online health communities and how these metrics positively correlate with the quality of the content. Methods We designed a metric, called HealthTrust, that estimates the trustworthiness of social media content (eg, blog posts or videos) in a health community. The HealthTrust metric calculates reputation in an online health community based on link analysis. We used the metric to retrieve YouTube videos and channels about diabetes. In two different experiments, health consumers provided 427 ratings of 17 videos and professionals gave 162 ratings of 23 videos. In addition, two professionals reviewed 30 diabetes channels. Results HealthTrust may be used for retrieving online videos on diabetes, since it performed better than YouTube Search in most cases. Overall, of 20 potential channels, HealthTrust’s filtering allowed only 3 bad channels (15%) versus 8 (40%) on the YouTube list. Misleading and graphic videos (eg, featuring amputations) were more commonly found by YouTube Search than by searches based on HealthTrust. However, some videos from trusted sources had low HealthTrust scores, mostly from general health content providers, and therefore not highly connected in the diabetes community. When comparing video ratings from our reviewers, we found that HealthTrust achieved a positive and statistically significant correlation with professionals (Pearson r 10 = .65, P = .02) and a trend toward significance with health consumers (r 7 = .65, P = .06) with videos on hemoglobinA1 c, but it did not perform as well with diabetic foot videos. Conclusions The trust-based metric HealthTrust showed promising results when used to retrieve diabetes content from YouTube. Our research indicates that social network analysis may be used to identify trustworthy social media in health communities.

[1]  E. Bernstam,et al.  Accuracy and self correction of information received from an internet breast cancer list: content analysis , 2006, BMJ : British Medical Journal.

[2]  D. Lorence,et al.  Setting the Public Agenda for Online Health Search: A White Paper and Action Agenda , 2004, Journal of medical Internet research.

[3]  Xiaolong Zhang,et al.  SNDocRank: document ranking based on social networks , 2010, WWW '10.

[4]  Peter R. Harris,et al.  Perceived Threat and Corroboration: Key Factors That Improve a Predictive Model of Trust in Internet-based Health Information and Advice , 2011, Journal of Medical Internet Research.

[5]  Vangelis Karkaletsis,et al.  Applying Semantic Web technologies to improve the retrieval, credibility and use of health-related web resources , 2011, Health Informatics J..

[6]  G Eysenbach,et al.  Experience in the use of social media in medical and health education. Contribution of the IMIA Social Media Working Group. , 2011, Yearbook of medical informatics.

[7]  Timo Honkela,et al.  MedIEQ-Quality labelling of medical web content using multilingual information extraction. , 2006, Studies in health technology and informatics.

[8]  P. Elkin,et al.  The Role of Social Media for Patients and Consumer Health , 2011, Yearbook of Medical Informatics.

[9]  Charles L. A. Clarke,et al.  Information Retrieval - Implementing and Evaluating Search Engines , 2010 .

[10]  A. Kaplan,et al.  Users of the world, unite! The challenges and opportunities of Social Media , 2010 .

[11]  J. Powell,et al.  Empirical studies assessing the quality of health information for consumers on the world wide web: a systematic review. , 2002, JAMA.

[12]  Stéfan Jacques Darmoni,et al.  MedCIRCLE: Collaboration for Internet Rating, Certification, Labelling and Evaluation of Health Information on the World-Wide-Web , 2003, MIE.

[13]  Yo-Sub Han,et al.  Evaluation of User Reputation on YouTube , 2009, HCI.

[14]  Hector Garcia-Molina,et al.  Combating Web Spam with TrustRank , 2004, VLDB.

[15]  Kurt Hornik,et al.  The Comprehensive R Archive Network , 2012 .

[16]  Shlomo Moran,et al.  The stochastic approach for link-structure analysis (SALSA) and the TKC effect , 2000, Comput. Networks.

[17]  Mark H. Chignell,et al.  Automatic detection of cohesive subgroups within social hypertext: A heuristic approach , 2008, New Rev. Hypermedia Multim..

[18]  Luis Fernández-Luque,et al.  An Analysis of Personal Medical Information Disclosed in YouTube Videos Created by Patients with Multiple Sclerosis , 2009, MIE.

[19]  J. A. Stockman Human Papillomavirus Vaccination Coverage on YouTube , 2010 .

[20]  Jaana Kekäläinen,et al.  Cumulated gain-based evaluation of IR techniques , 2002, TOIS.

[21]  Yu He,et al.  The YouTube video recommendation system , 2010, RecSys '10.

[22]  K. Boydell,et al.  Ana and the Internet: a review of pro-anorexia websites. , 2006, The International journal of eating disorders.

[23]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[24]  J. Rodgers,et al.  Thirteen ways to look at the correlation coefficient , 1988 .

[25]  A R Jadad,et al.  Rating health information on the Internet: navigating to knowledge or to Babel? , 1998, JAMA.

[26]  Genevieve B. Melton,et al.  HealthTrust: trust-based retrieval of you tube's diabetes channels , 2011, CIKM '11.

[27]  C. Chronaki,et al.  eHealth Trends in Europe 2005-2007: A Population-Based Survey , 2008, Journal of medical Internet research.

[28]  Jacob Cohen A Coefficient of Agreement for Nominal Scales , 1960 .

[29]  Krishna P. Gummadi,et al.  Measurement and analysis of online social networks , 2007, IMC '07.

[30]  Funda Meric-Bernstam,et al.  Efficacy of Quality Criteria to Identify Potentially Harmful Information: A Cross-sectional Survey of Complementary and Alternative Medicine Web Sites , 2004, Journal of medical Internet research.

[31]  Funda Meric-Bernstam,et al.  Commonly cited website quality criteria are not effective at identifying inaccurate online information about breast cancer , 2008, Cancer.

[32]  Alejandro R Jadad,et al.  Examination of instruments used to rate quality of health information on the internet: chronicle of a voyage with an unclear destination , 2002, BMJ : British Medical Journal.