HealthRecSys: A semantic content-based recommender system to complement health videos

BackgroundThe Internet, and its popularity, continues to grow at an unprecedented pace. Watching videos online is very popular; it is estimated that 500 h of video are uploaded onto YouTube, a video-sharing service, every minute and that, by 2019, video formats will comprise more than 80% of Internet traffic. Health-related videos are very popular on YouTube, but their quality is always a matter of concern. One approach to enhancing the quality of online videos is to provide additional educational health content, such as websites, to support health consumers. This study investigates the feasibility of building a content-based recommender system that links health consumers to reputable health educational websites from MedlinePlus for a given health video from YouTube.MethodsThe dataset for this study includes a collection of health-related videos and their available metadata. Semantic technologies (such as SNOMED-CT and Bio-ontology) were used to recommend health websites from MedlinePlus. A total of 26 healths professionals participated in evaluating 253 recommended links for a total of 53 videos about general health, hypertension, or diabetes. The relevance of the recommended health websites from MedlinePlus to the videos was measured using information retrieval metrics such as the normalized discounted cumulative gain and precision at K.ResultsThe majority of websites recommended by our system for health videos were relevant, based on ratings by health professionals. The normalized discounted cumulative gain was between 46% and 90% for the different topics.ConclusionsOur study demonstrates the feasibility of using a semantic content-based recommender system to enrich YouTube health videos. Evaluation with end-users, in addition to healthcare professionals, will be required to identify the acceptance of these recommendations in a nonsimulated information-seeking context.

[1]  Barry Smyth,et al.  Case-based recommender systems , 2005, The Knowledge Engineering Review.

[2]  C L Sanchez-Bocanegra,et al.  Introduction on health recommender systems. , 2015, Methods in molecular biology.

[3]  Sharon Straus,et al.  Managing evidence-based knowledge: the need for reliable, relevant and readable resources , 2009, Canadian Medical Association Journal.

[4]  Alla Keselman,et al.  Term Identification Methods for Consumer Health Vocabulary Development , 2007, Journal of medical Internet research.

[5]  Andreas Hotho,et al.  Towards Semantic Web Mining , 2002, SEMWEB.

[6]  Chris Fox,et al.  The Handbook of Computational Linguistics and Natural Language Processing , 2010 .

[7]  Katrien Verbert,et al.  Recommender Systems for Health Informatics: State-of-the-Art and Future Perspectives , 2016, Machine Learning for Health Informatics.

[8]  Michael D. Ekstrand,et al.  First Do No Harm: Considering and Minimizing Harm in Recommender Systems Designed for Engendering Health , 2016 .

[9]  Joseph Sharit,et al.  Seeking and Resolving Complex Online Health Information , 2016 .

[10]  S. Brunak,et al.  Mining electronic health records: towards better research applications and clinical care , 2012, Nature Reviews Genetics.

[11]  Gediminas Adomavicius,et al.  Toward the next generation of recommender systems: a survey of the state-of-the-art and possible extensions , 2005, IEEE Transactions on Knowledge and Data Engineering.

[12]  J. Clore,et al.  Health Information Seeking, Receipt, and Use in Diabetes Self-Management , 2010, The Annals of Family Medicine.

[13]  Daniel Lewis,et al.  What is web 2.0? , 2006, CROS.

[14]  Albert Y. Zomaya,et al.  A Survey of Mobile Device Virtualization , 2016, ACM Comput. Surv..

[15]  Luis Fernandez-Luque,et al.  Identifying Measures Used for Assessing Quality of YouTube Videos with Patient Health Information: A Review of Current Literature , 2013, Interactive journal of medical research.

[16]  Tim Berners-Lee,et al.  Publishing on the semantic web , 2001, Nature.

[17]  Haihua Xu,et al.  NLP based congestive heart failure case finding: A prospective analysis on statewide electronic medical records , 2015, Int. J. Medical Informatics.

[18]  Amr Jamal,et al.  Association of Online Health Information–Seeking Behavior and Self-Care Activities Among Type 2 Diabetic Patients in Saudi Arabia , 2015, Journal of medical Internet research.

[19]  Martin Wiesner,et al.  Adapting recommender systems to the requirements of personal health record systems , 2010, IHI.

[20]  Amber M. Reinhart,et al.  Health information seeking: a review of measures and methods. , 2011, Patient education and counseling.

[21]  David Heckerman,et al.  Empirical Analysis of Predictive Algorithms for Collaborative Filtering , 1998, UAI.

[22]  GarridoAntonio,et al.  On the use of case-based planning for e-learning personalization , 2016 .

[23]  Özlem Uzuner,et al.  Practical applications for natural language processing in clinical research: The 2014 i2b2/UTHealth shared tasks , 2015, J. Biomed. Informatics.

[24]  Mingrui Wu,et al.  Gradient descent optimization of smoothed information retrieval metrics , 2010, Information Retrieval.

[25]  Manuel Noguera,et al.  Nutrition for Elder Care: a nutritional semantic recommender system for the elderly , 2016, Expert Syst. J. Knowl. Eng..

[26]  Koichi Takeda,et al.  Information retrieval on the web , 2000, CSUR.

[27]  Harry J. P. Timmermans,et al.  Motivate: Towards context-aware recommendation mobile system for healthy living , 2011, 2011 5th International Conference on Pervasive Computing Technologies for Healthcare (PervasiveHealth) and Workshops.

[28]  Randi Karlsen,et al.  Challenges and Opportunities of Using Recommender Systems for Personalized Health Education , 2009, MIE.

[29]  Yanyan Li,et al.  Designing a Learning Recommender System by Incorporating Resource Association Analysis and Social Interaction Computing , 2016 .

[30]  R. J. Cline,et al.  Consumer health information seeking on the Internet: the state of the art. , 2001, Health education research.

[31]  Gerhard Friedrich,et al.  Constraint-Based Recommender Systems , 2015, Recommender Systems Handbook.

[32]  Fernando Ortega,et al.  A non negative matrix factorization for collaborative filtering recommender systems based on a Bayesian probabilistic model , 2016, Knowl. Based Syst..

[33]  Bambang Parmanto,et al.  Web Content Accessibility of Consumer Health Information Web Sites for People with Disabilities: A Cross Sectional Evaluation , 2004, Journal of medical Internet research.

[34]  Alejandro Rivero Rodriguez,et al.  A health information recommender system: Enriching YouTube health videos with Medline Plus information by the use of SnomedCT terms , 2013, Proceedings of the 26th IEEE International Symposium on Computer-Based Medical Systems.

[35]  Hua Xu,et al.  Facilitating pharmacogenetic studies using electronic health records and natural-language processing: a case study of warfarin , 2011, J. Am. Medical Informatics Assoc..

[36]  Bernd Ludwig,et al.  Engendering Health with Recommender Systems , 2016, RecSys.

[37]  Guilherme Del Fiol,et al.  Integrating Personalized Health Information from MedlinePlus in a Patient Portal , 2014, MIE.

[38]  Lucila Ohno-Machado,et al.  Natural language processing: an introduction , 2011, J. Am. Medical Informatics Assoc..

[39]  Annie Y. S. Lau,et al.  Research Paper: Can Cognitive Biases during Consumer Health Information Searches Be Reduced to Improve Decision Making? , 2009, J. Am. Medical Informatics Assoc..

[40]  Sarah L Cutrona,et al.  Collective-Intelligence Recommender Systems: Advancing Computer Tailoring for Health Behavior Change Into the 21st Century , 2016, Journal of medical Internet research.

[41]  Jean-Pierre Deschamps,et al.  QueFaire: Context-Aware in-Person Social Activity Recommendation System for Active Aging , 2015, ICOST.

[42]  S. Ziebland,et al.  Health and Illness in a Connected World: How Might Sharing Experiences on the Internet Affect People's Health? , 2012, The Milbank quarterly.

[43]  Tie-Yan Liu,et al.  A Theoretical Analysis of Normalized Discounted Cumulative Gain (NDCG) Ranking Measures , 2013 .

[44]  J. Sim,et al.  The kappa statistic in reliability studies: use, interpretation, and sample size requirements. , 2005, Physical therapy.

[45]  Bo Zhao,et al.  Learning Global Term Weights for Content-based Recommender Systems , 2016, WWW.

[46]  Francesco Ricci,et al.  A survey of active learning in collaborative filtering recommender systems , 2016, Comput. Sci. Rev..

[47]  Alejandro Rivero Rodriguez,et al.  Diavideos: a Diabetes Health Video Portal , 2013, MedInfo.

[48]  David A Asch,et al.  Use of Social Media Across US Hospitals: Descriptive Analysis of Adoption and Utilization , 2014, Journal of medical Internet research.

[49]  Martin Wiesner,et al.  Health Recommender Systems: Concepts, Requirements, Technical Basics and Challenges , 2014, International journal of environmental research and public health.

[50]  Luis Fernandez-Luque,et al.  HealthTrust: A Social Network Approach for Retrieving Online Health Videos , 2012, Journal of medical Internet research.

[51]  BhuiyanMd Zakirul Alam,et al.  Understanding Graph-Based Trust Evaluation in Online Social Networks , 2016 .

[52]  Fan Min,et al.  Three-way recommender systems based on random forests , 2016, Knowl. Based Syst..

[53]  Quang-Thuy Ha,et al.  Sentiment Analysis and User Similarity for Social Recommender System: An Experimental Study , 2016 .

[54]  Luis M. de Campos,et al.  Combining content-based and collaborative recommendations: A hybrid approach based on Bayesian networks , 2010, Int. J. Approx. Reason..

[55]  Gerd Stumme,et al.  The Role of Cores in Recommender Benchmarking for Social Bookmarking Systems , 2016, ACM Trans. Intell. Syst. Technol..

[56]  Charu C. Aggarwal,et al.  Social and Trust-Centric Recommender Systems , 2016 .

[57]  Hors-Fraile Santiago,et al.  Coupling Neuroscience Smoking Cessation Interventions with Social Media and Mobile Devices , 2016 .

[58]  Xia Zhao,et al.  Ontology Driven Personal Health Knowledge Discovery , 2015, KMO.

[59]  Ivan Serina,et al.  On the use of case-based planning for e-learning personalization , 2016, Expert Syst. Appl..

[60]  P. Dhavachelvan,et al.  Precision at K in Multilingual Information Retrieval , 2011 .

[61]  Andy Kill The direct-to-consumer genetics debate. , 2016, The Lancet. Oncology.