Using Natural Language Processing Techniques to Provide Personalized Educational Materials for Chronic Disease Patients in China: Development and Assessment of a Knowledge-Based Health Recommender System

Background: Health education has emerged as an important intervention for improving the awareness and self-management abilities of chronic disease patients. The development of information technologies has changed the form of patient educational materials from traditional paper materials to electronic materials. A tremendous amount of patient educational material of variable quality is now available on the internet, which makes it hard for individuals lacking medical backgrounds to identify the most valuable materials.

Objective: The aim of this study was to develop a health recommender system that provides appropriate educational materials for chronic disease patients in China and to evaluate the effectiveness of this system.

Methods: A knowledge-based recommender system was implemented using an ontology and several natural language processing (NLP) techniques. The development process was divided into 3 stages. In stage 1, an ontology was constructed to describe the patient characteristics contained in the data. In stage 2, an algorithm was designed and implemented to generate recommendations based on the ontology. Patient data and educational materials were mapped to the ontology and converted into vectors of the same length, and recommendations were then generated according to the similarity between these vectors. In stage 3, the ontology and algorithm were incorporated into an mHealth system for practical use. Keyword extraction algorithms and pretrained word embeddings were used to preprocess the educational materials. Three strategies were proposed to improve the performance of keyword extraction. System evaluation was based on a manually assembled test collection of 50 patients and 100 educational documents. Recommendation performance was assessed using the macro precision of top-ranked documents and the overall mean average precision (MAP).

Results: The constructed ontology contained 40 classes, 31 object properties, 67 data properties, and 32 individuals. A total of 80 SWRL rules were defined to implement the semantic logic of mapping original patient data to the ontology vector space. The recommender system was implemented as a separate Web service connected with patients' smartphones. According to the evaluation results, the system achieved a macro precision of up to 0.970 for the top 1 recommendation and an overall MAP score of up to 0.628.

Conclusions: This study demonstrated that a knowledge-based health recommender system has the potential to accurately recommend educational materials to chronic disease patients. Traditional NLP techniques, combined with improvement strategies tailored to the specific language and domain, proved effective for improving system performance. One direction for future work is to explore the effect of such systems from the perspective of patients in a practical setting.
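To make the recommendation and evaluation steps concrete, the following is a minimal sketch and not the authors' implementation. It assumes cosine similarity as the vector similarity measure (the abstract says only "similarity between these vectors"), reads "macro precision of top-ranked documents" as precision at k averaged over patients, and uses three hypothetical ontology dimensions with toy patient and document vectors. All function names and data are illustrative assumptions.

```python
import math

def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)

def rank_documents(patient_vec, doc_vecs):
    """Document ids sorted by descending similarity; ties broken by id."""
    scores = {doc_id: cosine(patient_vec, vec) for doc_id, vec in doc_vecs.items()}
    return sorted(scores, key=lambda d: (-scores[d], d))

def precision_at_k(ranking, relevant, k):
    """Fraction of the top-k ranked documents that are relevant."""
    return sum(1 for d in ranking[:k] if d in relevant) / k

def average_precision(ranking, relevant):
    """Mean of precision values at each rank where a relevant document appears."""
    hits, total = 0, 0.0
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0

def macro_precision_at_k(rankings, relevance, k):
    """Precision at k averaged over patients (each patient weighted equally)."""
    return sum(precision_at_k(rankings[p], relevance[p], k) for p in rankings) / len(rankings)

def mean_average_precision(rankings, relevance):
    """Average precision averaged over patients."""
    return sum(average_precision(rankings[p], relevance[p]) for p in rankings) / len(rankings)

# Hypothetical ontology dimensions: [hypertension, diabetes, low_physical_activity]
doc_vecs = {"d1": [1, 0, 0], "d2": [0, 1, 0], "d3": [1, 0, 1]}
patient_vecs = {"p1": [1, 0, 1], "p2": [0, 1, 0]}
# Hypothetical manual relevance judgments, standing in for the test collection
relevance = {"p1": {"d1", "d3"}, "p2": {"d2", "d3"}}

rankings = {p: rank_documents(vec, doc_vecs) for p, vec in patient_vecs.items()}
print(rankings)
print(macro_precision_at_k(rankings, relevance, 1))
print(mean_average_precision(rankings, relevance))
```

In this sketch, both patients and documents live in the same ontology-derived vector space, so ranking reduces to a similarity sort, and the two metrics are computed per patient before averaging so that each patient counts equally.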
