Use of a Machine Learning Program to Correctly Triage Incoming Text Messaging Replies From a Cardiovascular Text–Based Secondary Prevention Program: Feasibility Study

Background: SMS text messaging programs are increasingly being used for secondary prevention and have been shown to be effective in a number of health conditions, including cardiovascular disease. They have the potential to extend the reach of an intervention, at reduced cost, to larger numbers of people who may not access traditional programs. However, patients regularly reply to the SMS text messages, which creates additional staffing requirements to monitor and moderate the patients’ replies. This additional staffing requirement directly affects the cost-effectiveness and scalability of SMS text messaging interventions.

Objective: This study aimed to test the feasibility and accuracy of developing a machine learning (ML) program to triage SMS text messaging replies (ie, identify which replies require a health professional review).

Methods: SMS text messaging replies received from 2 clinical trials were manually coded (1) into “Is staff review required?” (binary response of yes/no); and then (2) into 12 general categories. Five ML models (Naïve Bayes, OneVsRest, Random Forest Decision Trees, Gradient Boosted Trees, and Multilayer Perceptron) and an ensemble model were tested. For each model run, data were randomly allocated into a training set (2183/3118, 70.01%) and a test set (935/3118, 29.99%). Accuracy for the yes/no classification was calculated using area under the receiver operating characteristic curve (AUC), false positives, and false negatives. Accuracy for classification into 12 categories was compared using multiclass classification evaluators.

Results: A manual review of 3118 SMS text messaging replies showed that 22.00% (686/3118) required staff review. For determining the need for staff review, the Multilayer Perceptron model had the highest accuracy (AUC 0.86; 4.85% false negatives; 4.63% false positives). With the addition of heuristics (specified keywords), fewer false negatives were identified (3.19%), with a small increase in false positives (7.66%) and an AUC of 0.79. Application of this model would result in 26.7% of SMS text messaging replies requiring review (true + false positives). The ensemble model produced the lowest false negatives (1.43%) at the expense of higher false positives (16.19%). OneVsRest was the most accurate (72.3%) for the 12-category classification.

Conclusions: The ML program has high sensitivity for identifying the SMS text messaging replies requiring staff input; however, future research is required to validate the models against larger data sets. Incorporating an ML program to review SMS text messaging replies could significantly reduce staff workload, as staff would not have to review all incoming SMS text messages. This could lead to substantial improvements in the cost-effectiveness, scalability, and capacity of SMS text messaging–based interventions.
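
The way the reported triage measures relate to one another can be illustrated with a minimal Python sketch. It is not the study's implementation: the keyword list, the function names, and the toy replies are hypothetical, and the sketch assumes that false negative and false positive percentages are expressed as a share of all replies, which the abstract does not state explicitly. It shows a keyword heuristic layered on top of a model's yes/no flag, and the review workload computed as true positives plus false positives.

```python
from dataclasses import dataclass

# Hypothetical keywords; the study's actual keyword list is not given here.
REVIEW_KEYWORDS = ("pain", "hospital", "stop")


def keyword_override(text, model_flag, keywords=REVIEW_KEYWORDS):
    """Flag a reply if the model flagged it OR any keyword appears in it.

    Simple substring matching is used for brevity; it can only add flags,
    so it may lower false negatives at the cost of more false positives.
    """
    lowered = text.lower()
    return model_flag or any(word in lowered for word in keywords)


@dataclass
class TriageRates:
    false_negative_pct: float   # needed review but was not flagged
    false_positive_pct: float   # flagged but did not need review
    review_workload_pct: float  # all flagged replies (true + false positives)


def triage_rates(needs_review, flagged):
    """Compute rates as percentages of ALL replies (an assumption)."""
    n = len(needs_review)
    if n == 0 or n != len(flagged):
        raise ValueError("inputs must be non-empty and of equal length")
    pairs = list(zip(needs_review, flagged))
    fn = sum(1 for truth, pred in pairs if truth and not pred)
    fp = sum(1 for truth, pred in pairs if pred and not truth)
    tp = sum(1 for truth, pred in pairs if truth and pred)
    return TriageRates(100 * fn / n, 100 * fp / n, 100 * (tp + fp) / n)


# Toy data: (reply text, manual "review required?" label, model's raw flag).
toy_replies = [
    ("Thanks!", False, False),
    ("I have chest pain", True, False),   # missed by model, caught by keyword
    ("STOP", True, True),
    ("ok", False, True),                  # model false positive
    ("Which tablet should I take?", True, False),  # remains a false negative
]

labels = [label for _, label, _ in toy_replies]
final_flags = [keyword_override(text, flag) for text, _, flag in toy_replies]
rates = triage_rates(labels, final_flags)
print(rates)
```

In this toy set the keyword rule recovers one reply the model missed, which mirrors the trade-off described in the Results: heuristics can only add flags, so false negatives may fall while the share of replies sent for review rises.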
