Stroke Outcome Measurements From Electronic Medical Records: Cross-sectional Study on the Effectiveness of Neural and Nonneural Classifiers

Background With the rapid adoption of electronic medical records (EMRs), there is an ever-increasing opportunity to collect data and extract knowledge from EMRs to support patient-centered stroke management. Objective This study aims to compare the effectiveness of state-of-the-art automatic text classification methods in classifying data to support the prediction of clinical patient outcomes and the extraction of patient characteristics from EMRs. Methods Our study addressed the computational problems of information extraction and automatic text classification. We identified essential tasks to be considered in an ischemic stroke value-based program. The 30 selected tasks were classified (manually labeled by specialists) according to the following value agenda: tier 1 (achieved health care status), tier 2 (recovery process), care related (clinical management and risk scores), and baseline characteristics. The analyzed data set was retrospectively extracted from the EMRs of patients with stroke from a private Brazilian hospital between 2018 and 2019. A total of 44,206 sentences from free-text medical records in Portuguese were used to train and develop 10 supervised computational machine learning methods, including state-of-the-art neural and nonneural methods, along with ontological rules. As an experimental protocol, we used a 5-fold cross-validation procedure repeated 6 times, along with subject-wise sampling. A heatmap was used to display comparative result analyses according to the best algorithmic effectiveness (F1 score), supported by statistical significance tests. A feature importance analysis was conducted to provide insights into the results. Results The top-performing models were support vector machines trained with lexical and semantic textual features, showing the importance of dealing with noise in EMR textual representations. The support vector machine models produced statistically superior results in 71% (17/24) of tasks, with an F1 score >80% regarding care-related tasks (patient treatment location, fall risk, thrombolytic therapy, and pressure ulcer risk), the process of recovery (ability to feed orally or ambulate and communicate), health care status achieved (mortality), and baseline characteristics (diabetes, obesity, dyslipidemia, and smoking status). Neural methods were largely outperformed by more traditional nonneural methods, given the characteristics of the data set. Ontological rules were also effective in tasks such as baseline characteristics (alcoholism, atrial fibrillation, and coronary artery disease) and the Rankin scale. The complementarity in effectiveness among models suggests that a combination of models could enhance the results and cover more tasks in the future. Conclusions Advances in information technology capacity are essential for scalability and agility in measuring health status outcomes. This study allowed us to measure effectiveness and identify opportunities for automating the classification of outcomes of specific tasks related to clinical conditions of stroke victims, and thus ultimately assess the possibility of proactively using these machine learning techniques in real-world situations.

[1]  Jussara M. Almeida,et al.  On the cost-effectiveness of neural and non-neural approaches and representations for text classification: A comprehensive comparative study , 2021, Inf. Process. Manag..

[2]  Hong-Jie Dai,et al.  Family History Information Extraction With Neural Attention and an Enhanced Relation-Side Scheme: Algorithm Development and Validation , 2020, JMIR medical informatics.

[3]  Haoqi Sun,et al.  Classification of the Disposition of Patients Hospitalized with COVID-19: Reading Discharge Summaries Using Natural Language Processing , 2020, JMIR medical informatics.

[4]  Soo-Yong Shin,et al.  Federated Learning on Clinical Benchmark Data: Performance Assessment , 2020, Journal of medical Internet research.

[5]  J. Schneider,et al.  Natural Language Processing of Clinical Notes to Identify Mental Illness and Substance Use Among People Living with HIV: Retrospective Cohort Study , 2020, JMIR medical informatics.

[6]  F. Zúñiga,et al.  Electronic Health Record Use in Swiss Nursing Homes and Its Association With Implicit Rationing of Nursing Care Documentation: Multicenter Cross-sectional Survey Study , 2020, JMIR medical informatics.

[7]  Rohit J. Kate Clinical Term Normalization Using Learned Edit Patterns and Subconcept Matching: System Development and Evaluation , 2020, JMIR medical informatics.

[8]  Zhuo Wang,et al.  Model-Based Reasoning of Clinical Diagnosis in Integrative Medicine: Real-World Methodological Study of Electronic Medical Records and Natural Language Processing Methods , 2020, JMIR medical informatics.

[9]  Danilo Silva,et al.  Predicting Multiple ICD-10 Codes from Brazilian-Portuguese Clinical Notes , 2020, BRACIS.

[10]  P. Bonnabry,et al.  Improving Primary Care Medication Processes by Using Shared Electronic Medication Plans in Switzerland: Lessons Learned From a Participatory Action Research Study , 2020, JMIR formative research.

[11]  Marcos André Gonçalves,et al.  Extended pre-processing pipeline for text classification: On the role of meta-feature representations, sparsification and selective sampling , 2020, Inf. Process. Manag..

[12]  Jun Cai,et al.  An Application of Machine Learning to Etiological Diagnosis of Secondary Hypertension: Retrospective Study Using Electronic Medical Records , 2020, JMIR medical informatics.

[13]  Eric Horvitz,et al.  Predicting severe clinical events by learning about life-saving actions and outcomes using distant supervision , 2020, J. Biomed. Informatics.

[14]  C. Wolfe,et al.  A systematic review of machine learning models for predicting outcomes of stroke with structured data , 2020, PloS one.

[15]  Emily Kogan,et al.  Assessing stroke severity using electronic health record data: a machine learning approach , 2020, BMC Medical Informatics and Decision Making.

[16]  H. Toyoshiba,et al.  Predicting Inpatient Falls Using Natural Language Processing of Nursing Records Obtained From Japanese Electronic Medical Records: Case-Control Study , 2019, JMIR medical informatics.

[17]  Thierson Couto,et al.  Similarity-Based Synthetic Document Representations for Meta-Feature Generation in Text Classification , 2019, SIGIR.

[18]  Fares Alahdab,et al.  Global, regional, and national burden of stroke, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016 , 2019, The Lancet Neurology.

[19]  Introduction to Information Retrieval , 2019, J. Assoc. Inf. Sci. Technol..

[20]  Donald E. Brown,et al.  Text Classification Algorithms: A Survey , 2019, Inf..

[21]  Dongsong Zhang,et al.  A Stroke Risk Detection: Improving Hybrid Feature Selection Method , 2019, Journal of medical Internet research.

[22]  Nigam H. Shah,et al.  Using natural language processing to construct a metastatic breast cancer cohort from linked cancer registry and electronic medical records data , 2019, JAMIA open.

[23]  Marcos André Gonçalves,et al.  A Thorough Evaluation of Distance-Based Meta-Features for Automated Text Classification , 2018, IEEE Transactions on Knowledge and Data Engineering.

[24]  Mohammad Mehdi Sepehri,et al.  Toward analyzing and synthesizing previous research in early prediction of cardiac arrest using machine learning based on a multi-layered integrative framework , 2018, J. Biomed. Informatics.

[25]  Sunil A. Sheth,et al.  Machine Learning in Acute Ischemic Stroke Neuroimaging , 2018, Front. Neurol..

[26]  Maria Liakata,et al.  Using clinical Natural Language Processing for health outcomes research: Overview and actionable suggestions for future advances , 2018, J. Biomed. Informatics.

[27]  Lisa M. Nordan,et al.  Lessons from Mayo Clinic's Redesign of Stroke Care , 2018 .

[28]  Eduardo G. Cortes,et al.  When, Where, Who, What or Why? A Hybrid Model to Question Answering Systems , 2018, PROPOR.

[29]  Marcos André Gonçalves,et al.  Improving random forests by neighborhood projection for effective text classification , 2018, Inf. Syst..

[30]  Marcos André Gonçalves,et al.  Exploiting efficient and effective lazy Semi-Bayesian strategies for text classification , 2018, Neurocomputing.

[31]  Iryna Gurevych,et al.  Integrating Knowledge-Supported Search into the INCEpTION Annotation Platform , 2018, EMNLP.

[32]  Iryna Gurevych,et al.  The INCEpTION Platform: Machine-Assisted and Knowledge-Oriented Interactive Annotation , 2018, COLING.

[33]  Takuma Shibahara,et al.  A machine learning model to predict the risk of 30-day readmissions in patients with heart failure: a retrospective analysis of electronic medical records data , 2018, BMC Medical Informatics and Decision Making.

[34]  I. Kohane,et al.  Big Data and Machine Learning in Health Care. , 2018, JAMA.

[35]  Vassilis P. Plagianakos,et al.  Convolutional Neural Networks for Toxic Comment Classification , 2018, SETN.

[36]  G. Gobbel,et al.  Automating Quality Measures for Heart Failure Using Natural Language Processing: A Descriptive Study in the Department of Veterans Affairs , 2018, JMIR medical informatics.

[37]  Marcus A. Badgeley,et al.  Deep learning guided stroke management: a review of clinical applications , 2017, Journal of NeuroInterventional Surgery.

[38]  Namkug Kim,et al.  Deep into the Brain: Artificial Intelligence in Stroke Imaging , 2017, Journal of stroke.

[39]  Sunghwan Sohn,et al.  Application of a Natural Language Processing Algorithm to Asthma Ascertainment. An Automated Chart Review , 2017, American journal of respiratory and critical care medicine.

[40]  David Sontag,et al.  Learning a Health Knowledge Graph from Electronic Medical Records , 2017, Scientific Reports.

[41]  Tong Zhang,et al.  Deep Pyramid Convolutional Neural Networks for Text Categorization , 2017, ACL.

[42]  Mohammad Ali Zare Chahooki,et al.  A Survey on semi-supervised feature selection methods , 2017, Pattern Recognit..

[43]  Herbert S. Chase,et al.  Early recognition of multiple sclerosis using natural language processing of the electronic health record , 2017, BMC Medical Informatics and Decision Making.

[44]  Fernando Nogueira,et al.  Imbalanced-learn: A Python Toolbox to Tackle the Curse of Imbalanced Datasets in Machine Learning , 2016, J. Mach. Learn. Res..

[45]  V. Feigin,et al.  Stroke is largely preventable across the globe: where to next? , 2016, The Lancet.

[46]  Ye Fei,et al.  Simultaneous Support Vector selection and parameter optimization using Support Vector Machines for sentiment classification , 2016, 2016 7th IEEE International Conference on Software Engineering and Service Science (ICSESS).

[47]  David W. Bates,et al.  Use of electronic healthcare records to identify complex patients with atrial fibrillation for targeted intervention , 2016, J. Am. Medical Informatics Assoc..

[48]  Nora El-Gohary,et al.  Ontology-Based Multilabel Text Classification of Construction Regulatory Documents , 2016, J. Comput. Civ. Eng..

[49]  N. Adler,et al.  Using Electronic Health Records for Population Health Research: A Review of Methods and Applications. , 2016, Annual review of public health.

[50]  M. Porter,et al.  Standardizing Patient Outcomes Measurement. , 2016, The New England journal of medicine.

[51]  Hongfang Liu,et al.  Identifying peripheral arterial disease cases using natural language processing of clinical notes , 2016, 2016 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI).

[52]  Walter P. Wodchis,et al.  A 3-year study of high-cost users of health care , 2016, Canadian Medical Association Journal.

[53]  I. Kohane,et al.  Methods to Develop an Electronic Medical Record Phenotype Algorithm to Compare the Risk of Coronary Artery Disease across 3 Chronic Disease Cohorts , 2015, PloS one.

[54]  Takaya Saito,et al.  The Precision-Recall Plot Is More Informative than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets , 2015, PloS one.

[55]  C. Kruse,et al.  Adoption factors associated with electronic health record among long-term care facilities: a systematic review , 2015, BMJ Open.

[56]  Thierson Couto,et al.  On Efficient Meta-Level Features for Effective Text Classification , 2014, CIKM.

[57]  Ken-Yu Lin,et al.  Using ontology-based text classification to assist Job Hazard Analysis , 2014, Adv. Eng. Informatics.

[58]  Krys J. Kochut,et al.  Ontology-Based Text Classification into Dynamically Defined Topics , 2014, 2014 IEEE International Conference on Semantic Computing.

[59]  Saad Rais,et al.  Predicting patients with high risk of becoming high-cost healthcare users in Ontario (Canada). , 2014, Healthcare policy = Politiques de sante.

[60]  M. Porter,et al.  The Strategy That Will Fix Health Care , 2013 .

[61]  Cynthia Brandt,et al.  Ontology-guided feature engineering for clinical text classification , 2012, J. Biomed. Informatics.

[62]  Fei Wang,et al.  Case-based reasoning in comparative effectiveness research , 2012, IBM J. Res. Dev..

[63]  Sean T. Green,et al.  Random forests for verbal autopsy analysis: multisite validation study using clinical diagnostic gold standards , 2011, Population health metrics.

[64]  Thomas H. Lee,et al.  Putting the value framework to work. , 2010, The New England journal of medicine.

[65]  D. Blumenthal,et al.  The "meaningful use" regulation for electronic health records. , 2010, The New England journal of medicine.

[66]  Jason Roy,et al.  Prediction Modeling Using EHR Data: Challenges, Strategies, and a Comparison of Machine Learning Approaches , 2010, Medical care.

[67]  Der-Chiang Li,et al.  A learning method for the class imbalance problem with medical data sets , 2010, Comput. Biol. Medicine.

[68]  Gavin C. Cawley,et al.  On Over-fitting in Model Selection and Subsequent Selection Bias in Performance Evaluation , 2010, J. Mach. Learn. Res..

[69]  V. Osmani,et al.  Treatment targeted at underlying disease versus palliative care in terminally ill patients: a systematic review , 2008, BMJ Open.

[70]  A. Viera,et al.  Understanding interobserver agreement: the kappa statistic. , 2005, Family medicine.

[71]  Asunción Gómez-Pérez,et al.  Ontological Engineering: With Examples from the Areas of Knowledge Management, e-Commerce and the Semantic Web , 2004, Advanced Information and Knowledge Processing.

[72]  Marcello Peixoto Bax,et al.  Uma visão geral sobre ontologias: pesquisa sobre definições, tipos, aplicações, métodos de avaliação e de construção , 2003 .

[73]  Yujing Shen,et al.  Selection incentives in a performance-based contracting system. , 2003, Health services research.

[74]  Hussein A. Abbass,et al.  A Comparative Study for Domain Ontology Guided Feature Extraction , 2003, ACSC.

[75]  U. Schulz,et al.  Improving the Assessment of Outcomes in Stroke: Use of a Structured Interview to Assign Grades on the Modified Rankin Scale , 2002, Stroke.

[76]  Malik Yousef,et al.  One-Class SVMs for Document Classification , 2002, J. Mach. Learn. Res..

[77]  Thomas A. Darden,et al.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method , 2001, Bioinform..

[78]  P D Lyden,et al.  A Modified National Institutes of Health Stroke Scale for Use in Stroke Clinical Trials: Preliminary Reliability and Validity , 2001, Stroke.

[79]  P. Kantor Foundations of Statistical Natural Language Processing , 2001, Information Retrieval.

[80]  George Karypis,et al.  Centroid-Based Document Classification: Analysis and Experimental Results , 2000, PKDD.

[81]  Gerard Salton,et al.  Term-Weighting Approaches in Automatic Text Retrieval , 1988, Inf. Process. Manag..

[82]  Marcos André Gonçalves,et al.  Combining Representations For Effective Citation Classification , 2020, WOSP.

[83]  Ming-Wei Chang,et al.  BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding , 2019, NAACL.

[84]  Hongfang Liu,et al.  Clinical information extraction applications: A literature review , 2018, J. Biomed. Informatics.

[85]  V. Caso,et al.  Improving stroke care across the world : the ANGELS Initiative , 2018 .

[86]  David A. Dorr,et al.  A Mixed Methods Task Analysis of the Implementation and Validation of EHR-Based Clinical Quality Measures , 2016, AMIA.

[87]  John Henderson,et al.  Incorporating longitudinal pediatric patient-centered outcome measurement into the clinical workflow using a commercial electronic health record: a step toward increasing value for the patient , 2016, J. Am. Medical Informatics Assoc..

[88]  Siti Mariyam Shamsuddin,et al.  Classification with class imbalance problem: A review , 2015, SOCO 2015.

[89]  Nitesh V. Chawla,et al.  Data Mining for Imbalanced Datasets: An Overview , 2005, The Data Mining and Knowledge Discovery Handbook.

[90]  L. Breiman Random Forests , 2001, Machine Learning.

[91]  J. H. Zar,et al.  Biostatistical Analysis (5th Edition) , 1984 .