Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review

OBJECTIVE Natural language processing (NLP) of symptoms from electronic health records (EHRs) could contribute to the advancement of symptom science. We aim to synthesize the literature on the use of NLP to process or analyze symptom information documented in EHR free-text narratives. MATERIALS AND METHODS Our search of 1964 records from PubMed and EMBASE was narrowed to 27 eligible articles. Data related to the purpose, free-text corpus, patients, symptoms, NLP methodology, evaluation metrics, and quality indicators were extracted for each study. RESULTS Symptom-related information was presented as a primary outcome in 14 studies. EHR narratives represented various inpatient and outpatient clinical specialties, with general, cardiology, and mental health occurring most frequently. Studies encompassed a wide variety of symptoms, including shortness of breath, pain, nausea, dizziness, disturbed sleep, constipation, and depressed mood. NLP approaches included previously developed NLP tools, classification methods, and manually curated rule-based processing. Only one-third (n = 9) of studies reported patient demographic characteristics. DISCUSSION NLP is used to extract information from EHR free-text narratives written by a variety of healthcare providers on an expansive range of symptoms across diverse clinical specialties. The current focus of this field is on the development of methods to extract symptom information and the use of symptom information for disease classification tasks rather than the examination of symptoms themselves. CONCLUSION Future NLP studies should concentrate on the investigation of symptoms and symptom documentation in EHR free-text narratives. Efforts should be undertaken to examine patient characteristics and make symptom-related NLP algorithms or pipelines and vocabularies openly available.

[1]  Adi V. Gundlapalli,et al.  General Symptom Extraction from VA Electronic Medical Notes , 2017, MedInfo.

[2]  Xiaoyan Wang,et al.  Active computerized pharmacovigilance using natural language processing, statistics, and electronic health records: a feasibility study. , 2009, Journal of the American Medical Informatics Association : JAMIA.

[3]  Goodarz Danaei,et al.  A Novel Model for Predicting Rehospitalization Risk Incorporating Physical Function, Cognitive Status, and Psychosocial Support Using Natural Language Processing , 2017, Medical care.

[4]  Herbert S. Chase,et al.  Early recognition of multiple sclerosis using natural language processing of the electronic health record , 2017, BMC Medical Informatics and Decision Making.

[5]  George Hripcsak,et al.  Automating a severity score guideline for community-acquired pneumonia employing medical language processing of discharge summaries , 1999, AMIA.

[6]  Jennifer A. Haythornthwaite,et al.  Longitudinal analysis of pain in patients with metastatic prostate cancer using natural language processing of medical record text , 2012, J. Am. Medical Informatics Assoc..

[7]  Brian A. Nosek,et al.  How open science helps researchers succeed , 2016, eLife.

[8]  Anna Rumshisky,et al.  Evaluating temporal relations in clinical text: 2012 i2b2 Challenge , 2013, J. Am. Medical Informatics Assoc..

[9]  Debbie A. Travers,et al.  Evaluation of preprocessing techniques for chief complaint classification , 2008, J. Biomed. Informatics.

[10]  R. Dobson,et al.  Natural language processing to extract symptoms of severe mental illness from clinical text: the Clinical Record Interactive Search Comprehensive Data Extraction (CRIS-CODE) project , 2017, BMJ Open.

[11]  Shuying Shen,et al.  2010 i2b2/VA challenge on concepts, assertions, and relations in clinical text , 2011, J. Am. Medical Informatics Assoc..

[12]  Rocco Casagrande,et al.  Advancing Symptom Science Through Symptom Cluster Research: Expert Panel Proceedings and Recommendations , 2017, Journal of the National Cancer Institute.

[13]  Hua Xu,et al.  Identifying risk factors for heart disease over time: Overview of 2014 i2b2/UTHealth shared task Track 2 , 2015, J. Biomed. Informatics.

[14]  Suzanne Bakken,et al.  Exploring the Ability of Natural Language Processing to Extract Data From Nursing Narratives , 2009, Computers, informatics, nursing : CIN.

[15]  John Mullooly,et al.  Detecting possible vaccine adverse events in clinical notes of the electronic medical record. , 2009, Vaccine.

[16]  Loes M. M. Braun,et al.  Natural Language Processing in Radiology: A Systematic Review. , 2016, Radiology.

[17]  I. Solti,et al.  Leveraging Food and Drug Administration Adverse Event Reports for the Automated Monitoring of Electronic Health Records in a Pediatric Hospital , 2017, Biomedical informatics insights.

[18]  W. Alkema,et al.  Application of text mining in the biomedical domain. , 2015, Methods.

[19]  Roland Eils,et al.  circlize implements and enhances circular visualization in R , 2014, Bioinform..

[20]  Shamkant B. Navathe,et al.  Identifying Patients with Depression Using Free-text Clinical Documents , 2015, MedInfo.

[21]  L. Ohno-Machado,et al.  “Big Data” and the Electronic Health Record , 2014, Yearbook of Medical Informatics.

[22]  E. Mohammadi,et al.  Barriers and facilitators related to the implementation of a physiological track and trigger system: A systematic review of the qualitative evidence , 2017, International journal for quality in health care : journal of the International Society for Quality in Health Care.

[23]  K. Kwekkeboom,et al.  Cancer Symptom Cluster Management. , 2016, Seminars in oncology nursing.

[24]  Sunghwan Sohn,et al.  Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications , 2010, J. Am. Medical Informatics Assoc..

[25]  Serguei V. S. Pakhomov,et al.  Epidemiology of angina pectoris: role of natural language processing of the medical record. , 2007, American heart journal.

[26]  Jimeng Sun,et al.  Automatic identification of heart failure diagnostic criteria, using text analysis of clinical notes from electronic health records , 2014, Int. J. Medical Informatics.

[27]  Shuying Shen,et al.  “Sitting on Pins and Needles”: Characterization of Symptom Descriptions in Clinical Notes” , 2013, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[28]  Wen-wai Yim,et al.  Natural Language Processing in Oncology: A Review. , 2016, JAMA oncology.

[29]  Donia Scott,et al.  Extracting information from the text of electronic medical records to improve case detection: a systematic review , 2016, J. Am. Medical Informatics Assoc..

[30]  Mick Watson When will ‘open science’ become simply ‘science’? , 2015, Genome Biology.

[31]  Michael O Harhay,et al.  Natural Language Processing to Assess Documentation of Features of Critical Illness in Discharge Documents of Acute Respiratory Distress Syndrome Survivors. , 2016, Annals of the American Thoracic Society.

[32]  Peter L. Elkin,et al.  Detection of infectious symptoms from VA emergency department and primary care clinical documentation , 2012, Int. J. Medical Informatics.

[33]  Suzanne Tamang,et al.  Detecting unplanned care from clinician notes in electronic health records. , 2015, Journal of oncology practice.

[34]  John D Seeger,et al.  Tolerability and Effectiveness of Exenatide Once Weekly Relative to Basal Insulin Among Type 2 Diabetes Patients of Different Races in Routine Care , 2017, Diabetes Therapy.

[35]  Chelsea Canan,et al.  Automatable algorithms to identify nonmedical opioid use using electronic data: a systematic review , 2017, J. Am. Medical Informatics Assoc..

[36]  Michele Filannino,et al.  A natural language processing challenge for clinical records: Research Domains Criteria (RDoC) for psychiatry. , 2017, Journal of biomedical informatics.

[37]  Guilherme Del Fiol,et al.  Text summarization in the biomedical domain: A systematic review of recent research , 2014, J. Biomed. Informatics.

[38]  Haipeng Shen,et al.  Artificial intelligence in healthcare: past, present and future , 2017, Stroke and Vascular Neurology.

[39]  Xiaoyan Wang,et al.  Automated Knowledge Acquisition from Clinical Narrative Reports , 2008, AMIA.

[40]  Serguei V. S. Pakhomov,et al.  Agreement between patient-reported symptoms and their documentation in the medical record. , 2008, The American journal of managed care.

[41]  Barbara J. Grosz,et al.  Natural-Language Processing , 1982, Artificial Intelligence.

[42]  Guangrong Li,et al.  Clinical Documents Clustering Based on Medication/Symptom Names Using Multi-View Nonnegative Matrix Factorization , 2015, IEEE Transactions on NanoBioscience.

[43]  Peter L. Elkin,et al.  Comparison of Natural Language Processing Biosurveillance Methods for Identifying Influenza From Encounter Notes , 2012, Annals of Internal Medicine.

[44]  Abhishek Pandey,et al.  Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review , 2017, J. Biomed. Informatics.

[45]  Hongfang Liu,et al.  Journal of Biomedical Informatics , 2022 .

[46]  Elizabeth S. Chen,et al.  Mining the electronic health record for disease knowledge. , 2014, Methods in molecular biology.

[47]  Annette DeVito Dabbs,et al.  Envisioning the future in symptom science. , 2014, Nursing outlook.

[48]  Shuying Shen,et al.  Application of Natural Language Processing to VA Electronic Health Records to Identify Phenotypic Characteristics for Clinical and Research Purposes , 2008, Summit on translational bioinformatics.

[49]  Guy Divita,et al.  Detecting the presence of an indwelling urinary catheter and urinary symptoms in hospitalized patients using natural language processing. , 2017, Journal of biomedical informatics.

[50]  Zina M. Ibrahim,et al.  ADEPt, a semantically-enriched pipeline for extracting adverse drug events from free-text electronic health records , 2017, PloS one.

[51]  Shahram Ebadollahi,et al.  Prevalence of heart failure signs and symptoms in a large primary care population identified through the use of text and data mining of the electronic health record. , 2014, Journal of cardiac failure.

[52]  Nishita Mehta,et al.  Concurrence of big data analytics and healthcare: A systematic review , 2018, Int. J. Medical Informatics.

[53]  Rashmi Patel,et al.  Mood instability is a common feature of mental health disorders and is associated with poor clinical outcomes , 2015, BMJ Open.