Automated chart review utilizing natural language processing algorithm for asthma predictive index

BackgroundThus far, no algorithms have been developed to automatically extract patients who meet Asthma Predictive Index (API) criteria from the Electronic health records (EHR) yet. Our objective is to develop and validate a natural language processing (NLP) algorithm to identify patients that meet API criteria.MethodsThis is a cross-sectional study nested in a birth cohort study in Olmsted County, MN. Asthma status ascertained by manual chart review based on API criteria served as gold standard. NLP-API was developed on a training cohort (n = 87) and validated on a test cohort (n = 427). Criterion validity was measured by sensitivity, specificity, positive predictive value and negative predictive value of the NLP algorithm against manual chart review for asthma status. Construct validity was determined by associations of asthma status defined by NLP-API with known risk factors for asthma.ResultsAmong the eligible 427 subjects of the test cohort, 48% were males and 74% were White. Median age was 5.3 years (interquartile range 3.6–6.8). 35 (8%) had a history of asthma by NLP-API vs. 36 (8%) by abstractor with 31 by both approaches. NLP-API predicted asthma status with sensitivity 86%, specificity 98%, positive predictive value 88%, negative predictive value 98%. Asthma status by both NLP and manual chart review were significantly associated with the known asthma risk factors, such as history of allergic rhinitis, eczema, family history of asthma, and maternal history of smoking during pregnancy (p value < 0.05). Maternal smoking [odds ratio: 4.4, 95% confidence interval 1.8–10.7] was associated with asthma status determined by NLP-API and abstractor, and the effect sizes were similar between the reviews with 4.4 vs 4.2 respectively.ConclusionNLP-API was able to ascertain asthma status in children mining from EHR and has a potential to enhance asthma care and research through population management and large-scale studies when identifying children who meet API criteria.

[1]  L. Weed Medical records that guide and teach. , 1968, The New England journal of medicine.

[2]  Margaret King,et al.  Evaluation of natural language processing systems , 1991 .

[3]  C E Reed,et al.  A community-based study of the epidemiology of asthma. Incidence rates, 1964-1983. , 1992, The American review of respiratory disease.

[4]  W R Hersh,et al.  Automated application of clinical practice guidelines for asthma management. , 1996, Proceedings : a conference of the American Medical Informatics Association. AMIA Fall Symposium.

[5]  Third Expert Panel on theDiagnosis,et al.  Expert Panel Report 3: Guidelines for the Diagnosis and Management of Asthma , 1997 .

[6]  B. Yawn,et al.  The impact of requiring patient authorization for use of data in medical records research. , 1998, The Journal of family practice.

[7]  C E Reed,et al.  Allergic rhinitis in Rochester, Minnesota residents with asthma: frequency and impact on health care charges. , 1999, The Journal of allergy and clinical immunology.

[8]  J. Castro‐Rodriguez,et al.  A clinical index to define risk of asthma in young children with recurrent wheezing. , 2000, American journal of respiratory and critical care medicine.

[9]  Michael Krauthammer,et al.  GENIES: a natural-language processing system for the extraction of molecular pathways from journal articles , 2001, ISMB.

[10]  D. Gaboda,et al.  Early childhood chronic illness: comparability of maternal reports and medical records. , 2001, Vital and health statistics. Series 2, Data evaluation and methods research.

[11]  U. Frey,et al.  Age-related differences in perceived asthma control in childhood: guidelines and reality , 2002, European Respiratory Journal.

[12]  J. Austin,et al.  Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports. , 2002, Radiology.

[13]  Stefano Guerra,et al.  Outcome of asthma and wheezing in the first 6 years of life: follow-up through adolescence. , 2005, American journal of respiratory and critical care medicine.

[14]  The High Concentration of U.S. Health Care Expenditures , 2006 .

[15]  D. Strachan,et al.  Worldwide time trends in the prevalence of symptoms of asthma, allergic rhinoconjunctivitis, and eczema in childhood: ISAAC Phases One and Three repeat multicountry cross-sectional surveys , 2006, The Lancet.

[16]  Scott T. Weiss,et al.  Extracting principal diagnosis, co-morbidity and smoking status for asthma research: evaluation of a natural language processing system , 2006, BMC Medical Informatics Decis. Mak..

[17]  B. Yawn,et al.  The Potential Biases in Studying the Relationship between Asthma and Microbial Infection , 2007, The Journal of asthma : official journal of the Association for the Care of Asthma.

[18]  B. Yawn,et al.  Why Parents Seek Medical Evaluations for Their Children With Mild Acute Illnesses , 2008, Clinical pediatrics.

[19]  Preemptive Use of High-Dose Fluticasone for Virus-Induced Wheezing in Young Children , 2009, Pediatrics.

[20]  P. Lambert,et al.  Oral prednisolone for preschool children with acute virus-induced wheezing. , 2009, The New England journal of medicine.

[21]  Abel N. Kho,et al.  A Highly Specific Algorithm for Identifying Asthma Cases and Controls for Genome-Wide Association Studies , 2009, AMIA.

[22]  J. Stockman Oral Prednisolone for Preschool Children with Acute Virus-Induced Wheezing , 2010 .

[23]  D. Curran‐Everett,et al.  Identification of asthma phenotypes using cluster analysis in the Severe Asthma Research Program. , 2010, American journal of respiratory and critical care medicine.

[24]  D. Meyers Genetics of asthma and allergy: what have we learned? , 2010, The Journal of allergy and clinical immunology.

[25]  A. Jha,et al.  Meaningful use of electronic health records: the road ahead. , 2010, JAMA.

[26]  Mario Castro,et al.  Heterogeneity of severe asthma in childhood: confirmation by cluster analysis of children in the National Institutes of Health/National Heart, Lung, and Blood Institute Severe Asthma Research Program. , 2011, The Journal of allergy and clinical immunology.

[27]  Steven H. Brown,et al.  Automated identification of postoperative complications within an electronic medical record using natural language processing. , 2011, JAMA.

[28]  Vital signs: asthma prevalence, disease characteristics, and self-management education: United States, 2001--2009. , 2011, MMWR. Morbidity and mortality weekly report.

[29]  B. Yawn,et al.  History of the Rochester Epidemiology Project: half a century of medical records linkage in a US population. , 2012, Mayo Clinic proceedings.

[30]  K. E. Ravikumar,et al.  Automated chart review for asthma cohort identification using natural language processing: an exploratory study. , 2013, Annals of allergy, asthma & immunology : official publication of the American College of Allergy, Asthma, & Immunology.

[31]  J. Henry,et al.  Adoption of Electronic Health Record Systems among U . S . Non-Federal Acute Care Hospitals : 2008-2015 , 2013 .

[32]  W. Phipatanakul,et al.  Utility of the Asthma Predictive Index in predicting childhood asthma and identifying disease-modifying interventions. , 2014, Annals of allergy, asthma & immunology : official publication of the American College of Allergy, Asthma, & Immunology.

[33]  T. To,et al.  Trends in the age of diagnosis of childhood asthma. , 2014, The Journal of allergy and clinical immunology.

[34]  Y. Juhn Risks for infection in patients with asthma (or other atopic conditions): is asthma more than a chronic airway disease? , 2014, The Journal of allergy and clinical immunology.

[35]  B. Yawn,et al.  Risk of herpes zoster in children with asthma , 2015, Allergy and asthma proceedings.

[36]  C. Wi,et al.  Development and initial testing of Asthma Predictive Index for a retrospective study: an exploratory study , 2015, The Journal of asthma : official journal of the Association for the Care of Asthma.

[37]  T. Mahr,et al.  Exhaled Biomarkers and Gene Expression at Preschool Age Improve Asthma Prediction at 6 Years of Age , 2015, Pediatrics.

[38]  Paul A. Harris,et al.  PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability , 2016, J. Am. Medical Informatics Assoc..

[39]  J. Celedón,et al.  Risk and Protective Factors for Childhood Asthma: What Is the Evidence? , 2016, The journal of allergy and clinical immunology. In practice.

[40]  D. Strachan,et al.  The epidemiology, healthcare and societal burden and costs of asthma in the UK and its member nations: analyses of standalone and linked national databases , 2016, BMC Medicine.

[41]  L. Murphy,et al.  State-level medical and absenteeism cost of asthma in the United States , 2017, The Journal of asthma : official journal of the Association for the Care of Asthma.

[42]  Sunghwan Sohn,et al.  Application of a Natural Language Processing Algorithm to Asthma Ascertainment. An Automated Chart Review , 2017, American journal of respiratory and critical care medicine.

[43]  E. Ryu,et al.  What accounts for the association between late preterm births and risk of asthma? , 2017, Allergy and asthma proceedings.

[44]  Hongfang Liu,et al.  Clinical documentation variations and NLP system portability: a case study in asthma birth cohorts across institutions , 2017, J. Am. Medical Informatics Assoc..

[45]  Mir H. Ali,et al.  Natural Language Processing for Asthma Ascertainment in Different Practice Settings. , 2018, The journal of allergy and clinical immunology. In practice.