Application of a Natural Language Processing Algorithm to Asthma Ascertainment. An Automated Chart Review

Rationale: The difficulty of asthma ascertainment and its associated methodologic heterogeneity have created significant barriers to asthma care and research.

Objectives: We evaluated the validity of an existing natural language processing (NLP) algorithm for asthma criteria to enable an automated chart review using electronic medical records (EMRs).

Methods: The study was designed as a retrospective birth cohort study using a random sample of 500 subjects from the 1997‐2007 Mayo Birth Cohort who were born at Mayo Clinic and enrolled in primary pediatric care at Mayo Clinic Rochester. Performance of NLP‐based asthma ascertainment using predetermined asthma criteria was assessed by determining both criterion validity (with chart review of EMRs by an abstractor as the gold standard) and construct validity (association with known risk factors for asthma, such as allergic rhinitis).

Measurements and Main Results: Three subjects whose respiratory symptoms could be attributed to other conditions (e.g., tracheomalacia) were excluded. Of the remaining 497 eligible subjects, 51% were male, 77% were white, and the median age at last follow‐up was 11.5 years. The asthma prevalence in the study cohort was 31%. Sensitivity, specificity, positive predictive value, and negative predictive value of the NLP algorithm in predicting asthma status were 97%, 95%, 90%, and 98%, respectively. The risk factors for asthma (e.g., allergic rhinitis) identified by NLP were the same as those identified by the abstractor.

Conclusions: Asthma ascertainment through NLP should be considered in the era of EMRs because it can enable large‐scale clinical studies in a more time‐efficient manner and improve the recognition and care of childhood asthma in practice.
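The relationship among the reported performance figures can be illustrated with a short calculation. The sketch below is not the study's analysis code; the function name `predictive_values` is hypothetical. It assumes that the 31% prevalence refers to asthma status by the gold standard (abstractor chart review) and uses the rounded sensitivity and specificity from the abstract, so the output should be read only as an approximate consistency check against the reported predictive values.

```python
def predictive_values(prevalence, sensitivity, specificity):
    """Derive PPV and NPV from prevalence, sensitivity, and specificity.

    Works with cell proportions of a 2x2 table (test result vs. true status)
    rather than raw counts, so no subject-level data are needed.
    """
    true_pos = prevalence * sensitivity              # asthma, flagged by NLP
    false_neg = prevalence * (1 - sensitivity)       # asthma, missed by NLP
    true_neg = (1 - prevalence) * specificity        # no asthma, not flagged
    false_pos = (1 - prevalence) * (1 - specificity) # no asthma, flagged

    ppv = true_pos / (true_pos + false_pos)
    npv = true_neg / (true_neg + false_neg)
    return ppv, npv


# Rounded values taken from the abstract (assumed to share one gold standard).
ppv, npv = predictive_values(prevalence=0.31, sensitivity=0.97, specificity=0.95)
print(f"PPV = {ppv:.3f}, NPV = {npv:.3f}")
```

Under these assumptions the derived values land close to the reported 90% and 98%; small differences are expected because the inputs are themselves rounded percentages.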
