Coronary Artery Disease Phenotype Detection in an Academic Hospital System Setting

BACKGROUND  The United States, and especially West Virginia, have a tremendous burden of coronary artery disease (CAD). Undiagnosed familial hypercholesterolemia (FH) is an important factor for CAD in the U.S. Identification of a CAD phenotype is an initial step to find families with FH. OBJECTIVE  We hypothesized that a CAD phenotype detection algorithm that uses discrete data elements from electronic health records (EHRs) can be validated from EHR information housed in a data repository. METHODS  We developed an algorithm to detect a CAD phenotype which searched through discrete data elements, such as diagnosis, problem lists, medical history, billing, and procedure (International Classification of Diseases [ICD]-9/10 and Current Procedural Terminology [CPT]) codes. The algorithm was applied to two cohorts of 500 patients, each with varying characteristics. The second (younger) cohort consisted of parents from a school child screening program. We then determined which patients had CAD by systematic, blinded review of EHRs. Following this, we revised the algorithm by refining the acceptable diagnoses and procedures. We ran the second algorithm on the same cohorts and determined the accuracy of the modification. RESULTS  CAD phenotype Algorithm I was 89.6% accurate, 94.6% sensitive, and 85.6% specific for group 1. After revising the algorithm (denoted CAD Algorithm II) and applying it to the same groups 1 and 2, sensitivity 98.2%, specificity 87.8%, and accuracy 92.4; accuracy 93% for group 2. Group 1 F1 score was 92.4%. Specific ICD-10 and CPT codes such as "coronary angiography through a vein graft" were more useful than generic terms. CONCLUSION  We have created an algorithm, CAD Algorithm II, that detects CAD on a large scale with high accuracy and sensitivity (recall). It has proven useful among varied patient populations. Use of this algorithm can extend to monitor a registry of patients in an EHR and/or to identify a group such as those with likely FH.

[1]  C. Lilly,et al.  Trends in serum lipids among 5th grade CARDIAC participants, 2002–2012 , 2013, Journal of Epidemiology & Community Health.

[2]  D. Wald,et al.  Child-parent screening for familial hypercholesterolemia. , 2011, The Journal of pediatrics.

[3]  James J. Cimino,et al.  Clinical Informatics Researcher's Desiderata for the Data Content of the Next Generation Electronic Health Record , 2017, Applied Clinical Informatics.

[4]  M. Pratt,et al.  Documented need for more effective diagnosis and treatment of familial hypercholesterolemia according to data from 502 heterozygotes in Utah. , 1993, The American journal of cardiology.

[5]  Rahul Kashyap,et al.  Derivation and validation of a computable phenotype for acute decompensated heart failure in hospitalized patients , 2020, BMC Medical Informatics and Decision Making.

[6]  Dean F Sittig,et al.  Bringing science to medicine: an interview with Larry Weed, inventor of the problem-oriented medical record , 2014, J. Am. Medical Informatics Assoc..

[7]  J. Kastelein,et al.  20-Year Follow-up of Statins in Children with Familial Hypercholesterolemia. , 2019, The New England journal of medicine.

[8]  C. Lilly,et al.  The Coronary Artery Risk Detection in Appalachian Communities (CARDIAC) Project: An 18 Year Review. , 2018, Current pediatric reviews.

[9]  Jonathan P. Bickel,et al.  Developing an Algorithm to Detect Early Childhood Obesity in Two Tertiary Pediatric Medical Centers , 2016, Applied Clinical Informatics.

[10]  M. Pletcher,et al.  Young Adult Exposure to Cardiovascular Risk Factors and Risk of Events Later in Life: The Framingham Offspring Study , 2016, PloS one.

[11]  W. Neal,et al.  Screening for Hypercholesterolemia in Children: What Strategies Can Be Employed , 2017, Current Cardiovascular Risk Reports.

[12]  Mohammad Khalilia,et al.  Quantifying care coordination using natural language processing and domain-specific ontology , 2015, J. Am. Medical Informatics Assoc..

[13]  S. D. de Ferranti,et al.  Cholesterol testing among children and adolescents during health visits. , 2014, JAMA.

[14]  J. Knowles,et al.  Underutilization of cascade screening for familial hypercholesterolemia , 2014 .

[15]  A. Akobeng Understanding diagnostic tests 1: sensitivity, specificity and predictive values , 2007, Acta paediatrica.

[16]  D. Wald,et al.  Child-Parent Familial Hypercholesterolemia Screening in Primary Care. , 2016, The New England journal of medicine.

[17]  Michael J. Denney,et al.  Validating the extract, transform, load process used to populate a large clinical research database , 2016, Int. J. Medical Informatics.

[18]  Joshua C. Denny,et al.  Combining billing codes, clinical notes, and medications from electronic health records provides superior phenotyping performance , 2016, J. Am. Medical Informatics Assoc..

[19]  Francesca N. Delling,et al.  Heart Disease and Stroke Statistics—2019 Update: A Report From the American Heart Association , 2019, Circulation.

[20]  Eric E. Smith,et al.  2014 ACC/AHA Key Data Elements and Definitions for Cardiovascular Endpoint Events in Clinical Trials: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Data Standards (Writing Committee to Develop Cardiovascular Endpoints Data Standards). , 2015, Journal of the American College of Cardiology.

[21]  I. Kohane,et al.  Methods to Develop an Electronic Medical Record Phenotype Algorithm to Compare the Risk of Coronary Artery Disease across 3 Chronic Disease Cohorts , 2015, PloS one.

[22]  José Francisco Aldana Montes,et al.  Dione: An OWL representation of ICD-10-CM for classifying patients’ diseases , 2016, J. Biomed. Semant..

[23]  Catherine Boileau,et al.  Familial hypercholesterolaemia is underdiagnosed and undertreated in the general population: guidance for clinicians to prevent coronary heart disease , 2013, European heart journal.

[24]  W. Neal,et al.  Universal Versus Targeted Blood Cholesterol Screening Among Youth: The CARDIAC Project , 2010, Pediatrics.

[25]  Joshua C Denny,et al.  Evaluating electronic health record data sources and algorithmic approaches to identify hypertensive individuals , 2017, J. Am. Medical Informatics Assoc..