EHR-based phenome wide association study in pancreatic cancer

BACKGROUND Pancreatic cancer is one of the most common causes of cancer-related deaths in the United States, it is difficult to detect early and typically has a very poor prognosis. We present a novel method of large-scale clinical hypothesis generation based on phenome wide association study performed using Electronic Health Records (EHR) in a pancreatic cancer cohort. METHODS The study population consisted of 1,154 patients diagnosed with malignant neoplasm of pancreas seen at The Froedtert & The Medical College of Wisconsin academic medical center between the years 2004 and 2013. We evaluated death of a patient as the primary clinical outcome and tested its association with the phenome, which consisted of over 2.5 million structured clinical observations extracted out of the EHR including labs, medications, phenotypes, diseases and procedures. The individual observations were encoded in the EHR using 6,617 unique ICD-9, CPT-4, LOINC, and RxNorm codes. We remapped this initial code set into UMLS concepts and then hierarchically expanded to support generalization into the final set of 10,164 clinical concepts, which formed the final phenome. We then tested all possible pairwise associations between any of the original 10,164 concepts and death as the primary outcome. RESULTS After correcting for multiple testing and folding back (generalizing) child concepts were appropriate, we found 231 concepts to be significantly associated with death in the study population. CONCLUSIONS With the abundance of structured EHR data, phenome wide association studies combined with knowledge engineering can be a viable method of rapid hypothesis generation.

[1]  V. Bobek,et al.  Cimetidine: an anticancer drug? , 2011, European journal of pharmaceutical sciences : official journal of the European Federation for Pharmaceutical Sciences.

[2]  Marylyn D. Ritchie,et al.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations , 2010, Bioinform..

[3]  Lawrence M. Fagan,et al.  Medical informatics: computer applications in health care and biomedicine (Health informatics) , 2003 .

[4]  C. McDonald,et al.  LOINC, a universal standard for identifying laboratory observations: a 5-year update. , 2003, Clinical chemistry.

[5]  Martin Vingron,et al.  Improved detection of overrepresentation of Gene-Ontology annotations with parent-child analysis , 2007, Bioinform..

[6]  H. Lindén,et al.  The European Federation for Pharmaceutical Sciences , 2003 .

[7]  G. Hripcsak,et al.  Correlating electronic health record concepts with healthcare process events , 2013, Journal of the American Medical Informatics Association : JAMIA.

[8]  Nigam H. Shah,et al.  Practice-Based Evidence: Profiling the Safety of Cilostazol by Text-Mining of Clinical Notes , 2013, PloS one.

[9]  J. Nielson,et al.  Current procedural terminology (CPT). , 2016, JAMA.

[10]  D. Lindberg,et al.  Unified Medical Language System , 2020, Definitions.

[11]  R. Altman,et al.  Identifying phenotypic signatures of neuropsychiatric disorders from electronic medical records. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[12]  Nhan Do,et al.  Implementation of RxNorm as a Terminology Mediation Standard for Exchanging Pharmacy Medication between Federal Agencies , 2006, AMIA.

[13]  R. Lubrano,et al.  Relationship between global end-diastolic volume and cardiac output in critically ill infants and children. , 2008, Critical care medicine.

[14]  Marek Tutaj,et al.  Next generation ontology browser , 2013, ICBO.

[15]  R Finnegan,et al.  ICD-9-CM , 1986, Journal.

[16]  Ke-Xing Fan,et al.  Cimetidine suppresses lung tumor growth in mice through proapoptosis of myeloid-derived suppressor cells. , 2013, Molecular immunology.

[17]  O. Sürücü,et al.  Tumour growth inhibition of human pancreatic cancer xenografts in SCID mice by cimetidine , 2004, Inflammation Research.

[18]  Quan Ding,et al.  Temporal phenome analysis of a large electronic health record cohort enables identification of hospital-acquired complications. , 2013, Journal of the American Medical Informatics Association : JAMIA.

[19]  A. Seif,et al.  Induction mortality and resource utilization in children treated for acute myeloid leukemia at free‐standing pediatric hospitals in the United States , 2013, Cancer.