Towards a phenome-wide catalog of human clinical traits impacted by genetic ancestry

BackgroundRacial/ethnic differences for commonly measured clinical variables are well documented, and it has been postulated that population-specific genetic factors may play a role. The genetic heterogeneity of admixed populations, such as African Americans, provides a unique opportunity to identify genomic regions and variants associated with the clinical variability observed for diseases and traits across populations.MethodTo begin a systematic search for these population-specific genomic regions at the phenome-wide scale, we determined the relationship between global genetic ancestry, specifically European and African ancestry, and clinical variables measured in a population of African Americans from BioVU, Vanderbilt University’s biorepository linked to de-identified electronic medical records (EMRs) as part of the Epidemiologic Architecture using Genomics and Epidemiology (EAGLE) study. Through billing (ICD-9) codes, procedure codes, labs, and clinical notes, 36 common clinical and laboratory variables were mined from the EMR, including body mass index (BMI), kidney traits, lipid levels, blood pressure, and electrocardiographic measurements. A total of 15,863 DNA samples from non-European Americans were genotyped on the Illumina Metabochip containing ~200,000 variants, of which 11,166 were from African Americans. Tests of association were performed to examine associations between global ancestry and the phenotype of interest.ResultsIncreased European ancestry, and conversely decreased African ancestry, was most strongly correlated with an increase in QRS duration, consistent with previous observations that African Americans tend to have shorter a QRS duration compared with European Americans. Despite known racial/ethnic disparities in blood pressure, European and African ancestry was neither associated with diastolic nor systolic blood pressure measurements.ConclusionCollectively, these results suggest that this clinical population can be used to identify traits in which population differences may be due, in part, to population-specific genetics.

[1]  Joshua C Denny,et al.  Assessing the accuracy of observer-reported ancestry in a biorepository linked to electronic medical records , 2010, Genetics in Medicine.

[2]  Melissa A. Basford,et al.  Generalization of Variants Identified by Genome‐Wide Association Studies for Electrocardiographic Traits in African Americans , 2013, Annals of human genetics.

[3]  Tanya M. Teslovich,et al.  The Metabochip, a Custom Genotyping Array for Genetic Studies of Metabolic, Cardiovascular, and Anthropometric Traits , 2012, PLoS genetics.

[4]  C Kooperberg,et al.  The use of phenome‐wide association studies (PheWAS) for exploration of novel genotype‐phenotype relationships and pleiotropy discovery , 2011, Genetic epidemiology.

[5]  Joshua C Denny,et al.  Modulators of normal electrocardiographic intervals identified in a large electronic medical record. , 2011, Heart rhythm.

[6]  B. Egan,et al.  US trends in prevalence, awareness, treatment, and control of hypertension, 1988-2008. , 2010, JAMA.

[7]  Suzette J. Bielinski,et al.  Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study , 2012, J. Am. Medical Informatics Assoc..

[8]  W. Howard Trends in Serum Lipids and Lipoproteins of Adults, 1960–2002Carroll MD, Lacher DA, Sorlie PD, et al (Centers for Disease Control and Prevention, Hyattsville, Md; NIH, Bethesda, Md; Univ of Texas, Dallas) JAMA 294:1773–1781, 2005§ , 2006 .

[9]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[10]  Tanya M. Teslovich,et al.  Association analyses of 249,796 individuals reveal 18 new loci associated with body mass index , 2010 .

[11]  R. D. de Boer,et al.  Genetic Determinants of P Wave Duration and PR Segment , 2014, Circulation. Cardiovascular genetics.

[12]  Jacob B. Hall,et al.  Accuracy of Administratively-Assigned Ancestry for Diverse Populations in an Electronic Medical Record-Linked Biobank , 2014, PloS one.

[13]  Karen L. Mohlke,et al.  Fine-Mapping and Initial Characterization of QT Interval Loci in African Americans , 2012, PLoS genetics.

[14]  C. Carlson,et al.  The Next PAGE in Understanding Complex Traits: Design for the Analysis of Population Architecture Using Genetics and Epidemiology (PAGE) Study , 2011, American journal of epidemiology.

[15]  Jingyuan Fu,et al.  Common variants in 22 loci are associated with QRS duration and cardiac ventricular conduction , 2010, Nature Genetics.

[16]  Marylyn D. Ritchie,et al.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations , 2010, Bioinform..

[17]  Melissa A. Basford,et al.  Genome- and Phenome-Wide Analyses of Cardiac Conduction Identifies Markers of Arrhythmia Risk , 2013, Circulation.

[18]  Eran Halperin,et al.  Inference of locus-specific ancestry in closely related populations , 2009, Bioinform..

[19]  Suzette J. Bielinski,et al.  Phenome-wide association studies demonstrating pleiotropy of genetic variants within FTO with and without adjustment for body mass index , 2014, Front. Genet..

[20]  Christian Gieger,et al.  Common variants at ten loci modulate the QT interval duration in the QTSCD Study , 2009, Nature Genetics.

[21]  Melissa A. Basford,et al.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data , 2013, Nature Biotechnology.

[22]  Thomas Lumley,et al.  Common variants at ten loci influence QT interval duration in the QTGEN Study , 2009, Nature Genetics.

[23]  Janina M. Jeff,et al.  Evidence of Heterogeneity by Race/Ethnicity in Genetic Determinants of QT Interval , 2014, Epidemiology.

[24]  A. Reiner,et al.  African American race but not genome-wide ancestry is negatively associated with atrial fibrillation among postmenopausal women in the Women's Health Initiative. , 2013, American heart journal.

[25]  Janina M. Jeff,et al.  Characterization of Genome-Wide Association-Identified Variants for Atrial Fibrillation in African Americans , 2012, PloS one.

[26]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[27]  Mark N. Wass,et al.  Genetic variation in SCN10A influences cardiac conduction , 2010, Nature Genetics.

[28]  Robert J. Goodloe,et al.  Leveraging Epidemiologic and Clinical Collections for Genomic Studies of Complex Traits , 2015, Human Heredity.

[29]  N. Risch,et al.  The importance of race and ethnic background in biomedical research and clinical practice. , 2003, The New England journal of medicine.

[30]  Kathleen F. Kerr,et al.  European Ancestry as a Risk Factor for Atrial Fibrillation in African Americans , 2010, Circulation.

[31]  W. Elliott US Trends in Prevalence, Awareness, Treatment, and Control of Hypertension, 1988-2008 , 2011 .

[32]  Kathleen F. Kerr,et al.  Impact of Ancestry and Common Genetic Variants on QT Interval in African Americans , 2012, Circulation. Cardiovascular genetics.

[33]  Jennifer G. Robinson,et al.  Evaluation of the Metabochip Genotyping Array in African Americans and Implications for Fine Mapping of GWAS-Identified Loci: The PAGE Study , 2012, PloS one.

[34]  Katherine M Flegal,et al.  Prevalence of obesity and trends in the distribution of body mass index among US adults, 1999-2010. , 2012, JAMA.

[35]  Karen R Fitzgerald Review of article: Prevalence of obesity and trends in the distribution of body mass index among US adults, 1999-2010 by Katherine M. Flegal, PhD; Margaret D. Carroll, MSPH; Brian K. Kit, MD; Cynthia L. Ogden, PhD (JAMA 2012;307:491-7). , 2013, Journal of vascular nursing : official publication of the Society for Peripheral Vascular Nursing.

[36]  P. Sorlie,et al.  Trends in serum lipids and lipoproteins of adults, 1960-2002. , 2005, JAMA.

[37]  Kathleen F. Kerr,et al.  Genome-Wide Association Studies of the PR Interval in African Americans , 2011, PLoS genetics.

[38]  D. Roden,et al.  Development of a Large‐Scale De‐Identified DNA Biobank to Enable Personalized Medicine , 2008, Clinical pharmacology and therapeutics.

[39]  E. Clayton,et al.  Principles of Human Subjects Protections Applied in an Opt‐Out, De‐identified Biobank , 2010, Clinical and translational science.