Phenome-wide scanning identifies multiple diseases and disease severity phenotypes associated with HLA variants

Numerous associations were discovered between human leukocyte antigen (HLA) variation and a comprehensive set of phenotypes derived from electronic health records. Hints on health and disease from HLA Each of us expresses a mix of different human leukocyte antigens (HLAs), which present self- and foreign peptides to T cells. Because slightly different peptides are presented by each HLA type, HLA expression can influence an individual’s susceptibility to disease. Karnes et al. scrutinized electronic health record information from tens of thousands of people in two distinct cohorts to compare their phenotypes to the HLA alleles they express. This study confirmed previously identified HLA associations and also identified new ones; most associations were related to autoimmune diseases. The researchers have made the catalog freely available so that other groups can mine the data for future discoveries about how HLAs drive different phenotypes. Although many phenotypes have been associated with variants in human leukocyte antigen (HLA) genes, the full phenotypic impact of HLA variants across all diseases is unknown. We imputed HLA genomic variation from two populations of 28,839 and 8431 European ancestry individuals and tested association of HLA variation with 1368 phenotypes. A total of 104 four-digit and 92 two-digit HLA allele phenotype associations were significant in both discovery and replication cohorts, the strongest being HLA-DQB1*03:02 and type 1 diabetes. Four previously unidentified associations were identified across the spectrum of disease with two- and four-digit HLA alleles and 10 with nonsynonymous variants. Some conditions associated with multiple HLA variants and stronger associations with more severe disease manifestations were identified. A comprehensive, publicly available catalog of clinical phenotypes associated with HLA variation is provided. Examining HLA variant disease associations in this large data set allows comprehensive definition of disease associations to drive further mechanistic insights.

[1]  B. Brumback,et al.  Human leukocyte antigen class II and cervical cancer risk: a population-based study. , 2002, The Journal of infectious diseases.

[2]  A. Reunanen,et al.  The prevalence of celiac disease in Europe: Results of a centralized, international mass screening project , 2010, Annals of medicine.

[3]  Calliope A. Dendrou,et al.  Class II HLA interactions modulate genetic risk for multiple sclerosis , 2015, Nature Genetics.

[4]  Katsushi Tokunaga,et al.  Phenome-wide association study maps new diseases to the human major histocompatibility complex region , 2016, Journal of Medical Genetics.

[5]  J. Todd,et al.  HLA DR-DQ Haplotypes and Genotypes and Type 1 Diabetes Risk , 2008, Diabetes.

[6]  Melissa A. Basford,et al.  Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data , 2013, Nature Biotechnology.

[7]  Marylyn D. Ritchie,et al.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations , 2010, Bioinform..

[8]  H. Erlich,et al.  Allelic sequence variation of the HLA-DQ loci: relationship to serology and to insulin-dependent diabetes susceptibility. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[9]  P. Gregersen,et al.  Fine mapping seronegative and seropositive rheumatoid arthritis to shared and distinct HLA alleles by adjusting for the effects of heterogeneity. , 2014, American journal of human genetics.

[10]  S. Purcell,et al.  Pleiotropy in complex traits: challenges and strategies , 2013, Nature Reviews Genetics.

[11]  J. Anaya,et al.  HLA and Sjögren's syndrome susceptibility. A meta-analysis of worldwide studies. , 2012, Autoimmunity reviews.

[12]  N. H. ten Dam-van Loon,et al.  Potential Predictors of Poor Visual Outcome in Human Leukocyte Antigen-B27-Associated Uveitis. , 2016, American journal of ophthalmology.

[13]  S. Mallal,et al.  HLA-B*5701 screening for hypersensitivity to abacavir. , 2008, The New England journal of medicine.

[14]  Dana C Crawford,et al.  Pitfalls of merging GWAS data: lessons learned in the eMERGE network and quality control procedures to maintain high data quality , 2011, Genetic epidemiology.

[15]  J. Reveille,et al.  Subtypes of HLA-B27: history and implications in the pathogenesis of ankylosing spondylitis. , 2009, Advances in experimental medicine and biology.

[16]  Marylyn D. Ritchie,et al.  Detection of Pleiotropy through a Phenome-Wide Association Study (PheWAS) of Epidemiologic Data as Part of the Environmental Architecture for Genes Linked to Environment (EAGLE) Study , 2014, PLoS genetics.

[17]  Buhm Han,et al.  Imputing Amino Acid Polymorphisms in Human Leukocyte Antigens , 2013, PloS one.

[18]  Israel Ra The International Classification of Disease. Two hundred years of development. , 1978 .

[19]  Heather J Cordell,et al.  Absolute risk of childhood-onset type 1 diabetes defined by human leukocyte antigen class II genotype: a population-based study in the United Kingdom. , 2004, The Journal of clinical endocrinology and metabolism.

[20]  Robert M. Plenge,et al.  Defining the Role of the MHC in Autoimmunity: A Review and Pooled Analysis , 2008, PLoS genetics.

[21]  S. Raychaudhuri,et al.  Genetics and epigenetics of rheumatoid arthritis , 2013, Nature Reviews Rheumatology.

[22]  Robert M. Plenge,et al.  Five amino acids in three HLA proteins explain most of the association between MHC and seropositive rheumatoid arthritis , 2011, Nature Genetics.

[23]  C. Moore,et al.  Association between presence of HLA-B*5701, HLA-DR7, and HLA-DQ3 and hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir , 2002, The Lancet.

[24]  Sue Povey,et al.  Gene map of the extended human MHC , 2004, Nature Reviews Genetics.

[25]  M. Baur,et al.  Genetic heterogeneity, modes of inheritance, and risk estimates for a joint study of Caucasians with insulin-dependent diabetes mellitus. , 1988, American journal of human genetics.

[26]  Eli Stahl,et al.  High density genetic mapping identifies new susceptibility loci for rheumatoid arthritis , 2012, Nature Genetics.

[27]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[28]  Jorge R Oksenberg,et al.  Multiple sclerosis genetics , 2018, Multiple sclerosis.

[29]  D. Roden,et al.  Biobanks and Electronic Medical Records: Enabling Cost-Effective Research , 2014, Science Translational Medicine.

[30]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[31]  S. Bang,et al.  An HLA-C amino-acid variant in addition to HLA-B*27 confers risk for ankylosing spondylitis in the Korean population , 2015, Arthritis Research & Therapy.

[32]  Miles Parkes,et al.  Genetic insights into common pathways and complex relationships among immune-mediated diseases , 2013, Nature Reviews Genetics.

[33]  B S Weir,et al.  HIBAG—HLA genotype imputation with attribute bagging , 2013, The Pharmacogenomics Journal.

[34]  P. Deloukas,et al.  A genome-wide association study for celiac disease identifies risk variants in the region harboring IL2 and IL21 , 2007, Nature Genetics.

[35]  Gerard Tromp,et al.  The phenotypic legacy of admixture between modern humans and Neandertals , 2016, Science.

[36]  R. Yamada,et al.  Ethnic differences in allele frequency of autoimmune-disease-associated SNPs , 2005, Journal of Human Genetics.

[37]  J. Granados,et al.  EXTENDED MHC HAPLOTYPES IN 21-HYDROXYLASE-DEFICIENCY CONGENITAL ADRENAL HYPERPLASIA: SHARED GENOTYPES IN UNRELATED PATIENTS , 1983, The Lancet.

[38]  P. Gregersen,et al.  Classical HLA-DRB1 and DPB1 Alleles Account for HLA Associations with Primary Biliary Cirrhosis , 2012, Genes and Immunity.

[39]  Y. Benjamini,et al.  More powerful procedures for multiple significance testing. , 1990, Statistics in medicine.

[40]  Kasper Lage,et al.  Pervasive Sharing of Genetic Effects in Autoimmune Disease , 2011, PLoS genetics.

[41]  Conor V. Dolan,et al.  TATES: Efficient Multivariate Genotype-Phenotype Analysis for Genome-Wide Association Studies , 2013, PLoS genetics.

[42]  Derek Middleton,et al.  New allele frequency database: www.allelefrequencies.net , 2003 .

[43]  J. Long,et al.  Illumina human exome genotyping array clustering and quality control , 2014, Nature Protocols.

[44]  H. Faruki,et al.  HLA-B*5701 screening for hypersensitivity to abacavir. , 2008, Personalized medicine.

[45]  Matti Pirinen,et al.  Identification of 15 new psoriasis susceptibility loci highlights the role of innate immunity , 2012 .

[46]  Wendy A. Wolf,et al.  The eMERGE Network: A consortium of biorepositories linked to electronic medical records data for conducting genomic studies , 2011, BMC Medical Genomics.

[47]  Kristian Løvås,et al.  CYP21A2 polymorphisms in patients with autoimmune Addison's disease, and linkage disequilibrium to HLA risk alleles. , 2014, European journal of endocrinology.

[48]  T. Wienker,et al.  Multiple sclerosis associated amino acids of polymorphic regions relevant for the HLA antigen binding are confined to HLA-DR2. , 2000, Human immunology.

[49]  S. Mallal,et al.  Comparison of HLA allelic imputation programs , 2017, PloS one.

[50]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[51]  M. Carrington,et al.  Fine-Mapping the Genetic Association of the Major Histocompatibility Complex in Multiple Sclerosis: HLA and Non-HLA Effects , 2013, PLoS genetics.

[52]  A. Barton,et al.  058 Investigating the pharmacogenetics of anti-TNF response in patients with rheumatoid arthritis utilising the re-weighted disease activity score: results from the biologics in rheumatoid arthritis genetics and genomics study syndicate , 2019, Rheumatology.

[53]  Zuguo Liu,et al.  The amino acid variation within the binding pocket 7 and 9 of HLA-DRB1 molecules are associated with primary Sjögren's syndrome. , 2015, Journal of autoimmunity.

[54]  J. Todd,et al.  HLA-DQβ gene contributes to susceptibility and resistance to insulin-dependent diabetes mellitus , 1987, Nature.

[55]  Dana C. Crawford,et al.  Unravelling the human genome–phenome relationship using phenome-wide association studies , 2016, Nature Reviews Genetics.

[56]  Daniel J. Gaffney,et al.  Dense fine-mapping study identifies new susceptibility loci for primary biliary cirrhosis , 2012, Nature Genetics.

[57]  M. Brilliant,et al.  A PheWAS approach in studying HLA-DRB1*1501 , 2013, Genes and Immunity.

[58]  Buhm Han,et al.  Additive and interaction effects at three amino acid positions in HLA-DQ and HLA-DR molecules drive type 1 diabetes risk , 2015, Nature Genetics.

[59]  Simon C. Potter,et al.  Genetic risk and a primary role for cell-mediated immune mechanisms in multiple sclerosis , 2011, Nature.

[60]  Joshua C. Denny,et al.  R PheWAS: data analysis and plotting tools for phenome-wide association studies in the R environment , 2014, Bioinform..

[61]  James T. Elder,et al.  Identification of fifteen new psoriasis susceptibility loci highlights the role of innate immunity , 2012, Nature Genetics.

[62]  Alexander T. Dilthey,et al.  HLA*IMP - an integrated framework for imputing classical HLA alleles from SNP genotypes , 2011, Bioinform..

[63]  T. Battelino,et al.  High-risk genotypes HLA-DR3-DQ2/DR3-DQ2 and DR3-DQ2/DR4-DQ8 in co-occurrence of type 1 diabetes and celiac disease , 2016, Autoimmunity.

[64]  A. Barabasi,et al.  Network medicine : a network-based approach to human disease , 2010 .

[65]  D. Roden,et al.  Development of a Large‐Scale De‐Identified DNA Biobank to Enable Personalized Medicine , 2008, Clinical pharmacology and therapeutics.

[66]  A. V. D. Van Der Zee,et al.  Analysis of the entire HLA region in susceptibility for cervical cancer: a comprehensive study , 2005, Journal of Medical Genetics.

[67]  P. Smouse,et al.  genalex 6: genetic analysis in Excel. Population genetic software for teaching and research , 2006 .

[68]  Sarah Edkins,et al.  Dense genotyping identifies and localizes multiple common and rare variant association signals in celiac disease , 2011, Nature Genetics.

[69]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.