Medical relevance of protein-truncating variants across 337,205 individuals in the UK Biobank study

Protein-truncating variants can have profound effects on gene function and are critical for clinical genome interpretation and generating therapeutic hypotheses, but their relevance to medical phenotypes has not been systematically assessed. Here, we characterize the effect of 18,228 protein-truncating variants across 135 phenotypes from the UK Biobank and find 27 associations between medical phenotypes and protein-truncating variants in genes outside the major histocompatibility complex. We perform phenome-wide analyses and directly measure the effect in homozygous carriers, commonly referred to as “human knockouts,” across medical phenotypes for genes implicated as being protective against disease or associated with at least one phenotype in our study. We find several genes with strong pleiotropic or non-additive effects. Our results illustrate the importance of protein-truncating variants in a variety of diseases.Protein-truncating variants (PTVs) are predicted to significantly affect a gene’s function and, thus, human traits. Here, DeBoever et al. systematically analyze PTVs in more than 300,000 individuals across 135 phenotypes and identify 27 associations between PTVs and medical conditions.

[1]  V. Sheffield,et al.  Identification of a Gene That Causes Primary Open Angle Glaucoma , 1997, Science.

[2]  S. Chen,et al.  ARC, an inhibitor of apoptosis expressed in skeletal muscle and heart that interacts selectively with caspases. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[3]  F. Christiansen,et al.  The genetic basis for the association of the 8.1 ancestral haplotype (A1, B8, DR3) with multiple immunopathological diseases , 1999, Immunological reviews.

[4]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[5]  D J Glass,et al.  Identification of Ubiquitin Ligases Required for Skeletal Muscle Atrophy , 2001, Science.

[6]  Xi Jiang,et al.  Human susceptibility and resistance to Norwalk virus infection , 2003, Nature Medicine.

[7]  M. Hentze,et al.  Nonsense-mediated decay approaches the clinic , 2004, Nature Genetics.

[8]  Alexander Pertsemlidis,et al.  Low LDL cholesterol in individuals of African descent resulting from frequent nonsense mutations in PCSK9 , 2005, Nature Genetics.

[9]  Pak Sham,et al.  Parental phenotypes in family-based association analysis. , 2005, American journal of human genetics.

[10]  Jonathan C. Cohen,et al.  Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. , 2006, The New England journal of medicine.

[11]  C. Hansen Table A.1 , 2007 .

[12]  S. Chanock,et al.  Common variants of FUT2 are associated with plasma vitamin B12 levels , 2008, Nature Genetics.

[13]  K. Dewar,et al.  Allele-specific chromatin remodeling in the ZPBP2/GSDMB/ORMDL3 locus associated with the risk of asthma and autoimmune disease. , 2009, American journal of human genetics.

[14]  P. González-Sántos,et al.  Additive effects of LPL, APOA5 and APOE variant combinations on triglyceride levels and hypertriglyceridemia: results of the ICARIA genetic sub-study , 2010, BMC Medical Genetics.

[15]  Marylyn D. Ritchie,et al.  PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene–disease associations , 2010, Bioinform..

[16]  D. MacArthur,et al.  Loss-of-function variants in the genomes of healthy humans. , 2010, Human molecular genetics.

[17]  J. van Bergen,et al.  Celiac disease: how complicated can it get? , 2010, Immunogenetics.

[18]  Tariq Ahmad,et al.  Genome-wide meta-analysis increases to 71 the number of confirmed Crohn's disease susceptibility loci , 2010, Nature Genetics.

[19]  A. Robeznieks,et al.  Come and get IT. , 2010, Modern healthcare.

[20]  Beate Ritz,et al.  Replication of GWAS Associations for GAK and MAPT in Parkinson's Disease , 2010, Annals of human genetics.

[21]  A. Irvine,et al.  Filaggrin mutations associated with skin and allergic diseases. , 2011, The New England journal of medicine.

[22]  J. Todd,et al.  FUT2 Nonsecretor Status Links Type 1 Diabetes Susceptibility and Resistance to Infection , 2011, Diabetes.

[23]  Joshua M. Korn,et al.  Deep resequencing of GWAS loci identifies independent rare variants associated with inflammatory bowel disease , 2011, Nature Genetics.

[24]  Christine M. Williams,et al.  Genetic variations at the lipoprotein lipase gene influence plasma lipid concentrations and interact with plasma n-6 polyunsaturated fatty acids to modulate lipid metabolism. , 2011, Atherosclerosis.

[25]  J. Trent,et al.  Genome-wide association study identifies novel loci associated with serum level of vitamin B12 in Chinese men. , 2012, Human molecular genetics.

[26]  K. Kaukinen,et al.  Association study of FUT2 (rs601338) with celiac disease and inflammatory bowel disease in the Finnish population. , 2012, Tissue antigens.

[27]  Joseph K. Pickrell,et al.  A Systematic Survey of Loss-of-Function Variants in Human Protein-Coding Genes , 2012, Science.

[28]  A. Cassio,et al.  Current Loss-of-Function Mutations in the Thyrotropin Receptor Gene: When to Investigate, Clinical Effects, and Treatment , 2012, Journal of clinical research in pediatric endocrinology.

[29]  Huan Ren,et al.  Association of the IRF5 rs2004640 polymorphism with rheumatoid arthritis: a meta-analysis , 2013, Rheumatology International.

[30]  F. Hu,et al.  A genome wide association study of genetic loci that influence tumour biomarkers cancer antigen 19-9, carcinoembryonic antigen and α fetoprotein and their associations with cancer risk , 2013, Gut.

[31]  R. Kitsis,et al.  Apoptosis Repressor with a CARD Domain (ARC) Restrains Bax-Mediated Pathogenesis in Dystrophic Skeletal Muscle , 2013, PloS one.

[32]  D. G. MacArthur,et al.  Guidelines for investigating causality of sequence variants in human disease , 2014, Nature.

[33]  C. Sudlow,et al.  UK Biobank Data: Come and Get It , 2014, Science Translational Medicine.

[34]  William Wheeler,et al.  Rare variants of large effect in BRCA2 and CHEK2 affect risk of lung cancer , 2014, Nature Genetics.

[35]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[36]  Christoph Steinbeck,et al.  Genome-Wide Association Study of Metabolic Traits Reveals Novel Gene-Metabolite-Disease Links , 2014, PLoS genetics.

[37]  S. Donath,et al.  Functional, morphological, and apoptotic alterations in skeletal muscle of ARC deficient mice , 2015, Apoptosis.

[38]  S. O’Brien,et al.  Genetic Variations Affecting Serum Carcinoembryonic Antigen Levels and Status of Regional Lymph Nodes in Patients with Sporadic Colorectal Cancer from Southern China , 2014, PloS one.

[39]  Eric S. Lander,et al.  A polygenic burden of rare disruptive mutations in schizophrenia , 2014, Nature.

[40]  J. Horwood UK Biobank Data: Come and Get It , 2014 .

[41]  Andres Metspalu,et al.  Distribution and Medical Impact of Loss-of-Function Variants in the Finnish Founder Population , 2014, PLoS genetics.

[42]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[43]  U. Beuers,et al.  Fucosyltransferase 2: A Genetic Risk Factor for Primary Sclerosing Cholangitis and Crohn's Disease—A Comprehensive Review , 2015, Clinical Reviews in Allergy & Immunology.

[44]  C. Woods,et al.  New Mendelian Disorders of Painlessness , 2015, Trends in Neurosciences.

[45]  H. Stefánsson,et al.  Identification of a large set of rare complete human knockouts , 2015, Nature Genetics.

[46]  J. Danesh,et al.  Human knockouts in a cohort with a high rate of consanguinity , 2015, bioRxiv.

[47]  G. Montgomery,et al.  Accurate Imputation-Based Screening of Gln368Ter Myocilin Variant in Primary Open-Angle Glaucoma. , 2015, Investigative ophthalmology & visual science.

[48]  Tong Wang,et al.  The Association of GSDMB and ORMDL3 Gene Polymorphisms With Asthma: A Meta-Analysis , 2014, Allergy, asthma & immunology research.

[49]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[50]  Tomaz Berisa,et al.  Detection and interpretation of shared genetic influences on 40 human traits , 2015 .

[51]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[52]  Mitchell J. Machiela,et al.  LDlink: a web-based application for exploring population-specific haplotype structure and linking correlated alleles of possible functional variants , 2015, Bioinform..

[53]  Emily K. Tsang,et al.  Effect of predicted protein-truncating genetic variants on the human transcriptome , 2015, Science.

[54]  M. Feldmann,et al.  IRF5 controls both acute and chronic inflammation , 2015, Proceedings of the National Academy of Sciences.

[55]  Dermot F. Reilly,et al.  Coding variation in ANGPTL4, LPL, and SVEP1 and the risk of coronary disease , 2018 .

[56]  Q. Hamid,et al.  GSDMB induces an asthma phenotype characterized by increased airway responsiveness and remodeling without lung inflammation , 2016, Proceedings of the National Academy of Sciences.

[57]  A protein-truncating R179X variant in RNF186 confers protection against ulcerative colitis , 2016, Nature communications.

[58]  F. Cunningham,et al.  The Ensembl Variant Effect Predictor , 2016, Genome Biology.

[59]  James Y. Zou Analysis of protein-coding genetic variation in 60,706 humans , 2015, Nature.

[60]  Dieter O. Fürst,et al.  Breaking sarcomeres by in vitro exercise , 2016, Scientific Reports.

[61]  Stephen C. J. Parker,et al.  The genetic architecture of type 2 diabetes , 2016, Nature.

[62]  He Zhang,et al.  Trans-ancestry meta-analyses identify rare and common variants associated with blood pressure and hypertension , 2016, Nature Genetics.

[63]  Harry Hemingway,et al.  Health and population effects of rare gene knockouts in adult humans with related parents , 2015, Science.

[64]  C. Tyler-Smith,et al.  Human Knockout Carriers: Dead, Diseased, Healthy, or Improved? , 2016, Trends in molecular medicine.

[65]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[66]  Stephan J Sanders,et al.  Refining the role of de novo protein truncating variants in neurodevelopmental disorders using population reference samples , 2016, Nature Genetics.

[67]  Daniel G. MacArthur,et al.  Human knockouts and phenotypic analysis in a cohort with a high rate of consanguinity , 2017, Nature.

[68]  L. Liang,et al.  Shared Genetic Architecture between Asthma and Allergic Diseases: A Genome-Wide Cross Trait Analysis of 112,000 Individuals from UK Biobank , 2017, bioRxiv.

[69]  D. Gudbjartsson,et al.  A rare IL33 loss-of-function mutation reduces blood eosinophil counts and protects from asthma , 2017, PLoS genetics.

[70]  MicroRNA Markers for Acute Respiratory Distress Syndrome and Shared Genetic Architecture of Asthma With Allergic Diseases: A Genome-Wide Cross Trait Analysis of 112,000 Individuals From UK Biobank , 2017 .

[71]  B. Nordestgaard,et al.  Low LDL cholesterol, PCSK9 and HMGCR genetic variation, and risk of Alzheimer’s disease and Parkinson’s disease: Mendelian randomisation study , 2017, British Medical Journal.

[72]  James T. Elder,et al.  Large scale meta-analysis characterizes genetic architecture for common psoriasis associated variants , 2016, Nature Communications.

[73]  Yaniv Erlich,et al.  Case–control association mapping by proxy using family history of disease , 2017, Nature Genetics.

[74]  Cardiovascular endocrinology: Is ANGPTL3 the next PCSK9? , 2017, Nature Reviews Endocrinology.

[75]  F. Q. Ribeiro The meta-analysis , 2017, Brazilian journal of otorhinolaryngology.

[76]  Jingbo Shang,et al.  Stepwise Distributed Open Innovation Contests for Software Development: Acceleration of Genome-Wide Association Analysis , 2017, GigaScience.

[77]  P. Donnelly,et al.  Genome-wide genetic data on ~500,000 UK Biobank participants , 2017, bioRxiv.

[78]  F. Qadri,et al.  FUT2 non-secretor status is associated with altered susceptibility to symptomatic enterotoxigenic Escherichia coli infection in Bangladeshis , 2017, Scientific Reports.

[79]  J. Danesh,et al.  ANGPTL3 Deficiency and Protection Against Coronary Artery Disease. , 2017, Journal of the American College of Cardiology.

[80]  Fang Liu,et al.  The sponge microbiome project , 2017, GigaScience.

[81]  E. Israel,et al.  A functional splice variant associated with decreased asthma risk abolishes the ability of gasdermin B to induce epithelial cell pyroptosis , 2018, The Journal of allergy and clinical immunology.

[82]  Kari Stefansson,et al.  Genome-wide analyses using UK Biobank data provide insights into the genetic architecture of osteoarthritis , 2018, Nature Genetics.

[83]  Giovanni Malerba,et al.  Refining the accuracy of validated target identification through coding variant fine-mapping in type 2 diabetes , 2017, Nature Genetics.