Whole-genome sequencing analysis of the cardiometabolic proteome

The human proteome is a crucial intermediate between complex diseases and their genetic and environmental components, and an important source of drug development targets and biomarkers. Here, we conduct high-depth (22.5x) whole-genome sequencing (WGS) in 1,328 individuals to fully assess the genetic architecture of 257 circulating protein biomarkers of cardiometabolic relevance. We discover 132 independent sequence variant associations (P < 7.45×10⁻¹¹) across the allele frequency spectrum, including 44 new cis-acting and 11 new trans-acting loci, all of which replicate in an independent cohort (n=1,605, 18.4x WGS). We identify replicating evidence for rare-variant cis-acting protein quantitative trait loci for five genes, involving both coding and non-coding variation. We find causal links between protein biomarkers and cardiovascular, inflammatory and immune-related diseases. We construct and validate polygenic risk scores that explain up to 45% of protein level variation, and find significant correlation between genetically predicted biomarker levels and cardiovascular disease risk in UK Biobank.
