Genome-wide Association Study Of Plasma Proteins Identifies Putatively Causal Genes, Proteins, And Pathways For Cardiovascular Disease

Identifying genetic variants associated with circulating protein concentrations (pQTLs) and integrating them with variants from genome-wide association studies (GWAS) may illuminate the proteome’s causal role in disease and bridge a GWAS knowledge gap for hitherto unexplained SNP-disease associations. We conducted GWAS of 71 high-value proteins for cardiovascular disease in 6,861 Framingham Heart Study participants followed by external replication. We comprehensively mapped thousands of pQTLs, including functional annotations and clinical-trait associations, and created an integrated plasma-protein-QTL searchable database. We next identified 15 proteins with pQTLs coinciding with coronary heart disease (CHD)-related variants from GWAS or tested causal for CHD by Mendelian randomization; most of these proteins were associated with new-onset cardiovascular disease events in Framingham participants with long-term follow-up. Identifying pQTLs and integrating them with GWAS results yields insights into genes, proteins, and pathways that may be causally associated with disease and can serve as therapeutic targets for treatment and prevention.

[1]  David S. Wishart,et al.  DrugBank 4.0: shedding new light on drug metabolism , 2013, Nucleic Acids Res..

[2]  Magda Tsolaki,et al.  Identification of cis-regulatory variation influencing protein abundance levels in human plasma. , 2012, Human molecular genetics.

[3]  Stephen Burgess,et al.  Combining information on multiple instrumental variables in Mendelian randomization: comparison of allele score and summarized data methods , 2015, Statistics in medicine.

[4]  Steven P. Gygi,et al.  Defining the consequences of genetic variation on a proteome-wide scale , 2016, Nature.

[5]  E. Boerwinkle,et al.  Quantitative Influence of ABO Blood Groups on Factor VIII and Its Ratio to von Willebrand Factor, Novel Observations from an ARIC Study of 11,673 Subjects , 2015, PloS one.

[6]  C. Carty,et al.  Association of genetic variation in serum amyloid-A with cardiovascular disease and interactions with IL6, IL1RN, IL1beta and TNF genes in the Cardiovascular Health Study. , 2009, Journal of atherosclerosis and thrombosis.

[7]  Olle Melander,et al.  From noncoding variant to phenotype via SORT1 at the 1p13 cholesterol locus , 2010, Nature.

[8]  Ruedi Aebersold,et al.  Quantitative variability of 342 plasma proteins in a human twin population , 2015 .

[9]  J. Pankow,et al.  ABO blood group associations with markers of endothelial dysfunction in the Multi-Ethnic Study of Atherosclerosis. , 2016, Atherosclerosis.

[10]  Wenjie Chen,et al.  GRASP v2.0: an update on the Genome-Wide Repository of Associations between SNPs and phenotypes , 2014, Nucleic Acids Res..

[11]  A. Morris,et al.  Mapping of 79 loci for 83 plasma protein biomarkers in cardiovascular disease , 2017, PLoS genetics.

[12]  J. Hirschhorn,et al.  Biological interpretation of genome-wide association studies using predicted gene functions , 2015, Nature Communications.

[13]  Leigh Anderson,et al.  Candidate‐based proteomics in the search for biomarkers of cardiovascular disease , 2005, The Journal of physiology.

[14]  L. Cupples,et al.  Lipoprotein(a) levels, apo(a) isoform size, and coronary heart disease risk in the Framingham Offspring Study , 2011, Journal of Lipid Research.

[15]  P. Flicek,et al.  The Ensembl Regulatory Build , 2015, Genome Biology.

[16]  C. Gieger,et al.  KORA-gen - Resource for Population Genetics, Controls and a Broad Spectrum of Disease Phenotypes , 2005 .

[17]  Xiang Ma,et al.  Polymorphisms in the SAA1/2 Gene Are Associated with Carotid Intima Media Thickness in Healthy Han Chinese Subjects: The Cardiovascular Risk Survey , 2010, PloS one.

[18]  J. Danesh,et al.  A comprehensive 1000 Genomes-based genome-wide association meta-analysis of coronary artery disease , 2016 .

[19]  J. Manson,et al.  Novel proteins associated with risk for coronary heart disease or stroke among postmenopausal women identified by in-depth plasma proteome profiling , 2010, Genome Medicine.

[20]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[21]  Jennifer G. Robinson,et al.  Association of low-frequency and rare coding-sequence variants with blood lipids and coronary heart disease in 56,000 whites and blacks. , 2014, American journal of human genetics.

[22]  T. Dawber,et al.  Epidemiological approaches to heart disease: the Framingham Study. , 1951, American journal of public health and the nation's health.

[23]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[24]  K. Frazer,et al.  Associations Between Common and Rare Exonic Genetic Variants and Serum Levels of 20 Cardiovascular-Related Proteins , 2016, Circulation. Cardiovascular genetics.

[25]  P. Wadhwa,et al.  Validation and comparison of luminex multiplex cytokine analysis kits with ELISA: determinations of a panel of nine cytokines in clinical sample culture supernatants. , 2005, Journal of reproductive immunology.

[26]  J. Rotter,et al.  Association between serum amyloid A proteins and coronary artery disease: evidence from two distinct arteriosclerotic processes. , 1997, Circulation.

[27]  Andrew D. Johnson,et al.  Gene Expression Signatures of Coronary Heart Disease , 2013, Arteriosclerosis, thrombosis, and vascular biology.

[28]  Asaf Bitton,et al.  Growing epidemic of coronary heart disease in low- and middle-income countries. , 2010, Current problems in cardiology.

[29]  N. Eriksen,et al.  Amyloid protein SAA is associated with high density lipoprotein from human serum. , 1977, Proceedings of the National Academy of Sciences of the United States of America.

[30]  D. Posthuma,et al.  17 FUMA: FUNCTIONAL MAPPING AND ANNOTATION OF GENETIC ASSOCIATIONS , 2017, European Neuropsychopharmacology.

[31]  B. Nordestgaard,et al.  Lipoprotein(a) and risk of myocardial infarction – genetic epidemiologic evidence of causality , 2011, Scandinavian journal of clinical and laboratory investigation.

[32]  Stefan Enroth,et al.  Identification of genetic variants influencing the human plasma proteome , 2013, Proceedings of the National Academy of Sciences.

[33]  Qiong Yang,et al.  The Third Generation Cohort of the National Heart, Lung, and Blood Institute's Framingham Heart Study: design, recruitment, and initial examination. , 2007, American journal of epidemiology.

[34]  W. Kannel,et al.  The Framingham Offspring Study. Design and preliminary data. , 1975, Preventive medicine.

[35]  P. Schnohr,et al.  Association of mutations in the apolipoprotein B gene with hypercholesterolemia and the risk of ischemic heart disease. , 1998, The New England journal of medicine.

[36]  S. Ebrahim,et al.  Mendelian randomization: prospects, potentials, and limitations. , 2004, International journal of epidemiology.

[37]  Mark I. McCarthy,et al.  A Genome-Wide Association Study Identifies Protein Quantitative Trait Loci (pQTLs) , 2008, PLoS genetics.

[38]  Ulf Gyllensten,et al.  Strong effects of genetic and lifestyle factors on biomarker variation and use of personalized cutoffs , 2014, Nature Communications.

[39]  Leopold Parts,et al.  A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies , 2010, PLoS Comput. Biol..

[40]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[41]  N. Mehta Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. , 2011, Circulation. Cardiovascular genetics.

[42]  P. Madsen,et al.  Sort1, encoded by the cardiovascular risk locus 1p13.3, is a regulator of hepatic lipoprotein export. , 2010, Cell metabolism.

[43]  A. Cohen-Solal,et al.  Soluble CD146, a new endothelial biomarker of acutely decompensated heart failure. , 2015, International journal of cardiology.

[44]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[45]  Roby Joehanes,et al.  Integrated genome-wide analysis of expression quantitative trait loci aids interpretation of genomic association studies , 2017, Genome Biology.

[46]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[47]  Manolis Kellis,et al.  HaploReg v4: systematic mining of putative causal variants, cell types, regulators and target genes for human complex traits and disease , 2015, Nucleic Acids Res..

[48]  Christian Gieger,et al.  Connecting genetic risk to disease end points through the human blood plasma proteome , 2016, Nature Communications.

[49]  J. González-Porras,et al.  A nonsense polymorphism in the protein Z-dependent protease inhibitor increases the risk for venous thrombosis. , 2006, Blood.

[50]  Xiyun Yan,et al.  Adhesion Molecule CD146 and its Soluble Form Correlate Well with Carotid Atherosclerosis and Plaque Instability , 2014, CNS neuroscience & therapeutics.

[51]  D. Vignali,et al.  Simultaneous quantitation of 15 cytokines using a multiplexed flow cytometric assay. , 1999, Journal of immunological methods.

[52]  M. Adaş,et al.  The relationship between early atherosclerosis and endothelial dysfunction in type 1 diabetic patients as evidenced by measurement of carotid intima-media thickness and soluble CD146 levels: a cross sectional study , 2013, Cardiovascular Diabetology.

[53]  Jagmeet P. Singh,et al.  Soluble CD146 Is a Novel Marker of Systemic Congestion in Heart Failure Patients: An Experimental Mechanistic and Transcardiac Clinical Study. , 2017, Clinical chemistry.

[54]  Richard Leslie,et al.  GRASP: analysis of genotype-phenotype results from 1390 genome-wide association studies and corresponding open access database , 2014, Bioinform..

[55]  D. Raptis,et al.  Pancreatic stone protein/regenerating protein (PSP/reg): a novel secreted protein up-regulated in type 2 diabetes mellitus , 2015, Endocrine.

[56]  A. Mebazaa,et al.  Plasma Levels of Soluble CD146 Reflect the Severity of Pulmonary Congestion Better Than Brain Natriuretic Peptide in Acute Coronary Syndrome , 2016, Annals of laboratory medicine.

[57]  M. Brown,et al.  Receptor-mediated control of cholesterol metabolism. , 1976, Science.

[58]  Christian Gieger,et al.  The Role of Adiposity in Cardiometabolic Traits: A Mendelian Randomization Analysis , 2013, BDJ.

[59]  S. Ebrahim,et al.  'Mendelian randomization': can genetic epidemiology contribute to understanding environmental determinants of disease? , 2003, International journal of epidemiology.

[60]  Eric Boerwinkle,et al.  Best Practices and Joint Calling of the HumanExome BeadChip: The CHARGE Consortium , 2013, PloS one.

[61]  Deanna M. Church,et al.  ClinVar: public archive of relationships among sequence variation and human phenotype , 2013, Nucleic Acids Res..

[62]  R. Durbin,et al.  Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses , 2012, Nature Protocols.

[63]  Christopher D. Brown,et al.  Identification and Validation of Genetic Variants that Influence Transcription Factor and Cell Signaling Protein Levels , 2014, American journal of human genetics.

[64]  Andrew D. Johnson,et al.  A Meta-analysis of Gene Expression Signatures of Blood Pressure and Hypertension , 2015, PLoS genetics.

[65]  W James Gauderman,et al.  Sample size requirements for matched case‐control studies of gene–environment interaction , 2002, Statistics in medicine.

[66]  W. Kannel,et al.  ABO blood group and cardiovacular disease: the Framingham study. , 1976, Atherosclerosis.