C-reactive protein upregulates the whole blood expression of CD59 - an integrative analysis

Elevated C-reactive protein (CRP) concentrations in the blood are associated with acute and chronic infections and inflammation. Nevertheless, the functional role of increased CRP in multiple bacterial and viral infections as well as in chronic inflammatory diseases remains unclear. Here, we studied the relationship between CRP and gene expression levels in the blood in 491 individuals from the Estonian Biobank cohort, to elucidate the role of CRP in these inflammatory mechanisms. As a result, we identified a set of 1,614 genes associated with changes in CRP levels with a high proportion of interferon-stimulated genes. Further, we performed likelihood-based causality model selection and Mendelian randomization analysis to discover causal links between CRP and the expression of CRP-associated genes. Strikingly, our computational analysis and cell culture stimulation assays revealed increased CRP levels to drive the expression of complement regulatory protein CD59, suggesting CRP to have a critical role in protecting blood cells from the adverse effects of the immune defence system. Our results show the benefit of integrative analysis approaches in hypothesis-free uncovering of causal relationships between traits.

[1]  J. Castle,et al.  An integrative genomics approach to infer causal associations between gene expression and disease , 2005, Nature Genetics.

[2]  Gaurav Bhatia,et al.  Fast and accurate imputation of summary statistics enhances evidence of functional enrichment , 2013, Bioinform..

[3]  Tom Fiers,et al.  Agglutination of intravenously administered phosphatidylcholine-containing lipid emulsions with serum C-reactive protein. , 2013, Nutrition in clinical practice : official publication of the American Society for Parenteral and Enteral Nutrition.

[4]  Thomas Illig,et al.  FADS genotypes and desaturase activity estimated by the ratio of arachidonic acid to linoleic acid are associated with inflammation and coronary artery disease. , 2008, The American journal of clinical nutrition.

[5]  M. Peters,et al.  Systematic identification of trans eQTLs as putative drivers of known disease associations , 2013, Nature Genetics.

[6]  Matthew E. Ritchie,et al.  limma powers differential expression analyses for RNA-sequencing and microarray studies , 2015, Nucleic acids research.

[7]  Richard D. Weisel,et al.  C-Reactive Protein Upregulates Complement-Inhibitory Factors in Endothelial Cells , 2004, Circulation.

[8]  Ritsert C. Jansen,et al.  MixupMapper: correcting sample mix-ups in genome-wide datasets increases power to detect small genetic effects , 2011, Bioinform..

[9]  C. Rice,et al.  Interferon-stimulated genes: a complex web of host defenses. , 2014, Annual review of immunology.

[10]  James T. Elder,et al.  Investigating the Causal Relationship of C-Reactive Protein with 32 Complex Somatic and Psychiatric Outcomes: A Large-Scale Cross-Consortium Mendelian Randomization Study , 2016, PLoS medicine.

[11]  Anders Albrechtsen,et al.  Greenlandic Inuit show genetic signatures of diet and climate adaptation , 2015, Science.

[12]  Eric S. Lander,et al.  Comprehensive population-based genome sequencing provides insight into hematopoietic regulatory mechanisms , 2016, Proceedings of the National Academy of Sciences.

[13]  Jaak Vilo,et al.  g:Profiler—a web server for functional interpretation of gene lists (2011 update) , 2011, Nucleic Acids Res..

[14]  Sue Povey,et al.  Nephrocalcinosis (Enamel Renal Syndrome) Caused by Autosomal Recessive FAM20A Mutations , 2013, Nephron Physiology.

[15]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[16]  P. Visscher,et al.  Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets , 2016, Nature Genetics.

[17]  David R. Anderson,et al.  Multimodel Inference , 2004 .

[18]  Hongyu Wu,et al.  Genetic loci associated with circulating phospholipid trans fatty acids: a meta-analysis of genome-wide association studies from the CHARGE Consortium. , 2015, The American journal of clinical nutrition.

[19]  Eric E Schadt,et al.  Disentangling molecular relationships with a causal inference test , 2009, BMC Genetics.

[20]  Steve Horvath,et al.  Using genetic markers to orient the edges in quantitative trait networks: The NEO software , 2008, BMC Systems Biology.

[21]  Tanya M. Teslovich,et al.  Common variants associated with plasma triglycerides and risk for coronary artery disease , 2013, Nature Genetics.

[22]  Peggy Hall,et al.  The NHGRI GWAS Catalog, a curated resource of SNP-trait associations , 2013, Nucleic Acids Res..

[23]  B. Yandell,et al.  Inferring Causal Phenotype Networks From Segregating Populations , 2008, Genetics.

[24]  A. Butterworth,et al.  Mendelian Randomization Analysis With Multiple Genetic Variants Using Summarized Data , 2013, Genetic epidemiology.

[25]  Dylan S. Small,et al.  A review of instrumental variable estimators for Mendelian randomization , 2015, Statistical methods in medical research.

[26]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[27]  A. G. de la Fuente,et al.  Gene Network Inference via Structural Equation Modeling in Genetical Genomics Experiments , 2008, Genetics.

[28]  Emmanouil T. Dermitzakis,et al.  Fast and efficient QTL mapper for thousands of molecular phenotypes , 2015, bioRxiv.

[29]  M. Fornage,et al.  Genetic Loci Associated with Plasma Phospholipid n-3 Fatty Acids: A Meta-Analysis of Genome-Wide Association Studies from the CHARGE Consortium , 2011, PLoS genetics.

[30]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[31]  Björn Usadel,et al.  Trimmomatic: a flexible trimmer for Illumina sequence data , 2014, Bioinform..

[32]  Irving Kushner,et al.  C-reactive Protein* , 2004, Journal of Biological Chemistry.

[33]  Gongxiong Wu,et al.  Complement Regulator CD59 Protects Against Atherosclerosis by Restricting the Formation of Complement Membrane Attack Complex , 2009, Circulation research.

[34]  John D. Storey,et al.  Cause and express , 2009, Nature Biotechnology.

[35]  P. Visscher,et al.  Conditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traits , 2012, Nature Genetics.

[36]  Jacek Majewski,et al.  The study of eQTL variations by RNA-seq: from SNPs to phenotypes. , 2011, Trends in genetics : TIG.

[37]  P. Elliott,et al.  Meta-Analysis of Genome-Wide Association Studies in >80 000 Subjects Identifies Multiple Loci for C-Reactive Protein Levels , 2011, Circulation.

[38]  N Rifai,et al.  Novel risk factors for systemic atherosclerosis: a comparison of C-reactive protein, fibrinogen, homocysteine, lipoprotein(a), and standard cholesterol screening as predictors of peripheral arterial disease. , 2001, JAMA.

[39]  Jeffrey C Murray,et al.  Whole-Exome sequencing identifies FAM20A mutations as a cause of amelogenesis imperfecta and gingival hyperplasia syndrome. , 2011, American journal of human genetics.

[40]  D L Simmons,et al.  CD59, an LY-6-like protein expressed in human lymphoid cells, regulates the action of the complement membrane attack complex on homologous cells , 1989, The Journal of experimental medicine.

[41]  M. Robinson,et al.  A scaling normalization method for differential expression analysis of RNA-seq data , 2010, Genome Biology.

[42]  T. Lehtimäki,et al.  Integrative approaches for large-scale transcriptome-wide association studies , 2015, Nature Genetics.

[43]  R. Mägi,et al.  Cohort Profile Cohort Profile : Estonian Biobank of the Estonian Genome Center , University of Tartu , 2015 .

[44]  Hedi Peterson,et al.  g:Profiler—a web server for functional interpretation of gene lists (2016 update) , 2016, Nucleic Acids Res..

[45]  Martin J. Aryee,et al.  Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in Rheumatoid Arthritis , 2013, Nature Biotechnology.

[46]  P. Visscher,et al.  Calculating statistical power in Mendelian randomization studies. , 2013, International journal of epidemiology.

[47]  J. Manson,et al.  C-reactive protein, interleukin 6, and risk of developing type 2 diabetes mellitus. , 2001, JAMA.

[48]  Mark D. Robinson,et al.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data , 2009, Bioinform..

[49]  G. Davey Smith,et al.  Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression , 2015, International journal of epidemiology.

[50]  J. Witztum,et al.  C-reactive protein binds to both oxidized LDL and apoptotic cells through recognition of a common ligand: Phosphorylcholine of oxidized phospholipids , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[51]  Christian Gieger,et al.  Loci influencing lipid levels and coronary heart disease risk in 16 European population cohorts , 2009, Nature Genetics.

[52]  Seoae Cho,et al.  RNA-seq analysis for detecting quantitative trait-associated genes , 2016, Scientific Reports.

[53]  angesichts der Corona-Pandemie,et al.  UPDATE , 1973, The Lancet.

[54]  Carson C Chow,et al.  Second-generation PLINK: rising to the challenge of larger and richer datasets , 2014, GigaScience.

[55]  Yair Anikster,et al.  CD59 deficiency is associated with chronic hemolysis and childhood relapsing immune-mediated polyneuropathy. , 2013, Blood.

[56]  Elias Chaibub Neto,et al.  Modeling Causality for Pairs of Phenotypes in System Genetics , 2013, Genetics.

[57]  Lale Tokgözoğlu,et al.  Low-density lipoproteins cause atherosclerotic cardiovascular disease. 1. Evidence from genetic, epidemiologic, and clinical studies. A consensus statement from the European Atherosclerosis Society Consensus Panel , 2017, European heart journal.

[58]  Paul M. Ridker,et al.  Prospective Study of C-Reactive Protein, Homocysteine, and Plasma Lipid Levels as Predictors of Sudden Cardiac Death , 2002, Circulation.

[59]  S. Meri,et al.  Distribution of protectin (CD59), a complement membrane attack inhibitor, in normal human tissues. , 1991, Laboratory investigation; a journal of technical methods and pathology.

[60]  Leopold Parts,et al.  A Bayesian Framework to Account for Complex Non-Genetic Factors in Gene Expression Levels Greatly Increases Power in eQTL Studies , 2010, PLoS Comput. Biol..

[61]  Simon Yu,et al.  INTERFEROME v2.0: an updated database of annotated interferon-regulated genes , 2012, Nucleic Acids Res..

[62]  Charity W. Law,et al.  voom: precision weights unlock linear model analysis tools for RNA-seq read counts , 2014, Genome Biology.

[63]  T. Frayling,et al.  C-reactive protein levels and body mass index: Elucidating direction of causation through reciprocal Mendelian randomization , 2010, International Journal of Obesity.

[64]  M. Ritchie,et al.  Methods of integrating data to uncover genotype–phenotype interactions , 2015, Nature Reviews Genetics.

[65]  John D. Storey,et al.  Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis , 2007, PLoS genetics.

[66]  Carmen W van den Berg,et al.  Letter regarding article by Li et al, "C-reactive protein upregulates complement-inhibitory factors in endothelial cells.". , 2004, Circulation.

[67]  Ayellet V. Segrè,et al.  Integrative Genomics Reveals Novel Molecular Pathways and Gene Networks for Coronary Artery Disease , 2014, PLoS genetics.

[68]  Alexey I. Nesvizhskii,et al.  Reconstructing targetable pathways in lung cancer by integrating diverse omics data , 2013, Nature Communications.

[69]  Keith Shockley,et al.  Structural Model Analysis of Multiple Quantitative Traits , 2006, PLoS genetics.

[70]  B. Yandell,et al.  CAUSAL GRAPHICAL MODELS IN SYSTEMS GENETICS: A UNIFIED FRAMEWORK FOR JOINT INFERENCE OF CAUSAL NETWORK AND GENETIC ARCHITECTURE FOR CORRELATED PHENOTYPES. , 2010, The annals of applied statistics.