Analysis of putative cis-regulatory elements regulating blood pressure variation

Hundreds of loci have been associated with blood pressure traits from many genome-wide association studies. We identified an enrichment of these loci in aorta and tibial artery expression quantitative trait loci in our previous work in ∼100,000 Genetic Epidemiology Research on Aging (GERA) study participants. In the present study, we subsequently focused on determining putative regulatory regions for these and other tissues of relevance to blood pressure, to both fine-map these loci by pinpointing genes and variants of functional interest within them, and to identify any novel genes. We constructed maps of putative cis-regulatory elements using publicly available open chromatin data for the heart, aorta and tibial arteries, and multiple kidney cell types. Sequence variants within these regions may be evaluated quantitatively for their tissue- or cell-type-specific regulatory impact using deltaSVM functional scores, as described in our previous work. In order to identify genes of interest, we aggregate these variants in these putative cis-regulatory elements within 50Kb of the start or end of genes considered as “expressed” in these tissues or cell types using publicly available gene expression data, and use the deltaSVM scores as weights in the well-known group-wise sequence kernel association test (SKAT). We test for association with both blood pressure traits as well as expression within these tissues or cell types of interest, and identify several genes, including MTHFR, C10orf32, CSK, NOV, ULK4, SDCCAG8, SCAMP5, RPP25, HDGFRP3, VPS37B, and PPCDC. Although our study centers on blood pressure traits, we additionally examined two known genes, SCN5A and NOS1AP involved in the cardiac trait QT interval, in the Atherosclerosis Risk in Communities Study (ARIC), as a positive control, and observed an expected heart-specific effect. Thus, our method may be used to identify variants and genes for further functional testing using tissue- or cell-type-specific putative regulatory information. Author Summary Sequence change in genes (“variants”) are linked to the presence and severity of different traits or diseases. However, as genes may be expressed in different tissues and at different times and degrees, using this information is expected to more accurately identify genes of interest. Variants within the genes are essential, but also in the sequences (“regulatory elements”) that control the genes’ expression in different tissues or cell types. In this study, we aim to use this information about expression and variants potentially involved in gene expression regulation to better pinpoint genes and variants in regulatory elements of interest for blood pressure regulation. We do so by taking advantage of such data that are publicly available, and use methods to combine information about variants in aggregate within a gene’s putative regulatory elements in tissues thought to be relevant for blood pressure, and identify several genes, meant to enable experimental follow-up.

[1]  U. Alon An introduction to systems biology : design principles of biological circuits , 2019 .

[2]  Christian Gieger,et al.  Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits , 2018, Nature Genetics.

[3]  A. Chakravarti,et al.  Human cardiac cis-regulatory elements, their cognate transcription factors, and regulatory DNA sequence variants , 2018, Genome research.

[4]  S. Ahn,et al.  Genetic Programming of Hypertension , 2018, Front. Pediatr..

[5]  Nicola J. Rinaldi,et al.  Genetic effects on gene expression across human tissues , 2017, Nature.

[6]  Mary Goldman,et al.  Exploring the phenotypic consequences of tissue specific gene expression variation inferred from GWAS summary statistics , 2016, Nature Communications.

[7]  Yang I Li,et al.  An Expanded View of Complex Traits: From Polygenic to Omnigenic , 2017, Cell.

[8]  J. Pankow,et al.  Genome-wide association study of 1,5-anhydroglucitol identifies novel genetic loci linked to glucose metabolism , 2017, Scientific Reports.

[9]  C. Xiao,et al.  Association of CYP17A1 Genetic Polymorphisms and Susceptibility to Essential Hypertension in the Southwest Han Chinese Population , 2017, Medical science monitor : international medical journal of experimental and clinical research.

[10]  Stephanie A. Bien,et al.  Fine mapping of QT interval regions in global populations refines previously identified QT interval loci and identifies signals unique to African and Hispanic descent populations. , 2017, Heart rhythm.

[11]  Y. Teo,et al.  Genome-Wide Association Study Meta-Analysis of Long-Term Average Blood Pressure in East Asians , 2017, Circulation. Cardiovascular genetics.

[12]  N. Risch,et al.  Genome-wide association analyses using electronic health records identify new loci influencing blood pressure variation , 2016, Nature Genetics.

[13]  GTEx consortium Genetic effects on gene expression across human tissues , 2017 .

[14]  E. Boerwinkle,et al.  Genetic Variants in LRP1 and ULK4 Are Associated with Acute Aortic Dissections. , 2016, American journal of human genetics.

[15]  Dongwon Lee,et al.  LS-GKM: a new gkm-SVM for large-scale datasets , 2016, Bioinform..

[16]  E. Boerwinkle,et al.  Rare coding TTN variants are associated with electrocardiographic QT interval in the general population , 2016, Scientific Reports.

[17]  B. Oh,et al.  Gene Silencing and Haploinsufficiency of Csk Increase Blood Pressure , 2016, PloS one.

[18]  Eric Haugen,et al.  Large-scale identification of sequence variants impacting human transcription factor occupancy in vivo , 2015, Nature Genetics.

[19]  Yakir A Reshef,et al.  Partitioning heritability by functional annotation using genome-wide association summary statistics , 2015, Nature Genetics.

[20]  E. Boerwinkle,et al.  Direct Estimates of the Genomic Contributions to Blood Pressure Heritability within a Population-Based Cohort (ARIC) , 2015, PloS one.

[21]  Kaanan P. Shah,et al.  A gene-based association method for mapping traits using reference transcriptome data , 2015, Nature Genetics.

[22]  Teresa A. Webster,et al.  Genotyping Informatics and Quality Control for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort , 2015, Genetics.

[23]  C. Sabatti,et al.  Characterizing Race/Ethnicity and Genetic Ancestry for 100,000 Subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) Cohort , 2015, Genetics.

[24]  J. Pankow,et al.  Genetics of Plasma Soluble Receptor for Advanced Glycation End-Products and Cardiovascular Outcomes in a Community-based Population: Results from the Atherosclerosis Risk in Communities Study , 2015, PloS one.

[25]  Benjamin J. Strober,et al.  A method to predict the impact of regulatory variants from DNA sequence , 2015, Nature Genetics.

[26]  P. Elliott,et al.  UK Biobank: An Open Access Resource for Identifying the Causes of a Wide Range of Complex Diseases of Middle and Old Age , 2015, PLoS medicine.

[27]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[28]  T. Paus,et al.  CYP17A1 and Blood Pressure Reactivity to Stress in Adolescence , 2015, International journal of hypertension.

[29]  Neva C. Durand,et al.  A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping , 2014, Cell.

[30]  S. Bekker-Jensen,et al.  Renal-retinal ciliopathy gene Sdccag8 regulates DNA damage response signaling. , 2014, Journal of the American Society of Nephrology : JASN.

[31]  A. Wilde,et al.  The Measurement of the QT Interval , 2014, Current cardiology reviews.

[32]  Morteza Mohammad Noori,et al.  Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features , 2014, PLoS Comput. Biol..

[33]  Michael J Ackerman,et al.  Nature Genetics Advance Online Publication Genetic Association Study of Qt Interval Highlights Role for Calcium Signaling Pathways in Myocardial Repolarization , 2022 .

[34]  Jennifer E. Phillips-Cremins Unraveling architecture of the pluripotent genome. , 2014, Current opinion in cell biology.

[35]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[36]  Tom R. Gaunt,et al.  Loci influencing blood pressure identified using a cardiovascular gene-centric array. , 2013, Human molecular genetics.

[37]  E. Boerwinkle,et al.  Effects of Rare and Common Blood Pressure Gene Variants on Essential Hypertension: Results From the Family Blood Pressure Program, CLUE, and Atherosclerosis Risk in Communities Studies , 2013, Circulation research.

[38]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[39]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[40]  Jesse R. Dixon,et al.  Topological Domains in Mammalian Genomes Identified by Analysis of Chromatin Interactions , 2012, Nature.

[41]  W. Niu,et al.  Strong association of methylenetetrahydrofolate reductase gene C677T polymorphism with hypertension and hypertension-in-pregnancy in Chinese: a meta-analysis , 2012, Journal of Human Hypertension.

[42]  Pui-Yan Kwok,et al.  Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm. , 2011, Genomics.

[43]  Jean Tichet,et al.  Blood pressure loci identified with a gene-centric array. , 2011, American journal of human genetics.

[44]  Christian Gieger,et al.  Genome-wide association study identifies six new loci influencing pulse pressure and mean arterial pressure , 2011, Nature Genetics.

[45]  Christian Gieger,et al.  Genetic Variants in Novel Pathways Influence Blood Pressure and Cardiovascular Disease Risk , 2011, Nature.

[46]  Simon Cawley,et al.  Next generation genome-wide association tool: design and coverage of a high-throughput European-optimized SNP array. , 2011, Genomics.

[47]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[48]  S. Mane,et al.  K+ Channel Mutations in Adrenal Aldosterone-Producing Adenomas and Hereditary Hypertension , 2011, Science.

[49]  E. Davidson Emerging properties of animal gene regulatory networks , 2010, Nature.

[50]  Josyf Mychaleckyj,et al.  Robust relationship inference in genome-wide association studies , 2010, Bioinform..

[51]  R. Bellazzi,et al.  Polymorphisms in the NOS1AP gene modulate QT interval duration and risk of arrhythmias in the long QT syndrome. , 2010, Journal of the American College of Cardiology.

[52]  Xia Yang,et al.  Integrating pathway analysis and genetics of gene expression for genome-wide association studies. , 2010, American journal of human genetics.

[53]  H. Kawamura,et al.  CCN3 Inhibits Neointimal Hyperplasia Through Modulation of Smooth Muscle Cell Growth and Migration , 2010, Arteriosclerosis, thrombosis, and vascular biology.

[54]  Joseph K. Pickrell,et al.  Understanding mechanisms underlying human gene expression variation with RNA sequencing , 2010, Nature.

[55]  P. O’Reilly,et al.  Genome-wide association study identifies eight loci associated with blood pressure , 2009, Nature Genetics.

[56]  Andrew D. Johnson,et al.  Genome-wide association study of blood pressure and hypertension , 2009, Nature Genetics.

[57]  P. O’Reilly,et al.  Genome-wide association study identifies eight loci associated with blood pressure , 2009, Nature Genetics.

[58]  Brad T. Sherman,et al.  Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists , 2008, Nucleic acids research.

[59]  H. Stefánsson,et al.  Genetics of gene expression and its effect on disease , 2008, Nature.

[60]  E. Segal,et al.  Predicting expression patterns from regulatory sequence in Drosophila segmentation , 2008, Nature.

[61]  Brad T. Sherman,et al.  Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources , 2008, Nature Protocols.

[62]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[63]  Christian Gieger,et al.  A common genetic variant in the NOS1 regulator NOS1AP modulates cardiac repolarization , 2006, Nature Genetics.

[64]  Marc W Kirschner,et al.  The Meaning of Systems Biology , 2005, Cell.

[65]  L. Lau,et al.  Integrin-dependent Functions of the Angiogenic Inducer NOV (CCN3) , 2005, Journal of Biological Chemistry.

[66]  D. Levy,et al.  QT interval is a heritable quantitative trait with evidence of linkage to chromosome 3 in a genome-wide linkage analysis: The Framingham Heart Study. , 2005, Heart rhythm.

[67]  J. Ioannidis,et al.  Association of C677T polymorphism in the methylenetetrahydrofolate reductase gene with hypertension in pregnancy and pre-eclampsia: a meta-analysis , 2004, Journal of hypertension.

[68]  A. Folsom,et al.  Heart rate-corrected QT interval prolongation predicts risk of coronary heart disease in black and white middle-aged men and women: the ARIC study. , 2004, Journal of the American College of Cardiology.

[69]  L. Lau,et al.  CCN3 (NOV) Is a Novel Angiogenic Regulator of the CCN Protein Family* , 2003, Journal of Biological Chemistry.

[70]  Ali G. Gharavi,et al.  Molecular Mechanisms of Human Hypertension , 2001, Cell.

[71]  H. Knoblauch,et al.  QT interval is linked to 2 long-QT syndrome loci in normal subjects. , 1999, Circulation.

[72]  J. Higaki,et al.  Methylenetetrahydrofolate reductase gene polymorphism: relation to blood pressure and cerebrovascular disease. , 1998, American journal of hypertension.

[73]  R. Lifton Molecular Genetics of Human Blood Pressure Variation , 1996, Science.

[74]  R. Matthews,et al.  A candidate genetic risk factor for vascular disease: a common mutation in methylenetetrahydrofolate reductase , 1995, Nature Genetics.

[75]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[76]  Aric Invest The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators , 1989 .

[77]  A. Folsom,et al.  The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. The ARIC investigators. , 1989, American journal of epidemiology.

[78]  D. Solomon,et al.  Hypogonadism and mineralocorticoid excess. The 17-hydroxylase deficiency syndrome. , 1967, The New England journal of medicine.