A genome-wide interactome of DNA-associated proteins in the human liver

Large-scale efforts like the Encyclopedia of DNA Elements (ENCODE) Project have made tremendous progress in cataloging the genomic binding patterns of DNA-associated proteins (DAPs), such as transcription factors (TFs). However most chromatin immunoprecipitation-sequencing (ChIP-seq) analyses have focused on a few immortalized cell lines whose activities and physiology deviate in important ways from endogenous cells and tissues. Consequently, binding data from primary human tissue are essential to improving our understanding of in vivo gene regulation. Here we analyze ChIP-seq data for 20 DAPs assayed in two healthy human liver tissue samples, identifying more than 450,000 binding sites. We integrated binding data with transcriptome and phased whole genome data to investigate allelic DAP interactions and the impact of heterozygous sequence variation on the expression of neighboring genes. We find our tissue-based dataset demonstrates binding patterns more consistent with liver biology than cell lines, and describe uses of these data to better prioritize impactful non-coding variation. Collectively, our rich dataset offers novel insights into genome function in healthy liver tissue and provides a valuable research resource for assessing disease-related disruptions.

[1]  R. Young,et al.  Super-Enhancers in the Control of Cell Identity and Disease , 2013, Cell.

[2]  Hani Z. Girgis,et al.  A High-Resolution Enhancer Atlas of the Developing Telencephalon , 2013, Cell.

[3]  Raymond K. Auerbach,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[4]  Wenjie Chen,et al.  GRASP v2.0: an update on the Genome-Wide Repository of Associations between SNPs and phenotypes , 2014, Nucleic Acids Res..

[5]  Tanya M. Teslovich,et al.  Biological, Clinical, and Population Relevance of 95 Loci for Blood Lipids , 2010, Nature.

[6]  A. Fujimoto,et al.  Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer. , 2017, Annals of oncology : official journal of the European Society for Medical Oncology.

[7]  Keith A. Boroevich,et al.  Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer , 2016, Nature Genetics.

[8]  E. Nagoshi,et al.  Dimerization of Sterol Regulatory Element-Binding Protein 2 via the Helix-Loop-Helix-Leucine Zipper Domain Is a Prerequisite for Its Nuclear Localization Mediated by Importin β , 2001, Molecular and Cellular Biology.

[9]  J. McMillan,et al.  GA-binding protein transcription factor: a review of GABP as an integrator of intracellular signaling and protein-protein interactions. , 2004, Blood cells, molecules & diseases.

[10]  Hunter B. Fraser,et al.  Pooled ChIP-Seq Links Variation in Transcription Factor Binding to Complex Disease Risk , 2016, Cell.

[11]  Richard M Myers,et al.  Transposase mediated construction of RNA-seq libraries. , 2012, Genome research.

[12]  R. Lucito,et al.  Reciprocal Interaction of Wnt and RXR-α Pathways in Hepatocyte Development and Hepatocellular Carcinoma , 2015, PloS one.

[13]  J. Carroll,et al.  Pioneer transcription factors: establishing competence for gene expression. , 2011, Genes & development.

[14]  Thomas R. Gingeras,et al.  STAR: ultrafast universal RNA-seq aligner , 2013, Bioinform..

[15]  Adam Ertel,et al.  Pathway-specific differences between tumor cell lines and normal and tumor tissue cells , 2006, Molecular Cancer.

[16]  Yun M. Zhao,et al.  KPNβ1 promotes palmitate-induced insulin resistance via NF-κB signaling in hepatocytes , 2015, Journal of Physiology and Biochemistry.

[17]  P. Ingham,et al.  Mechanisms and functions of Hedgehog signalling across the metazoa , 2011, Nature Reviews Genetics.

[18]  S. Robson,et al.  Nucleosome-Interacting Proteins Regulated by DNA and Histone Methylation , 2010, Cell.

[19]  S. Oliviero,et al.  Sp1 and Sp3 physically interact and co-operate with GABP for the activation of the utrophin promoter. , 2001, Journal of molecular biology.

[20]  Gabor T. Marth,et al.  A global reference for human genetic variation , 2015, Nature.

[21]  Michael Q. Zhang,et al.  Integrative analysis of 111 reference human epigenomes , 2015, Nature.

[22]  Allen D. Delaney,et al.  Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing , 2007, Nature Methods.

[23]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[24]  A. Quinlan BEDTools: The Swiss‐Army Tool for Genome Feature Analysis , 2014, Current protocols in bioinformatics.

[25]  Brian S. Roberts,et al.  CETCh-seq: CRISPR epitope tagging ChIP-seq of DNA-binding proteins , 2015, Genome research.

[26]  Brian S. Roberts,et al.  Distinct gene regulatory programs define the inhibitory effects of liver X receptors and PPARG on cancer cell proliferation , 2016, Genome Medicine.

[27]  R Taub,et al.  Transcriptional up-regulation of the delayed early gene HRS/SRp40 during liver regeneration. Interactions among YY1, GA-binding proteins, and mitogenic signals. , 1998, The Journal of biological chemistry.

[28]  A. Visel,et al.  Rapid and Pervasive Changes in Genome-wide Enhancer Usage during Mammalian Development , 2013, Cell.

[29]  R. Tibshirani,et al.  Significance analysis of microarrays applied to the ionizing radiation response , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[30]  A. Dunning,et al.  Beyond GWASs: illuminating the dark road from association to function. , 2013, American journal of human genetics.

[31]  Thomas Whitington,et al.  Transcription Factor Binding in Human Cells Occurs in Dense Clusters Formed around Cohesin Anchor Sites , 2013, Cell.

[32]  Michael D. Wilson,et al.  Five-Vertebrate ChIP-seq Reveals the Evolutionary Dynamics of Transcription Factor Binding , 2010, Science.

[33]  E. Nora,et al.  CTCF and Cohesin in Genome Folding and Transcriptional Gene Regulation. , 2016, Annual review of genomics and human genetics.

[34]  M. Gerstein,et al.  Role of non-coding sequence variants in cancer , 2016, Nature Reviews Genetics.

[35]  J. Taipale,et al.  The role of enhancers in cancer , 2016, Nature Reviews Cancer.

[36]  Richard M Myers,et al.  Mapping genome-wide transcription factor binding sites in frozen tissues , 2013, Epigenetics & Chromatin.

[37]  Henning Hermjakob,et al.  The Reactome pathway knowledgebase , 2013, Nucleic Acids Res..

[38]  Richard M. Myers,et al.  aRNApipe: a balanced, efficient and distributed pipeline for processing RNA-seq data in high-performance computing environments , 2017, Bioinform..

[39]  A. Visel,et al.  ChIP-Seq identification of weakly conserved heart enhancers , 2010, Nature Genetics.

[40]  Gregory E. Crawford,et al.  The Hypersensitive Glucocorticoid Response Specifically Regulates Period 1 and Expression of Circadian Genes , 2012, Molecular and Cellular Biology.

[41]  R. Sandberg,et al.  Assessment of tumor characteristic gene expression in cell lines using a tissue similarity index (TSI). , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[42]  R. Sauer Protein-DNA interactions , 1991 .

[43]  L. Levin,et al.  Biodiversity on the Rocks: Macrofauna Inhabiting Authigenic Carbonate at Costa Rica Methane Seeps , 2015, PloS one.

[44]  Dongwon Lee,et al.  kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets , 2013, Nucleic Acids Res..

[45]  Guy Karlebach,et al.  Modelling and analysis of gene regulatory networks , 2008, Nature Reviews Molecular Cell Biology.

[46]  Richard M Myers,et al.  Genistein and bisphenol A exposure cause estrogen receptor 1 to bind thousands of sites in a cell type-specific manner , 2012, Genome research.

[47]  T. Meehan,et al.  An atlas of active enhancers across human cell types and tissues , 2014, Nature.

[48]  E. Furlong,et al.  Transcription factors: from enhancer binding to developmental control , 2012, Nature Reviews Genetics.

[49]  Kyoung-Jae Won,et al.  Genetic Variation Determines PPARγ Function and Anti-diabetic Drug Response In Vivo , 2015, Cell.

[50]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[51]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[52]  Michael D. Wilson,et al.  Cooperativity and Rapid Evolution of Cobound Transcription Factors in Closely Related Mammals , 2013, Cell.

[53]  Dongwon Lee,et al.  LS-GKM: a new gkm-SVM for large-scale datasets , 2016, Bioinform..

[54]  S. Batzoglou,et al.  Distribution and intensity of constraint in mammalian genomic sequence. , 2005, Genome research.

[55]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[56]  Verena D. Schmittmann,et al.  Qgraph: Network visualizations of relationships in psychometric data , 2012 .

[57]  Martin S. Taylor,et al.  Mutational Biases Drive Elevated Rates of Substitution at Regulatory Sites across Cancer Types , 2016, PLoS genetics.

[58]  Gail Mandel,et al.  REST: A mammalian silencer protein that restricts sodium channel gene expression to neurons , 1995, Cell.

[59]  Jiang Qian,et al.  Transcription factors as readers and effectors of DNA methylation , 2016, Nature Reviews Genetics.

[60]  B. Deplancke,et al.  The Genetics of Transcription Factor DNA Binding Variation , 2016, Cell.

[61]  Eric Banks,et al.  Tools and best practices for data processing in allelic expression analysis , 2015, Genome Biology.

[62]  Morteza Mohammad Noori,et al.  Enhanced Regulatory Sequence Prediction Using Gapped k-mer Features , 2014, PLoS Comput. Biol..

[63]  Raja Jothi,et al.  GABP controls a critical transcription regulatory module that is essential for maintenance and differentiation of hematopoietic stem/progenitor cells. , 2011, Blood.

[64]  M. Rooman,et al.  Protein–DNA Interactions , 2015 .

[65]  Genevieve Konopka,et al.  HNF4A is essential for specification of hepatic progenitors from human pluripotent stem cells , 2011, Development.

[66]  Benjamin J. Strober,et al.  A method to predict the impact of regulatory variants from DNA sequence , 2015, Nature Genetics.

[67]  Timothy E. Reddy,et al.  Effects of sequence variation on differential allelic transcription factor occupancy and gene expression , 2012, Genome research.

[68]  Jun S. Liu,et al.  The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans , 2015, Science.

[69]  W. Huber,et al.  Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2 , 2014, Genome Biology.

[70]  Mikael Bodén,et al.  MEME Suite: tools for motif discovery and searching , 2009, Nucleic Acids Res..

[71]  Nam-Kyung Yu,et al.  CTCF as a multifunctional protein in genome regulation and gene expression , 2015, Experimental & Molecular Medicine.

[72]  C. Martinez-Jimenez,et al.  Hepatocyte Nuclear Factor 4 (cid:1) Coordinates a Transcription Factor Network Regulating Hepatic Fatty Acid Metabolism (cid:1) , 2010 .

[73]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.