Cistrome Data Browser: expanded datasets and new tools for gene regulatory analysis

Abstract The Cistrome Data Browser (DB) is a resource of human and mouse cis-regulatory information derived from ChIP-seq, DNase-seq and ATAC-seq chromatin profiling assays, which map the genome-wide locations of transcription factor binding sites, histone post-translational modifications and regions of chromatin accessible to endonuclease activity. Currently, the Cistrome DB contains approximately 47,000 human and mouse samples with about 24,000 newly collected datasets compared to the previous release two years ago. Furthermore, the Cistrome DB has a new Toolkit module with several features that allow users to better utilize the large-scale ChIP-seq, DNase-seq, and ATAC-seq data. First, users can query the factors which are likely to regulate a specific gene of interest. Second, the Cistrome DB Toolkit facilitates searches for factor binding, histone modifications, and chromatin accessibility in any given genomic interval shorter than 2Mb. Third, the Toolkit can determine the most similar ChIP-seq, DNase-seq, and ATAC-seq samples in terms of genomic interval overlaps with user-provided genomic interval sets. The Cistrome DB is a user-friendly, up-to-date, and well maintained resource, and the new tools will greatly benefit the biomedical research community. The database is freely available at http://cistrome.org/db, and the Toolkit is at http://dbtoolkit.cistrome.org.

[1]  D. Pisano,et al.  A genetic interaction between RAP1 and telomerase reveals an unanticipated role for RAP1 in telomere maintenance , 2016, Aging cell.

[2]  A. Regev,et al.  The transcription factor BATF operates as an essential differentiation checkpoint in early effector CD8+ T cells , 2014, Nature Immunology.

[3]  R. Mann,et al.  Disentangling the many layers of eukaryotic transcriptional regulation. , 2012, Annual review of genetics.

[4]  Henry W. Long,et al.  A Somatically Acquired Enhancer of the Androgen Receptor Is a Noncoding Driver in Advanced Prostate Cancer , 2018, Cell.

[5]  Hongyu Zhao,et al.  Metabolic Regulation of Gene Expression by Histone Lysine β‐hydroxybutyrylation , 2016, Molecular cell.

[6]  Ting Wang,et al.  Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser , 2013, Bioinform..

[7]  Jie Zhang,et al.  Practical Guidelines for the Comprehensive Analysis of ChIP-seq Data , 2013, PLoS Comput. Biol..

[8]  Shenglin Mei,et al.  Modeling cis-regulation with a compendium of genome-wide histone H3K27ac profiles , 2016, Genome research.

[9]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[10]  Tao Liu,et al.  ChiLin: a comprehensive ChIP-seq and DNase-seq quality control and analysis pipeline , 2016, BMC Bioinformatics.

[11]  Clifford A. Meyer,et al.  Cistrome: an integrative platform for transcriptional regulation studies , 2011, Genome Biology.

[12]  Ting Wang,et al.  Using the Wash U Epigenome Browser to Examine Genome‐Wide Sequencing Data , 2012, Current protocols in bioinformatics.

[13]  Florian Hahne,et al.  Visualizing Genomic Data Using Gviz and Bioconductor , 2016, Statistical Genomics.

[14]  Myles Brown,et al.  BINOCh: binding inference from nucleosome occupancy changes , 2011, Bioinform..

[15]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[16]  Tao Liu,et al.  Cistrome Data Browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse , 2016, Nucleic Acids Res..

[17]  Lisa Helbling Chadwick,et al.  The NIH Roadmap Epigenomics Program data resource. , 2012, Epigenomics.

[18]  P. Park,et al.  Design and analysis of ChIP-seq experiments for DNA-binding proteins , 2008, Nature Biotechnology.

[19]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[20]  Qian Wang,et al.  A comprehensive view of nuclear receptor cancer cistromes. , 2011, Cancer research.

[21]  R. Young,et al.  Histone H3K27ac separates active from poised enhancers and predicts developmental state , 2010, Proceedings of the National Academy of Sciences.

[22]  Neva C. Durand,et al.  The Energetics and Physiological Impact of Cohesin Extrusion , 2018, Cell.

[23]  Ariel S. Schwartz,et al.  An Atlas of Combinatorial Transcriptional Regulation in Mouse and Man , 2010, Cell.

[24]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[25]  Howard Y. Chang,et al.  ATAC‐seq: A Method for Assaying Chromatin Accessibility Genome‐Wide , 2015, Current protocols in molecular biology.

[26]  Hanfei Sun,et al.  Target analysis by integration of transcriptome and ChIP-seq data with BETA , 2013, Nature Protocols.

[27]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[28]  Brent S. Pedersen,et al.  GIGGLE: a search engine for large-scale integrated genome analysis , 2017, Nature Methods.

[29]  Andrew Emili,et al.  Multiparameter functional diversity of human C2H2 zinc finger proteins , 2016, Genome research.

[30]  Julia A. Lasserre,et al.  Histone modification levels are predictive for gene expression , 2010, Proceedings of the National Academy of Sciences.

[31]  Benoît Ballester,et al.  ReMap 2018: an updated atlas of regulatory regions from an integrative analysis of DNA-binding ChIP-seq experiments , 2017, Nucleic Acids Res..

[32]  W. Shi,et al.  Transcription Factor IRF4 Promotes CD8+ T Cell Exhaustion and Limits the Development of Memory‐like T Cells during Chronic Infection , 2017, Immunity.

[33]  S. Barrans,et al.  SPIB and BATF provide alternate determinants of IRF4 occupancy in diffuse large B-cell lymphoma linked to disease heterogeneity , 2014, Nucleic acids research.

[34]  Clifford A. Meyer,et al.  Cistrome Cancer: A Web Resource for Integrative Gene Regulation Modeling in Cancer. , 2017, Cancer research.

[35]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[36]  T. Hughes,et al.  The Human Transcription Factors , 2018, Cell.

[37]  Helga Thorvaldsdóttir,et al.  Integrative Genomics Viewer , 2011, Nature Biotechnology.

[38]  G. Crawford,et al.  DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. , 2010, Cold Spring Harbor protocols.

[39]  Ting Wang,et al.  The 3D Genome Browser: a web-based browser for visualizing 3D genome organization and long-range chromatin interactions , 2017, Genome Biology.

[40]  Hui Zhou,et al.  ChIPBase v2.0: decoding transcriptional regulatory networks of non-coding RNAs and protein-coding genes from ChIP-seq data , 2016, Nucleic Acids Res..

[41]  T. Mikkelsen,et al.  Genome-wide maps of chromatin state in pluripotent and lineage-committed cells , 2007, Nature.

[42]  Nathaniel D. Heintzman,et al.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome , 2007, Nature Genetics.

[43]  O. Jänne,et al.  SUMO ligase PIAS1 functions as a target gene selective androgen receptor coregulator on prostate cancer cell chromatin , 2014, Nucleic acids research.

[44]  F. Buchholz,et al.  ZBTB48 is both a vertebrate telomere‐binding protein and a transcriptional activator , 2017, EMBO reports.

[45]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[46]  S. Hilsenbeck,et al.  GATA2 facilitates steroid receptor coactivator recruitment to the androgen receptor complex , 2014, Proceedings of the National Academy of Sciences.

[47]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[48]  The androgen receptor (AR) amino-terminus imposes androgen-specific regulation of AR gene expression via an exonic enhancer. , 2001, Endocrinology.

[49]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[50]  J. Michael Cherry,et al.  The Encyclopedia of DNA elements (ENCODE): data portal update , 2017, Nucleic Acids Res..