Cistrome Data Browser: a data portal for ChIP-Seq and chromatin accessibility data in human and mouse

Chromatin immunoprecipitation, DNase I hypersensitivity and transposase-accessibility assays combined with high-throughput sequencing enable the genome-wide study of chromatin dynamics, transcription factor binding and gene regulation. Although rapidly accumulating publicly available ChIP-seq, DNase-seq and ATAC-seq data are a valuable resource for the systematic investigation of gene regulation processes, a lack of standardized curation, quality control and analysis procedures have hindered extensive reuse of these data. To overcome this challenge, we built the Cistrome database, a collection of ChIP-seq and chromatin accessibility data (DNase-seq and ATAC-seq) published before January 1, 2016, including 13 366 human and 9953 mouse samples. All the data have been carefully curated and processed with a streamlined analysis pipeline and evaluated with comprehensive quality control metrics. We have also created a user-friendly web server for data query, exploration and visualization. The resulting Cistrome DB (Cistrome Data Browser), available online at http://cistrome.org/db, is expected to become a valuable resource for transcriptional and epigenetic regulation studies.

[1]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[2]  Yan Cui,et al.  CTCFBSDB 2.0: a database for CTCF-binding sites and genome organization , 2012, Nucleic Acids Res..

[3]  Brian T. Lee,et al.  The UCSC Genome Browser database: 2015 update , 2014, Nucleic Acids Res..

[4]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[5]  Ting Wang,et al.  Exploring long-range genome interactions using the WashU Epigenome Browser , 2013, Nature Methods.

[6]  Marc D. Perry,et al.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia , 2012, Genome research.

[7]  T. Furey ChIP – seq and beyond : new and improved methodologies to detect and characterize protein – DNA interactions , 2012 .

[8]  Marco-Antonio Mendoza-Parra,et al.  A quality control system for profiles obtained by ChIP sequencing , 2013, Nucleic acids research.

[9]  Sean R. Davis,et al.  NCBI GEO: archive for functional genomics data sets—update , 2012, Nucleic Acids Res..

[10]  Tao Liu,et al.  ChiLin: a comprehensive ChIP-seq and DNase-seq quality control and analysis pipeline , 2016, BMC Bioinformatics.

[11]  Michael D. Wilson,et al.  ChIP-seq: using high-throughput sequencing to discover protein-DNA interactions. , 2009, Methods.

[12]  Alex E. Lash,et al.  Gene Expression Omnibus: NCBI gene expression and hybridization array data repository , 2002, Nucleic Acids Res..

[13]  Dominik Beck,et al.  BloodChIP: a database of comparative genome-wide transcription factor binding profiles in human blood cells , 2013, Nucleic Acids Res..

[14]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[15]  Jing Liu,et al.  CR Cistrome: a ChIP-Seq database for chromatin regulators and histone modification linkages in human and mouse , 2013, Nucleic Acids Res..

[16]  B. L,et al.  The accessible chromatin landscape of the human genome , 2016 .

[17]  Nadav S. Bar,et al.  Landscape of transcription in human cells , 2012, Nature.

[18]  G. Crawford,et al.  DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. , 2010, Cold Spring Harbor protocols.

[19]  Hanfei Sun,et al.  Target analysis by integration of transcriptome and ChIP-seq data with BETA , 2013, Nature Protocols.

[20]  P. Park ChIP–seq: advantages and challenges of a maturing technology , 2009, Nature Reviews Genetics.

[21]  Guy Cochrane,et al.  Biocuration of functional annotation at the European nucleotide archive , 2015, Nucleic Acids Res..

[22]  Clifford A. Meyer,et al.  Cistrome: an integrative platform for transcriptional regulation studies , 2011, Genome Biology.

[23]  David Z. Chen,et al.  Architecture of the human regulatory network derived from ENCODE data , 2012, Nature.

[24]  T. Mikkelsen,et al.  The NIH Roadmap Epigenomics Mapping Consortium , 2010, Nature Biotechnology.

[25]  Clifford A. Meyer,et al.  Identifying and mitigating bias in next-generation sequencing methods for chromatin biology , 2014, Nature Reviews Genetics.

[26]  R. Young,et al.  Transcriptional Regulation and Its Misregulation in Disease , 2013, Cell.

[27]  Howard Y. Chang,et al.  Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position , 2013, Nature Methods.

[28]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[29]  Nathaniel D. Heintzman,et al.  Histone modifications at human enhancers reflect global cell-type-specific gene expression , 2009, Nature.