Genome-Wide Analysis of Transcription Factor Binding Sites Based on ChIP-Seq Data

Molecular interactions between protein complexes and DNA mediate essential gene-regulatory functions. Uncovering such interactions by chromatin immunoprecipitation coupled with massively parallel sequencing (ChIP-Seq) has recently become the focus of intense interest. We here introduce quantitative enrichment of sequence tags (QuEST), a powerful statistical framework based on the kernel density estimation approach, which uses ChIP-Seq data to determine positions where protein complexes contact DNA. Using QuEST, we discovered several thousand binding sites for the human transcription factors SRF, GABP and NRSF at an average resolution of about 20 base pairs. MEME motif-discovery tool–based analyses of the QuEST-identified sequences revealed DNA binding by cofactors of SRF, providing evidence that cofactor binding specificity can be obtained from ChIP-Seq data. By combining QuEST analyses with Gene Ontology (GO) annotations and expression data, we illustrate how general functions of transcription factors can be inferred.

[1]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[2]  Charles Elkan,et al.  Fitting a Mixture Model By Expectation Maximization To Discover Motifs In Biopolymer , 1994, ISMB.

[3]  David J. Anderson,et al.  Silencing is golden: negative regulation in the control of neuronal gene transcription , 1995, Current Opinion in Neurobiology.

[4]  D. Anderson,et al.  Identification of potential target genes for the neuron-restrictive silencer factor. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[5]  G. Gardner WELCOME TO THE NEW FRONTIER , 1996 .

[6]  G. Owens,et al.  Interaction of CArG Elements and a GC-rich Repressor Element in Transcriptional Regulation of the Smooth Muscle Myosin Heavy Chain Gene in Vascular Smooth Muscle Cells* , 1997, The Journal of Biological Chemistry.

[7]  J. Lieb,et al.  Genome-wide mapping of protein-DNA interactions by chromatin immunoprecipitation and DNA microarray hybridization. , 2003, Methods in molecular biology.

[8]  C. Dieterich,et al.  The SRF target gene Fhl2 antagonizes RhoA/MAL-dependent activation of SRF. , 2004, Molecular cell.

[9]  B. Wasylyk,et al.  Ets ternary complex transcription factors. , 2004, Gene.

[10]  S. Cawley,et al.  Unbiased Mapping of Transcription Factor Binding Sites along Human Chromosomes 21 and 22 Points to Widespread Regulation of Noncoding RNAs , 2004, Cell.

[11]  J. McMillan,et al.  GA-binding protein transcription factor: a review of GABP as an integrator of intracellular signaling and protein-protein interactions. , 2004, Blood cells, molecules & diseases.

[12]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[13]  R. Prywes,et al.  Myocardin/MKL family of SRF coactivators: Key regulators of immediate early and muscle specific gene expression , 2004, Journal of cellular biochemistry.

[14]  G. Mandel,et al.  REST and Its Corepressors Mediate Plasticity of Neuronal Gene Chromatin throughout Neurogenesis , 2005, Cell.

[15]  R. Treisman,et al.  Actin' together: serum response factor, its cofactors and the link to signal transduction. , 2006, Trends in cell biology.

[16]  E. Creemers,et al.  The myocardin family of transcriptional coactivators: versatile regulators of cell growth, migration, and myogenesis. , 2006, Genes & development.

[17]  N. Hannett,et al.  Activated Signal Transduction Kinases Frequently Occupy Target Genes , 2006, Science.

[18]  Richard M Myers,et al.  Network: from Single Conserved Sites to Genome-wide Repertoire Comparative Genomics Modeling of the Nrsf/rest Repressor Material Supplemental , 2022 .

[19]  William Stafford Noble,et al.  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project , 2007, Nature.

[20]  Zhiping Weng,et al.  Transcription factor binding and modified histones in human bidirectional promoters. , 2007, Genome research.

[21]  Allen D. Delaney,et al.  Genome-wide profiles of STAT1 DNA association using chromatin immunoprecipitation and massively parallel sequencing , 2007, Nature Methods.

[22]  R. Myers,et al.  The ets-Related Transcription Factor GABP Directs Bidirectional Transcription , 2007, PLoS genetics.

[23]  E. Mardis ChIP-seq: welcome to the new frontier , 2007, Nature Methods.

[24]  Dustin E. Schones,et al.  High-Resolution Profiling of Histone Methylations in the Human Genome , 2007, Cell.

[25]  R. Myers,et al.  Serum response factor binding sites differ in three human cell types. , 2007, Genome research.

[26]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[27]  B. Wold,et al.  Sequence census methods for functional genomics , 2008, Nature Methods.

[28]  Mark Gerstein,et al.  Systematic evaluation of variability in ChIP-chip experiments using predefined DNA targets. , 2008, Genome research.