Discovery of non-directional and directional pioneer transcription factors by modeling DNase profile magnitude and shape

We describe protein interaction quantitation (PIQ), a computational method for modeling the magnitude and shape of genome-wide DNase I hypersensitivity profiles to identify transcription factor (TF) binding sites. Through the use of machine-learning techniques, PIQ identified binding sites for >700 TFs from one DNase I hypersensitivity analysis followed by sequencing (DNase-seq) experiment with accuracy comparable to that of chromatin immunoprecipitation followed by sequencing (ChIP-seq). We applied PIQ to analyze DNase-seq data from mouse embryonic stem cells differentiating into prepancreatic and intestinal endoderm. We identified 120 and experimentally validated eight 'pioneer' TF families that dynamically open chromatin. Four pioneer TF families only opened chromatin in one direction from their motifs. Furthermore, we identified 'settler' TFs whose genomic binding is principally governed by proximity to open chromatin. Our results support a model of hierarchical TF binding in which directional and nondirectional pioneer activity shapes the chromatin landscape for population by settler TFs.

[1]  M. Groudine,et al.  Chromosomal subunits in active genes have an altered conformation. , 1976, Science.

[2]  Carl Wu The 5′ ends of Drosophila heat shock genes in chromatin are hypersensitive to DNase I , 1980, Nature.

[3]  J. R. Coleman,et al.  Hepatic specification of the gut endoderm in vitro: cell signaling and transcriptional control. , 1996, Genes & development.

[4]  Amos Storkey UAI '00: Proceedings of the 16th Conference in Uncertainty in Artificial Intelligence , 2000 .

[5]  Tom Minka,et al.  Expectation Propagation for approximate Bayesian inference , 2001, UAI.

[6]  R. Scarpulla,et al.  Mitochondrial DNA Instability and Peri-Implantation Lethality Associated with Targeted Disruption of Nuclear Respiratory Factor 1 in Mice , 2001, Molecular and Cellular Biology.

[7]  Frank R. Lin,et al.  Opening of compacted chromatin by early developmental transcription factors HNF3 (FoxA) and GATA-4. , 2002, Molecular cell.

[8]  J. Deng,et al.  The B subunit of the CCAAT box binding transcription factor complex (CBF/NF-Y) is essential for early mouse development and cell proliferation. , 2003, Cancer research.

[9]  Transposition of the Tol 2 Element , an Ac-Like Element From the Japanese Medaka Fish Oryzias latipes , in Mouse Embryonic Stem Cells , 2004 .

[10]  K. Kawakami,et al.  Transposition of the Tol2 element, an Ac-like element from the Japanese medaka fish Oryzias latipes, in mouse embryonic stem cells. , 2004, Genetics.

[11]  Jérôme Eeckhoute,et al.  A cell-type-specific transcriptional network required for estrogen regulation of cyclin D1 and cell cycle progression in breast cancer. , 2006, Genes & development.

[12]  Jérôme Eeckhoute,et al.  A cell-type-specific transcriptional network required for estrogen regulation of cyclin D 1 and cell cycle progression in breast cancer , 2006 .

[13]  S. Yamanaka,et al.  Induction of Pluripotent Stem Cells from Mouse Embryonic and Adult Fibroblast Cultures by Defined Factors , 2006, Cell.

[14]  K. Zaret,et al.  Repression by Groucho/TLE/Grg proteins: genomic site recruitment generates compacted chromatin in vitro and impairs activator binding in vivo. , 2007, Molecular cell.

[15]  A. Mortazavi,et al.  Genome-Wide Mapping of in Vivo Protein-DNA Interactions , 2007, Science.

[16]  N. D. Clarke,et al.  Integration of External Signaling Pathways with the Core Transcriptional Network in Embryonic Stem Cells , 2008, Cell.

[17]  Z. Weng,et al.  High-Resolution Mapping and Characterization of Open Chromatin across the Genome , 2008, Cell.

[18]  B. Doble,et al.  The ground state of embryonic stem cell self-renewal , 2008, Nature.

[19]  Uma M. Muthurajan,et al.  Nucleosome-binding affinity as a primary determinant of the nuclear mobility of the pioneer transcription factor FoxA. , 2009, Genes & development.

[20]  M. Kyba,et al.  A conserved role for Hox paralog group 4 in regulation of hematopoietic progenitors. , 2009, Stem cells and development.

[21]  William Stafford Noble,et al.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.

[22]  Jian Xu,et al.  Transcriptional competence and the active marking of tissue-specific enhancers by defined transcription factors in embryonic and induced pluripotent stem cells. , 2009, Genes & development.

[23]  Jeff A. Bilmes,et al.  A dynamic Bayesian network for identifying protein-binding footprints from single molecule-based sequencing data , 2010, Bioinform..

[24]  Rudolf Grosschedl,et al.  Early B cell factor 1 regulates B cell gene networks by activation, repression, and transcription- independent poising of chromatin. , 2010, Immunity.

[25]  E. Davidson Emerging properties of animal gene regulatory networks , 2010, Nature.

[26]  Yuchun Guo,et al.  Discovering homotypic binding events at high spatial resolution , 2010, Bioinform..

[27]  G. Crawford,et al.  DNase-seq: a high-resolution technique for mapping active gene regulatory elements across the genome from mammalian cells. , 2010, Cold Spring Harbor protocols.

[28]  S. Hori c‐Rel: A pioneer in directing regulatory T‐cell lineage commitment? , 2010, European journal of immunology.

[29]  Krishanu Saha,et al.  Pluripotency and Cellular Reprogramming: Facts, Hypotheses, Unresolved Issues , 2010, Cell.

[30]  N. D. Clarke,et al.  Integrative model of genomic factors for determining binding site selection by estrogen receptor-α , 2010, Molecular systems biology.

[31]  J. Stamatoyannopoulos,et al.  Chromatin accessibility pre-determines glucocorticoid receptor binding patterns , 2011, Nature Genetics.

[32]  W. Reinhold,et al.  G4 motifs correlate with promoter-proximal transcriptional pausing in human genes , 2011, Nucleic acids research.

[33]  J. Stamatoyannopoulos,et al.  Quantitative Models of the Mechanisms That Control Genome-Wide Patterns of Transcription Factor Binding during Early Drosophila Development , 2011, PLoS genetics.

[34]  Jacob F. Degner,et al.  Sequence and Chromatin Accessibility Data Accurate Inference of Transcription Factor Binding from Dna Material Supplemental Open Access , 2022 .

[35]  Richard A Young,et al.  Control of the Embryonic Stem Cell State , 2011, Cell.

[36]  George Q. Daley,et al.  Lineage Regulators Direct BMP and Wnt Pathways to Cell-Specific Programs during Differentiation and Regeneration , 2011, Cell.

[37]  George Q. Daley,et al.  Lineage Regulators Direct BMP and Wnt Pathways to Cell-Specific Programs During Differentiation and Regeneration, , 2011 .

[38]  David A. Orlando,et al.  Master Transcription Factors Determine Cell-Type-Specific Responses to TGF-β Signaling , 2011, Cell.

[39]  R. Sherwood,et al.  Wnt signaling specifies and patterns intestinal endoderm , 2011, Mechanisms of Development.

[40]  Michael Q. Zhang,et al.  Study of FoxA Pioneer Factor at Silent Genes Reveals Rfx-Repressed Enhancer at Cdx2 and a Potential Indicator of Esophageal Adenocarcinoma Development , 2011, PLoS genetics.

[41]  Jonathan Schug,et al.  The Nucleosome Map of the Mammalian Liver , 2011, Nature Structural &Molecular Biology.

[42]  E. Birney,et al.  High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells. , 2011, Genome research.

[43]  J. Carroll,et al.  Pioneer transcription factors: establishing competence for gene expression. , 2011, Genes & development.

[44]  Yuchun Guo,et al.  High Resolution Genome Wide Binding Event Finding and Motif Discovery Reveals Transcription Factor Spatial Binding Constraints , 2012, PLoS Comput. Biol..

[45]  Shane J. Neph,et al.  An expansive human regulatory lexicon encoded in transcription factor footprints , 2012, Nature.

[46]  Greg Donahue,et al.  Facilitators and Impediments of the Pluripotency Reprogramming Factors' Initial Engagement with the Genome , 2012, Cell.

[47]  Nathan C. Sheffield,et al.  The accessible chromatin landscape of the human genome , 2012, Nature.

[48]  D. Figarella-Branger,et al.  The selector gene Pax7 dictates alternate pituitary cell fates through its pioneer action on chromatin remodeling. , 2012, Genes & development.

[49]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[50]  Maxwell W. Libbrecht,et al.  Ubiquitous heterogeneity and asymmetry of the chromatin environment at regulatory elements , 2012, Genome research.

[51]  Thomas A. Down,et al.  Chromatin Accessibility Data Sets Show Bias Due to Sequence Specificity of the DNase I Enzyme , 2013, PloS one.

[52]  M. Nardini,et al.  Sequence-Specific Transcription Factor NF-Y Displays Histone-like DNA Binding and H2B-like Ubiquitination , 2013, Cell.

[53]  A. Gautam,et al.  STATE , 2016, Intell. Serv. Robotics.