Curated collection of yeast transcription factor DNA binding specificity data reveals novel structural and gene regulatory insights

BackgroundTranscription factors (TFs) play a central role in regulating gene expression by interacting with cis-regulatory DNA elements associated with their target genes. Recent surveys have examined the DNA binding specificities of most Saccharomyces cerevisiae TFs, but a comprehensive evaluation of their data has been lacking.ResultsWe analyzed in vitro and in vivo TF-DNA binding data reported in previous large-scale studies to generate a comprehensive, curated resource of DNA binding specificity data for all characterized S. cerevisiae TFs. Our collection comprises DNA binding site motifs and comprehensive in vitro DNA binding specificity data for all possible 8-bp sequences. Investigation of the DNA binding specificities within the basic leucine zipper (bZIP) and VHT1 regulator (VHR) TF families revealed unexpected plasticity in TF-DNA recognition: intriguingly, the VHR TFs, newly characterized by protein binding microarrays in this study, recognize bZIP-like DNA motifs, while the bZIP TF Hac1 recognizes a motif highly similar to the canonical E-box motif of basic helix-loop-helix (bHLH) TFs. We identified several TFs with distinct primary and secondary motifs, which might be associated with different regulatory functions. Finally, integrated analysis of in vivo TF binding data with protein binding microarray data lends further support for indirect DNA binding in vivo by sequence-specific TFs.ConclusionsThe comprehensive data in this curated collection allow for more accurate analyses of regulatory TF-DNA interactions, in-depth structural studies of TF-DNA specificity determinants, and future experimental investigations of the TFs' predicted target genes and regulatory roles.

[1]  H. Ronne,et al.  Yeast SKO1 gene encodes a bZIP protein that binds to the CRE motif and acts as a repressor of transcription. , 1992, Nucleic acids research.

[2]  Alexander J. Hartemink,et al.  Informative priors based on transcription factor structural class improve de novo motif discovery , 2006, ISMB.

[3]  Martha L. Bulyk,et al.  Using a structural and logics systems approach to infer bHLH–DNA binding specificity determinants , 2011, Nucleic acids research.

[4]  Dominique Thomas,et al.  Assembly of a bZIP–bHLH transcription activation complex: formation of the yeast Cbf1–Met4–Met28 complex is regulated through Met28 stimulation of Cbf1 DNA binding , 1997, The EMBO journal.

[5]  Joon Kim,et al.  Determinants of half-site spacing preferences that distinguish AP-1 and ATF/CREB bZIP domains , 1995, Nucleic Acids Res..

[6]  Daniel E. Newburger,et al.  High-resolution DNA-binding specificity analysis of yeast transcription factors. , 2009, Genome research.

[7]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[8]  Shelley Lane,et al.  Regulation of Mating and Filamentation Genes by Two Distinct Ste12 Complexes in Saccharomyces cerevisiae , 2006, Molecular and Cellular Biology.

[9]  E. Siggia,et al.  Connecting protein structure with predictions of regulatory sites , 2007, Proceedings of the National Academy of Sciences.

[10]  Ting Wang,et al.  An improved map of conserved regulatory sites for Saccharomyces cerevisiae , 2006, BMC Bioinformatics.

[11]  Daniel E. Newburger,et al.  A Multiparameter Network Reveals Extensive Divergence between C. elegans bHLH Transcription Factors , 2009, Cell.

[12]  Frederick P. Roth,et al.  Next generation software for functional trend analysis , 2009, Bioinform..

[13]  Roger E Bumgarner,et al.  Integrated genomic and proteomic analyses of a systematically perturbed metabolic network. , 2001, Science.

[14]  W. L. Ruzzo,et al.  Genome-wide MyoD binding in skeletal muscle cells: a potential for broad cellular reprogramming. , 2010, Developmental cell.

[15]  Christopher L. Warren,et al.  A library of yeast transcription factor motifs reveals a widespread function for Rsc3 in targeting nucleosome exclusion at promoters. , 2008, Molecular cell.

[16]  Daniel E. Newburger,et al.  Variation in Homeodomain DNA Binding Revealed by High-Resolution Analysis of Sequence Preferences , 2008, Cell.

[17]  I. Taylor,et al.  Characterization of the DNA-binding domains from the yeast cell-cycle transcription factors Mbp1 and Swi4. , 2000, Biochemistry.

[18]  William Stafford Noble,et al.  Quantifying similarity between motifs , 2007, Genome Biology.

[19]  Raluca Gordân,et al.  Distinguishing direct versus indirect transcription factor-DNA interactions. , 2009, Genome research.

[20]  William Stafford Noble,et al.  Global mapping of protein-DNA interactions in vivo by digital genomic footprinting , 2009, Nature Methods.

[21]  S. Quake,et al.  De Novo Identification and Biophysical Characterization of Transcription Factor Binding Sites with Microfluidic Affinity Analysis , 2010, Nature Biotechnology.

[22]  Kara Dolinski,et al.  Gene Ontology annotations at SGD: new data sources and annotation methods , 2007, Nucleic Acids Res..

[23]  Liam J. McGuffin,et al.  Protein structure prediction servers at University College London , 2005, Nucleic Acids Res..

[24]  William Stafford Noble,et al.  Improved similarity scores for comparing motifs , 2011, Bioinform..

[25]  David Botstein,et al.  Nutritional homeostasis in batch and steady-state culture of yeast. , 2004, Molecular biology of the cell.

[26]  R. Young,et al.  Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays , 2004, Nature Genetics.

[27]  K. Struhl,et al.  The transcription factor Ifh1 is a key regulator of yeast ribosomal protein genes , 2004, Nature.

[28]  K. Mori,et al.  Signalling from endoplasmic reticulum to nucleus: transcription factor with a basic‐leucine zipper motif is required for the unfolded protein‐response pathway , 1996, Genes to cells : devoted to molecular & cellular mechanisms.

[29]  Sean R. Collins,et al.  Global landscape of protein complexes in the yeast Saccharomyces cerevisiae , 2006, Nature.

[30]  Stephen K. Burley,et al.  Structure of the winged-helix protein hRFX1 reveals a new mode of DNA binding , 2000, Nature.

[31]  Timothy Ravasi,et al.  A systems approach to delineate functions of paralogous transcription factors: Role of the Yap family in the DNA damage response , 2008, Proceedings of the National Academy of Sciences.

[32]  J. Rouvière-Yaniv,et al.  The histone‐like protein HU binds specifically to DNA recombination and repair intermediates , 2000, The EMBO journal.

[33]  M. Berger,et al.  Protein binding microarrays (PBMs) for rapid, high-throughput characterization of the sequence specificities of DNA binding proteins. , 2006, Methods in molecular biology.

[34]  Toshio Hakoshima,et al.  Structural basis for the diversity of DNA recognition by bZIP transcription factors , 2000, Nature Structural Biology.

[35]  Mark B Gerstein,et al.  Dynamic and complex transcription factor binding during an inducible response in yeast. , 2009, Genes & development.

[36]  T. Curran,et al.  Transcription factor interactions: basics on zippers , 1991 .

[37]  G. Church,et al.  Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. , 2000, Journal of molecular biology.

[38]  M. Ajimura,et al.  [Schizosaccharomyces pombe]. , 1997, Tanpakushitsu kakusan koso. Protein, nucleic acid, enzyme.

[39]  D. Wolf,et al.  Aminopeptidase yscII of yeast , 1988 .

[40]  Andrew R. Gehrke,et al.  Genome-wide analysis of ETS-family DNA-binding in vitro and in vivo , 2010, The EMBO journal.

[41]  Juan M. Vaquerizas,et al.  Multiplexed massively parallel SELEX for characterization of human transcription factor binding specificities. , 2010, Genome research.

[42]  John Aach,et al.  Measuring absolute expression with microarrays with a calibrated reference sample and an extended signal intensity range , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[43]  R. Trumbly,et al.  Isolation and characterization of aminopeptidase mutants of Saccharomyces cerevisiae , 1983, Journal of bacteriology.

[44]  J. Ragoussis,et al.  Principles of dimer-specific gene regulation revealed by a comprehensive characterization of NF-κB family DNA binding , 2011, Nature Immunology.

[45]  Daniel E. Newburger,et al.  Diversity and Complexity in DNA Recognition by Transcription Factors , 2009, Science.

[46]  Kai Li,et al.  Exploring the functional landscape of gene expression: directed search of large microarray compendia , 2007, Bioinform..

[47]  Naama Barkai,et al.  Strategy of Transcription Regulation in the Budding Yeast , 2007, PloS one.

[48]  B. Akache,et al.  Candida albicans Zinc Cluster Protein Upc2p Confers Resistance to Antifungal Drugs and Is an Activator of Ergosterol Biosynthetic Genes , 2005, Antimicrobial Agents and Chemotherapy.

[49]  N. D. Clarke,et al.  Explicit equilibrium modeling of transcription-factor binding and gene regulation , 2005, Genome Biology.

[50]  G. Church,et al.  Genome-wide co-occurrence of promoter elements reveals a cis-regulatory cassette of rRNA transcription motifs in Saccharomyces cerevisiae. , 2002, Genome research.

[51]  T. Ideker,et al.  Supporting Online Material for A Systems Approach to Mapping DNA Damage Response Pathways , 2006 .

[52]  F. Klebl,et al.  Vhr1p, a New Transcription Factor from Budding Yeast, Regulates Biotin-dependent Expression of VHT1 and BIO5* , 2006, Journal of Biological Chemistry.

[53]  M. Bulyk,et al.  Using protein design algorithms to understand the molecular basis of disease caused by protein–DNA interactions: the Pax6 example , 2010, Nucleic acids research.

[54]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[55]  L. Fulton,et al.  Finding Functional Features in Saccharomyces Genomes by Phylogenetic Footprinting , 2003, Science.

[56]  K. Struhl Molecular mechanisms of transcriptional regulation in yeast. , 1990, Annual review of biochemistry.

[57]  K. Struhl,et al.  Yap, a novel family of eight bZIP proteins in Saccharomyces cerevisiae with distinct biological functions , 1997, Molecular and cellular biology.

[58]  D. Koller,et al.  Sfp1 is a stress- and nutrient-sensitive regulator of ribosomal protein gene expression. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[59]  M. Berger,et al.  Universal protein-binding microarrays for the comprehensive characterization of the DNA-binding specificities of transcription factors , 2009, Nature Protocols.

[60]  Yanhui Hu,et al.  Approaching a complete repository of sequence-verified protein-encoding clones for Saccharomyces cerevisiae. , 2007, Genome research.

[61]  M. Marton,et al.  Transcriptional Profiling Shows that Gcn4p Is a Master Regulator of Gene Expression during Amino Acid Starvation in Yeast , 2001, Molecular and Cellular Biology.

[62]  Trey Ideker,et al.  Coevolution within a transcriptional network by compensatory trans and cis mutations. , 2010, Genome research.

[63]  Martha L. Bulyk,et al.  UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein–DNA interactions , 2010, Nucleic Acids Res..

[64]  Michael A. Beer,et al.  Predicting Gene Expression from Sequence , 2004, Cell.

[65]  K. Struhl,et al.  The GCN4 basic region leucine zipper binds DNA as a dimer of uninterrupted α Helices: Crystal structure of the protein-DNA complex , 1992, Cell.

[66]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[67]  Martha L Bulyk,et al.  Non-DNA-binding cofactors enhance DNA-binding specificity of a transcriptional regulatory complex , 2011, Molecular systems biology.

[68]  Jasper Rine,et al.  Upc2p and Ecm22p, Dual Regulators of Sterol Biosynthesis in Saccharomyces cerevisiae , 2001, Molecular and Cellular Biology.

[69]  A. Philippakis,et al.  Compact, universal DNA microarrays to comprehensively determine transcription-factor binding site specificities , 2006, Nature Biotechnology.

[70]  Markus J. Tamás,et al.  The Saccharomyces cerevisiae Sko1p transcription factor mediates HOG pathway‐dependent osmotic regulation of a set of genes encoding enzymes implicated in protection from oxidative damage , 2001, Molecular microbiology.

[71]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[72]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[73]  Rodrigo Lopez,et al.  Clustal W and Clustal X version 2.0 , 2007, Bioinform..

[74]  Rachel Patton McCord,et al.  Inferring condition-specific transcription factor function from DNA binding and gene expression data , 2007, Molecular systems biology.

[75]  Gary D. Stormo,et al.  enoLOGOS: a versatile web tool for energy normalized sequence logos , 2005, Nucleic Acids Res..