Energy-dependent fitness: A quantitative model for the evolution of yeast transcription factor binding sites

We present a genomewide cross-species analysis of regulation for broad-acting transcription factors in yeast. Our model for binding site evolution is founded on biophysics: the binding energy between transcription factor and site is a quantitative phenotype of regulatory function, and selection is given by a fitness landscape that depends on this phenotype. The model quantifies conservation, as well as loss and gain, of functional binding sites in a coherent way. Its predictions are supported by direct cross-species comparison between four yeast species. We find ubiquitous compensatory mutations within functional sites, such that the energy phenotype and the function of a site evolve in a significantly more constrained way than does its sequence. We also find evidence for substantial evolution of regulatory function involving point mutations as well as sequence insertions and deletions within binding sites. Genes lose their regulatory link to a given transcription factor at a rate similar to the neutral point mutation rate, from which we infer a moderate average fitness advantage of functional over nonfunctional sites. In a wider context, this study provides an example of inference of selection acting on a quantitative molecular trait.

[1]  Ting Wang,et al.  An improved map of conserved regulatory sites for Saccharomyces cerevisiae , 2006, BMC Bioinformatics.

[2]  M. Lässig,et al.  Evolutionary population genetics of promoters: predicting binding sites and functional phylogenies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[3]  N. Patel,et al.  Evidence for stabilizing selection in a eukaryotic enhancer element , 2000, Nature.

[4]  M. Lässig,et al.  Adaptations to fluctuating selection in Drosophila , 2007, Proceedings of the National Academy of Sciences.

[5]  Nigel F. Delaney,et al.  Darwinian Evolution Can Follow Only Very Few Mutational Paths to Fitter Proteins , 2006, Science.

[6]  Erik van Nimwegen,et al.  Statistical Features of Yeast ’ s Transcriptional Regulatory Code , 2006 .

[7]  P. V. von Hippel,et al.  Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. , 1987, Journal of molecular biology.

[8]  Rodrigo Lopez,et al.  Multiple sequence alignment with the Clustal series of programs , 2003, Nucleic Acids Res..

[9]  T. Ohta,et al.  Protein Polymorphism as a Phase of Molecular Evolution , 1971, Nature.

[10]  C. T. Brown,et al.  Evolutionary comparisons suggest many novel cAMP response protein binding sites in Escherichia coli. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[11]  Matthew W. Hahn,et al.  The evolution of transcriptional regulation in eukaryotes. , 2003, Molecular biology and evolution.

[12]  G. Jung,et al.  Structure-function analysis of the DNA binding domain of Saccharomyces cerevisiae ABF1. , 1995, Nucleic acids research.

[13]  Michael Lässig,et al.  From biophysics to evolutionary genetics: statistical aspects of gene regulation , 2007, BMC Bioinformatics.

[14]  Alan M. Moses,et al.  Position specific variation in the rate of evolution in transcription factor binding sites , 2003, BMC Evolutionary Biology.

[15]  T. Ohta,et al.  The Average Number of Generations until Fixation of a Mutant Gene in a Finite Population. , 1969, Genetics.

[16]  Terence Hwa,et al.  On the Selection and Evolution of Regulatory DNA Motifs , 2001, Journal of Molecular Evolution.

[17]  Stuart A. Kauffman,et al.  ORIGINS OF ORDER , 2019, Origins of Order.

[18]  M. Kreitman,et al.  Functional Evolution of a cis-Regulatory Module , 2005, PLoS biology.

[19]  A. Halpern,et al.  Evolutionary distances for protein-coding sequences: modeling site-specific residue frequencies. , 1998, Molecular biology and evolution.

[20]  Ron Shamir,et al.  Evolution and Selection in Yeast Promoters: Analyzing the Combined Effect of Diverse Transcription Factor Binding Sites , 2008, PLoS Comput. Biol..

[21]  R. Young,et al.  Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays , 2004, Nature Genetics.

[22]  D. S. Fields,et al.  Quantitative specificity of the Mnt repressor. , 1997, Journal of molecular biology.

[23]  Xin He,et al.  MORPH: Probabilistic Alignment Combined with Hidden Markov Models of cis-Regulatory Modules , 2007, PLoS Comput. Biol..

[24]  Scott Doniger,et al.  Frequent Gain and Loss of Functional Transcription Factor Binding Sites , 2007, PLoS Comput. Biol..

[25]  A. Sandelin,et al.  Applied bioinformatics for the identification of regulatory elements , 2004, Nature Reviews Genetics.

[26]  Nicola J. Rinaldi,et al.  Transcriptional Regulatory Networks in Saccharomyces cerevisiae , 2002, Science.

[27]  Tomoko Ohta,et al.  Evolution by nearly-neutral mutations , 2004, Genetica.

[28]  J. Liu,et al.  Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes. , 2001, Nucleic acids research.

[29]  S. Quake,et al.  A Systems Approach to Measuring the Binding Energy Landscapes of Transcription Factors , 2007, Science.

[30]  J. Monod,et al.  Teleonomic mechanisms in cellular metabolism, growth, and differentiation. , 1961, Cold Spring Harbor symposia on quantitative biology.

[31]  H. Tachida Molecular Evolution in a Multisite Nearly Neutral Mutation Model , 2000, Journal of Molecular Evolution.

[32]  Johannes Berg,et al.  Adaptive evolution of transcription factor binding sites , 2003, BMC Evolutionary Biology.

[33]  H. Kishino,et al.  Dating of the human-ape splitting by a molecular clock of mitochondrial DNA , 2005, Journal of Molecular Evolution.

[34]  T. Ohta Role of very slightly deleterious mutations in molecular evolution and polymorphism. , 1976, Theoretical population biology.

[35]  B. Birren,et al.  Sequencing and comparison of yeast species to identify genes and regulatory elements , 2003, Nature.

[36]  Erik van Nimwegen,et al.  SwissRegulon: a database of genome-wide annotations of regulatory sites , 2006, Nucleic Acids Res..

[37]  L. Fulton,et al.  Finding Functional Features in Saccharomyces Genomes by Phylogenetic Footprinting , 2003, Science.

[38]  Ari Löytynoja,et al.  An algorithm for progressive multiple alignment of sequences with insertions. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[39]  Mark Gerstein,et al.  Divergence of transcription factor binding sites across related yeast species. , 2007, Science.

[40]  J. Kinney,et al.  Precise physical models of protein–DNA interaction from high-throughput data , 2007, Proceedings of the National Academy of Sciences.

[41]  M. Kreitman,et al.  Evolutionary dynamics of the enhancer region of even-skipped in Drosophila. , 1995, Molecular biology and evolution.

[42]  T. Ohta Role of random genetic drift in the evolution of interactive systems , 2009, Journal of Molecular Evolution.

[43]  J. Monod,et al.  Genetic regulatory mechanisms in the synthesis of proteins. , 1961, Journal of molecular biology.

[44]  D. S. Fields,et al.  Specificity, free energy and information content in protein-DNA interactions. , 1998, Trends in biochemical sciences.

[45]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[46]  Richard A. Watson,et al.  PERSPECTIVE:SIGN EPISTASIS AND GENETIC CONSTRAINT ON EVOLUTIONARY TRAJECTORIES , 2005 .

[47]  Alexandre V. Morozov,et al.  Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE , 2006, ISMB.

[48]  J. Fay,et al.  Identification of functional transcription factor binding sites using closely related Saccharomyces species. , 2005, Genome research.

[49]  M. Ptashne,et al.  Genes and Signals , 2001 .