Binding of Transcription Factors Adapts to Resolve Information-Energy Tradeoff

We examine the binding of transcription factors to DNA in terms of an information transfer problem. The input of the noisy channel is the biophysical signal of a factor bound to a DNA site, and the output is a distribution of probable DNA sequences at this site. This task involves an inherent tradeoff between the information gain and the energetics of the binding interaction—high binding energies provide higher information gain but hinder the dynamics of the system as factors are bound too tightly. We show that adaptation of the binding interaction towards increasing information transfer under a general energy constraint implies that the information gain per specific binding energy at each base-pair is maximized. We analyze hundreds of prokaryote and eukaryote transcription factors from various organisms to evaluate the discrimination energies. We find that, in accordance with our theoretical argument, binding energies nearly maximize the information gain per energy. This work suggests the adaptation of information gain as a generic design principle of molecular recognition systems.

[1]  Alexandre V. Morozov,et al.  Statistical mechanical modeling of genome-wide transcription factor occupancy data by MatrixREDUCE , 2006, ISMB.

[2]  J. Kinney,et al.  Using deep sequencing to characterize the biophysical mechanism of a transcriptional regulatory sequence , 2010, Proceedings of the National Academy of Sciences.

[3]  David J. Arenillas,et al.  JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles , 2013, Nucleic Acids Res..

[4]  G. Stormo,et al.  Analysis of Homeodomain Specificities Allows the Family-wide Prediction of Preferred Recognition Sites , 2008, Cell.

[5]  T. D. Schneider,et al.  Information content of binding sites on nucleotide sequences. , 1986, Journal of molecular biology.

[6]  G. Stormo,et al.  Improved Models for Transcription Factor Binding Site Identification Using Nonindependent Interactions , 2012, Genetics.

[7]  D. S. Fields,et al.  Specificity, free energy and information content in protein-DNA interactions. , 1998, Trends in biochemical sciences.

[8]  Kate B. Cook,et al.  Determination and Inference of Eukaryotic Transcription Factor Sequence Specificity , 2014, Cell.

[9]  Alexander Hoffmann,et al.  Understanding the temporal codes of intra-cellular signals. , 2010, Current opinion in genetics & development.

[10]  R. Young,et al.  Rapid analysis of the DNA-binding specificities of transcription factors with DNA microarrays , 2004, Nature Genetics.

[11]  L. Gold,et al.  Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase. , 1990, Science.

[12]  G. Lahav,et al.  Encoding and Decoding Cellular Information through Signaling Dynamics , 2013, Cell.

[13]  Daniel E. Newburger,et al.  Variation in Homeodomain DNA Binding Revealed by High-Resolution Analysis of Sequence Preferences , 2008, Cell.

[14]  Thierry Mora,et al.  A General Pairwise Interaction Model Provides an Accurate Description of In Vivo Transcription Factor Binding Sites , 2014, PloS one.

[15]  Gary D. Stormo,et al.  Modeling the specificity of protein-DNA interactions , 2013, Quantitative Biology.

[16]  T. Hwa,et al.  Physical constraints and functional characteristics of transcription factor–DNA interaction , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[17]  Tsvi Tlusty,et al.  Cross-species analysis traces adaptation of Rubisco toward optimality in a low-dimensional landscape , 2010, Proceedings of the National Academy of Sciences.

[18]  P. V. von Hippel,et al.  On the specificity of DNA-protein interactions. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Takeshi Norimatsu,et al.  Encoding and Decoding , 2016 .

[20]  Daniel E. Newburger,et al.  A Multiparameter Network Reveals Extensive Divergence between C. elegans bHLH Transcription Factors , 2009, Cell.

[21]  P. Park ChIP–seq: advantages and challenges of a maturing technology , 2009, Nature Reviews Genetics.

[22]  Tsvi Tlusty,et al.  The Ribosome as an Optimal Decoder: A Lesson in Molecular Recognition , 2013, Cell.

[23]  Nicola J. Rinaldi,et al.  Transcriptional regulatory code of a eukaryotic genome , 2004, Nature.

[24]  S. Quake,et al.  A Systems Approach to Measuring the Binding Energy Landscapes of Transcription Factors , 2007, Science.

[25]  Julio Collado-Vides,et al.  RegulonDB v8.0: omics data sets, evolutionary conservation, regulatory phrases, cross-validated gold standards and more , 2012, Nucleic Acids Res..

[26]  Julio Collado-Vides,et al.  RegulonDB (version 6.0): gene regulation model of Escherichia coli K-12 beyond transcription, active (experimental) annotated promoters and Textpresso navigation , 2007, Nucleic Acids Res..

[27]  Ville Mustonen,et al.  Energy-dependent fitness: A quantitative model for the evolution of yeast transcription factor binding sites , 2008, Proceedings of the National Academy of Sciences.

[28]  Daniel E. Newburger,et al.  High-resolution DNA-binding specificity analysis of yeast transcription factors. , 2009, Genome research.

[29]  P. V. von Hippel,et al.  Selection of DNA binding sites by regulatory proteins. Statistical-mechanical theory and application to operators and promoters. , 1987, Journal of molecular biology.

[30]  L. Mirny,et al.  Different gene regulation strategies revealed by analysis of binding motifs. , 2009, Trends in genetics : TIG.

[31]  E. Furlong,et al.  Transcription factors: from enhancer binding to developmental control , 2012, Nature Reviews Genetics.

[32]  Inna Dubchak,et al.  RegTransBase – a database of regulatory sequences and interactions based on literature: a resource for investigating transcriptional regulation in prokaryotes , 2013, BMC Genomics.

[33]  Juan M. Vaquerizas,et al.  DNA-Binding Specificities of Human Transcription Factors , 2013, Cell.

[34]  M. Lässig,et al.  Evolutionary population genetics of promoters: predicting binding sites and functional phylogenies. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[35]  A. E. Hirsh,et al.  The application of statistical physics to evolutionary biology. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[36]  Erin K O'Shea,et al.  Signal-dependent dynamics of transcription factor translocation controls gene expression , 2011, Nature Structural &Molecular Biology.

[37]  Anirvan M. Sengupta,et al.  A biophysical approach to transcription factor binding site discovery. , 2003, Genome research.

[38]  J. Kinney,et al.  Precise physical models of protein–DNA interaction from high-throughput data , 2007, Proceedings of the National Academy of Sciences.

[39]  A. Barski,et al.  Genomic location analysis by ChIP‐Seq , 2009, Journal of cellular biochemistry.

[40]  G. Stormo,et al.  Additivity in protein-DNA interactions: how good an approximation is it? , 2002, Nucleic acids research.

[41]  T. Tlusty,et al.  RecA-mediated homology search as a nearly optimal signal detection system. , 2010, Molecular cell.

[42]  Yue Zhao,et al.  Inferring Binding Energies from Selected Binding Sites , 2009, PLoS Comput. Biol..

[43]  Michael Lässig,et al.  From biophysics to evolutionary genetics: statistical aspects of gene regulation , 2007, BMC Bioinformatics.

[44]  Nir Friedman,et al.  Balancing speed and accuracy of polyclonal T cell activation: a role for extracellular feedback , 2012, BMC Systems Biology.

[45]  Daniel E. Newburger,et al.  Diversity and Complexity in DNA Recognition by Transcription Factors , 2009, Science.

[46]  P. V. Hippel,et al.  On the Molecular Bases of the Specificity of Interaction of Transcriptional Proteins with Genome DNA , 1979 .