70% efficiency of bistate molecular machines explained by information theory, high dimensional geometry and evolutionary convergence

The relationship between information and energy is key to understanding biological systems. We can display the information in DNA sequences specifically bound by proteins by using sequence logos, and we can measure the corresponding binding energy. The two can be compared because one form of the second law of thermodynamics defines the minimum energy dissipation required to gain one bit of information. Under the isothermal conditions in which molecular machines function, this is $E_{min} = k_B T \ln 2$ joules per bit ($k_B$ is Boltzmann's constant and $T$ is the absolute temperature). An efficiency of binding can then be computed by dividing the information in a logo by the free energy of binding after it has been converted to bits. The isothermal efficiencies of both genetic control systems and visual pigments are near 70%. From information and coding theory, the theoretical efficiency limit for bistate molecular machines is $\ln 2 \approx 0.6931$. Evolutionary convergence to maximum efficiency is limited by the constraint that molecular states must be distinct from each other. The result indicates that natural molecular machines operate close to their information processing maximum (the channel capacity), and implies that nanotechnology can attain this goal.
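The efficiency calculation described in the abstract can be sketched in a few lines of Python. This is only an illustration, assuming the binding free energy is supplied in joules per mole and that the minimum dissipation per bit is $k_B T \ln 2$; the function names are hypothetical and are not taken from the paper, and no measured values are included.

```python
import math

BOLTZMANN_J_PER_K = 1.380649e-23   # Boltzmann's constant (exact SI value)
AVOGADRO_PER_MOL = 6.02214076e23   # Avogadro's number (exact SI value)


def min_energy_per_bit(temperature_k):
    """Minimum energy dissipation per bit, k_B * T * ln 2, in joules."""
    return BOLTZMANN_J_PER_K * temperature_k * math.log(2)


def energy_to_bits(delta_g_j_per_mol, temperature_k):
    """Convert a (negative) molar binding free energy to the largest
    number of bits that energy could pay for under isothermal conditions."""
    if delta_g_j_per_mol >= 0:
        raise ValueError("binding free energy must be negative")
    energy_per_molecule = -delta_g_j_per_mol / AVOGADRO_PER_MOL
    return energy_per_molecule / min_energy_per_bit(temperature_k)


def isothermal_efficiency(information_bits, delta_g_j_per_mol, temperature_k):
    """Information gained (e.g. the area under a sequence logo, in bits)
    divided by the binding free energy expressed in bits."""
    return information_bits / energy_to_bits(delta_g_j_per_mol, temperature_k)
```

With an information content from a sequence logo and a measured binding free energy at a known temperature, `isothermal_efficiency` returns a dimensionless ratio that can be compared with the theoretical limit of ln 2 stated in the abstract.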
