Entropic Stabilization of Proteins and Its Proteomic Consequences

Evolutionary traces of thermophilic adaptation are manifest, on the whole-genome level, in compositional biases toward certain types of amino acids. However, it is sometimes difficult to discern their causes without a clear understanding of underlying physical mechanisms of thermal stabilization of proteins. For example, it is well-known that hyperthermophiles feature a greater proportion of charged residues, but, surprisingly, the excess of positively charged residues is almost entirely due to lysines but not arginines in the majority of hyperthermophilic genomes. All-atom simulations show that lysines have a much greater number of accessible rotamers than arginines of similar degree of burial in folded states of proteins. This finding suggests that lysines would preferentially entropically stabilize the native state. Indeed, we show in computational experiments that arginine-to-lysine amino acid substitutions result in noticeable stabilization of proteins. We then hypothesize that if evolution uses this physical mechanism as a complement to electrostatic stabilization in its strategies of thermophilic adaptation, then hyperthermostable organisms would have much greater content of lysines in their proteomes than comparably sized and similarly charged arginines. Consistent with that, high-throughput comparative analysis of complete proteomes shows extremely strong bias toward arginine-to-lysine replacement in hyperthermophilic organisms and overall much greater content of lysines than arginines in hyperthermophiles. This finding cannot be explained by genomic GC compositional biases or by the universal trend of amino acid gain and loss in protein evolution. We discovered here a novel entropic mechanism of protein thermostability due to residual dynamics of rotamer isomerization in native state and demonstrated its immediate proteomic implications. Our study provides an example of how analysis of a fundamental physical mechanism of thermostability helps to resolve a puzzle in comparative genomics as to why amino acid compositions of hyperthermophilic proteomes are significantly biased toward lysines but not similarly charged arginines.

[1]  N Gautham,et al.  Correlations between nucleotide frequencies and amino acid composition in 115 bacterial species. , 2004, Biochemical and biophysical research communications.

[2]  R. Jaenicke,et al.  Stability and stabilization of globular proteins in solution. , 2000, Journal of biotechnology.

[3]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[4]  A. Elcock The stability of salt bridges at high temperatures: implications for hyperthermophilic proteins. , 1998, Journal of molecular biology.

[5]  Srebrenka Robic,et al.  Energetic evidence for formation of a pH-dependent hydrophobic cluster in the denatured state of Thermus thermophilus ribonuclease H. , 2003, Journal of molecular biology.

[6]  Orna Man,et al.  Proteomic signatures: Amino acid and oligopeptide compositions differentiate among phyla , 2003, Proteins.

[7]  P. Privalov,et al.  Energetics of protein structure. , 1995, Advances in protein chemistry.

[8]  M J Sternberg,et al.  Empirical scale of side-chain conformational entropy in protein folding. , 1993, Journal of molecular biology.

[9]  David P. Kreil,et al.  Identification of thermophilic species by the amino acid compositions deduced from their genomes. , 2001, Nucleic acids research.

[10]  C. Cambillau,et al.  Structural and Genomic Correlates of Hyperthermostability* , 2000, The Journal of Biological Chemistry.

[11]  Andrés Moya,et al.  Genomic determinants of protein folding thermodynamics in prokaryotic organisms. , 2004, Journal of molecular biology.

[12]  Hervé Minoux,et al.  An electrostatic basis for the stability of thermophilic proteins , 2004, Proteins.

[13]  M J Sternberg,et al.  Protein side-chain conformational entropy derived from fusion data--comparison with other empirical scales. , 1994, Protein engineering.

[14]  A. Karshikoff,et al.  Ion pairs and the thermotolerance of proteins from hyperthermophiles: a "traffic rule" for hot roads. , 2001, Trends in biochemical sciences.

[15]  Stephen L. Mayo,et al.  Design, structure and stability of a hyperthermophilic protein variant , 1998, Nature Structural Biology.

[16]  Edward N. Trifonov,et al.  The Triplet Code From First Principles , 2004, Journal of biomolecular structure & dynamics.

[17]  R. Jaenicke,et al.  Protein stability and molecular adaptation to extreme conditions. , 1991, European journal of biochemistry.

[18]  W E Stites,et al.  Energetics of side chain packing in staphylococcal nuclease assessed by systematic double mutant cycles. , 2001, Biochemistry.

[19]  Darren A. Natale,et al.  The complete genome of hyperthermophile Methanopyrus kandleri AV19 and monophyly of archaeal methanogens , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[20]  B. Tidor,et al.  Do salt bridges stabilize proteins? A continuum electrostatic analysis , 1994, Protein science : a publication of the Protein Society.

[21]  C. Vieille,et al.  Hyperthermophilic Enzymes: Sources, Uses, and Molecular Mechanisms for Thermostability , 2001, Microbiology and Molecular Biology Reviews.

[22]  Fredj Tekaia,et al.  Amino acid composition of genomes, lifestyles of organisms, and evolutionary trends: a global picture with correspondence analysis. , 2002, Gene.

[23]  P. Privalov,et al.  Contribution of hydration to protein folding thermodynamics. I. The enthalpy of hydration. , 1993, Journal of molecular biology.

[24]  N. Go Theoretical studies of protein folding. , 1983, Annual review of biophysics and bioengineering.

[25]  D. M. Taverna,et al.  Why are proteins marginally stable? , 2002, Proteins.

[26]  N. Go,et al.  Noninteracting local‐structure model of folding and unfolding transition in globular proteins. I. Formulation , 1981, Biopolymers.

[27]  S. Marqusee,et al.  Comparison of the folding processes of T. thermophilus and E. coli ribonucleases H. , 2002, Journal of molecular biology.

[28]  R. Varadarajan,et al.  Elucidation of factors responsible for enhanced thermal stability of proteins: a structural genomics based study. , 2002, Biochemistry.

[29]  Haruo Abe,et al.  Noninteracting local‐structure model of folding and unfolding transition in globular proteins. II. Application to two‐dimensional lattice proteins , 1981, Biopolymers.

[30]  J. V. Van Beeumen,et al.  Crystal Structures of an Oxygen-binding Cytochrome cfrom Rhodobacter sphaeroides * , 2000, The Journal of Biological Chemistry.

[31]  Charles L Brooks,et al.  The effects of ionic strength on protein stability: the cold shock protein family. , 2002, Journal of molecular biology.

[32]  Adrian A Canutescu,et al.  Access the most recent version at doi: 10.1110/ps.03154503 References , 2003 .

[33]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[34]  E. Shakhnovich,et al.  Side‐chain dynamics and protein folding , 2001, Proteins.

[35]  E. Trifonov,et al.  Molecular evolution from abiotic scratch , 2002, FEBS letters.

[36]  M. Gerstein,et al.  The stability of thermophilic proteins: a study based on comprehensive genome comparison , 2000, Functional & Integrative Genomics.

[37]  E. Koonin,et al.  A universal trend of amino acid gain and loss in protein evolution , 2005, Nature.

[38]  M J Sternberg,et al.  Side‐chain conformational entropy in protein folding , 1995, Protein science : a publication of the Protein Society.

[39]  Tina M. Hay To Charge or Not to Charge. , 1989 .

[40]  G. Böhm,et al.  The stability of proteins in extreme environments. , 1998, Current opinion in structural biology.

[41]  W E Stites,et al.  Energetics of side chain packing in staphylococcal nuclease assessed by exchange of valines, isoleucines, and leucines. , 2001, Biochemistry.

[42]  G. Makhatadze,et al.  Engineering a thermostable protein via optimization of charge-charge interactions on the protein surface. , 1999, Biochemistry.

[43]  Paul B Rainey,et al.  Global analysis of predicted proteomes: functional adaptation of physical properties. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[44]  R. Huber,et al.  The complete genome of the hyperthermophilic bacterium Aquifex aeolicus , 1998, Nature.

[45]  B. Stoddard,et al.  Computational Thermostabilization of an Enzyme , 2005, Science.

[46]  Srebrenka Robic,et al.  Role of residual structure in the unfolded state of a thermophilic protein , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[47]  Y. Kawarabayasi,et al.  Complete genome sequence of an aerobic hyper-thermophilic crenarchaeon, Aeropyrum pernix K1. , 1999, DNA research : an international journal for rapid publication of reports on genes and genomes.

[48]  Satoshi Fukuchi,et al.  Unique amino acid composition of proteins in halophilic bacteria. , 2003, Journal of molecular biology.

[49]  B. Matthews,et al.  Enhanced protein thermostability from site-directed mutations that decrease the entropy of unfolding. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[50]  P. Privalov,et al.  Contribution of hydration to protein folding thermodynamics. II. The entropy and Gibbs energy of hydration. , 1993, Journal of molecular biology.

[51]  W. Stites,et al.  Replacement of staphylococcal nuclease hydrophobic core residues with those from thermophilic homologues indicates packing is improved in some thermostable proteins. , 2004, Journal of molecular biology.

[52]  W E Stites,et al.  Increasing the thermostability of staphylococcal nuclease: implications for the origin of protein thermostability. , 2000, Journal of molecular biology.

[53]  E. Shakhnovich,et al.  The folding thermodynamics and kinetics of crambin using an all-atom Monte Carlo simulation. , 2000, Journal of molecular biology.

[54]  T M Handel,et al.  Review: protein design--where we were, where we are, where we're going. , 2001, Journal of structural biology.