Structural genomics of thermotoga maritima proteins shows that contact order is a major determinant of protein thermostability.

Despite numerous studies, understanding the structural basis of protein stability in thermophilic organisms has remained elusive. One of the main reasons is the limited number of thermostable protein structures available for analysis, but also the difficulty in identifying relevant features to compare. Notably, an intuitive feeling of "compactness" of thermostable proteins has eluded quantification. With the unprecedented opportunity to assemble a data set for comparative analyses due to the recent advances in structural genomics, we can now revisit this issue and focus on experimentally determined structures of proteins from the hyperthermophilic bacterium Thermotoga maritima. We find that 73% of T. maritima proteins have higher contact order than their mesophilic homologs. Thus, contact order, a structural feature that was originally introduced to explain differences in folding rates of different protein families, is a significant parameter that can now be correlated with thermostability.

[1]  M. Danson,et al.  Assaying activity and assessing thermostability of hyperthermophilic enzymes. , 2001, Methods in enzymology.

[2]  Jorng-Tzong Horng,et al.  PGTdb: a database providing growth temperatures of prokaryotes , 2004, Bioinform..

[3]  J. R. Doyle There's more than one way to skin a cat. , 2003, Techniques in hand & upper extremity surgery.

[4]  Dmitry N Ivankov,et al.  Chain length is the main determinant of the folding rate for proteins with three‐state folding kinetics , 2003, Proteins.

[5]  A. Godzik,et al.  Are proteins ideal mixtures of amino acids? Analysis of energy parameter sets , 1995, Protein science : a publication of the Protein Society.

[6]  R. Varadarajan,et al.  Elucidation of factors responsible for enhanced thermal stability of proteins: a structural genomics based study. , 2002, Biochemistry.

[7]  A. Godzik,et al.  Topology fingerprint approach to the inverse protein folding problem. , 1992, Journal of molecular biology.

[8]  Kevin W Plaxco,et al.  How the folding rate constant of simple, single-domain proteins depends on the number of native contacts , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[9]  R. Jaenicke,et al.  The crystal structure of holo-glyceraldehyde-3-phosphate dehydrogenase from the hyperthermophilic bacterium Thermotoga maritima at 2.5 A resolution. , 1995, Journal of molecular biology.

[10]  Philip E. Bourne,et al.  The distribution and query systems of the RCSB Protein Data Bank , 2004, Nucleic Acids Res..

[11]  A. Szilágyi,et al.  Structural differences between mesophilic, moderately thermophilic and extremely thermophilic protein subunits: results of a comprehensive survey. , 2000, Structure.

[12]  S. Marqusee,et al.  Comparison of the folding processes of T. thermophilus and E. coli ribonucleases H. , 2002, Journal of molecular biology.

[13]  M. Gerstein,et al.  The stability of thermophilic proteins: a study based on comprehensive genome comparison , 2000, Functional & Integrative Genomics.

[14]  A. Clarke,et al.  The influence of intramolecular bridges on the dynamics of a protein folding reaction. , 2002, Biochemistry.

[15]  J. Tanner,et al.  Determinants of enzyme thermostability observed in the molecular structure of Thermus aquaticus D-glyceraldehyde-3-phosphate dehydrogenase at 25 Angstroms Resolution. , 1996, Biochemistry.

[16]  G. Petsko Structural basis of thermostability in hyperthermophilic proteins, or "there's more than one way to skin a cat". , 2001, Methods in enzymology.

[17]  R. Sterner,et al.  Thermophilic Adaptation of Proteins , 2001, Critical reviews in biochemistry and molecular biology.

[18]  Karen N. Allen,et al.  X-ray structure analysis of a designed oligomeric miniprotein reveals a discrete quaternary architecture. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[20]  J. Li,et al.  Structure of holo-glyceraldehyde-3-phosphate dehydrogenase from Palinurus versicolor refined at 2 A resolution. , 1998, Acta crystallographica. Section D, Biological crystallography.

[21]  Guy Perrière,et al.  Integrated databanks access and sequence/structure analysis services at the PBIL , 2003, Nucleic Acids Res..

[22]  Adam Godzik,et al.  Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[23]  A. Aubry,et al.  Comparative analysis of thermoadaptation within the archaeal glyceraldehyde-3-phosphate dehydrogenases from mesophilic Methanobacterium bryantii and thermophilic Methanothermus fervidus. , 2002, Biopolymers.

[24]  O. Gascuel,et al.  A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. , 2003, Systematic biology.

[25]  R. Nussinov,et al.  Factors enhancing protein thermostability. , 2000, Protein engineering.

[26]  D Eisenberg,et al.  Transproteomic evidence of a loop-deletion mechanism for enhancing protein thermostability. , 1999, Journal of molecular biology.

[27]  D. Baker,et al.  Contact order, transition state placement and the refolding rates of single domain proteins. , 1998, Journal of molecular biology.