Stabilizing proteins from sequence statistics: the interplay of conservation and correlation in triosephosphate isomerase stability.

Understanding the determinants of protein stability remains one of protein science's greatest challenges. There are still no computational solutions that calculate the stability effects of even point mutations with sufficient reliability for practical use. Amino acid substitutions rarely increase the stability of native proteins; hence, large libraries and high-throughput screens or selections are needed to stabilize proteins using directed evolution. Consensus mutations have proven effective for increasing stability, but these mutations are successful only about half the time. We set out to understand why some consensus mutations fail to stabilize, and what criteria might be useful to predict stabilization more accurately. Overall, consensus mutations at more conserved positions were more likely to be stabilizing in our model, triosephosphate isomerase (TIM) from Saccharomyces cerevisiae. However, positions coupled to other sites were more likely not to stabilize upon mutation. Destabilizing mutations could be removed both by removing sites with high statistical correlations to other positions and by removing nearly invariant positions at which "hidden correlations" can occur. Application of these rules resulted in identification of stabilizing mutations in 9 out of 10 positions, and amalgamation of all predicted stabilizing positions resulted in the most stable yeast TIM variant we produced (+8 °C). In contrast, a multimutant with 14 mutations each found to stabilize TIM independently was destabilized by 2 °C. Our results are a practical extension to the consensus concept of protein stabilization, and they further suggest the importance of positional independence in the mechanism of consensus stabilization.

[1]  Martin Lehmann,et al.  The consensus concept for thermostability engineering of proteins: further proof of concept. , 2002, Protein engineering.

[2]  T. Suzuki,et al.  Ancestral residues stabilizing 3-isopropylmalate dehydrogenase of an extreme thermophile: experimental evidence supporting the thermophilic common ancestor hypothesis. , 2001, Journal of biochemistry.

[3]  K. Dill Dominant forces in protein folding. , 1990, Biochemistry.

[4]  Z. Peng,et al.  Consensus-derived structural determinants of the ankyrin repeat motif , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[5]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[6]  S. Steinbacher,et al.  Sequence statistics reliably predict stabilizing mutations in a protein domain. , 1994, Journal of molecular biology.

[7]  F. Opperdoes,et al.  Kinetic properties of triose-phosphate isomerase from Trypanosoma brucei brucei. A comparison with the rabbit muscle and yeast enzymes. , 1987, European journal of biochemistry.

[8]  W. P. Russ,et al.  Natural-like function in artificial WW domains , 2005, Nature.

[9]  John Alan Gerlt,et al.  Evolution of function in (β/α)8-barrel enzymes , 2003 .

[10]  M. Lehmann,et al.  From DNA sequence to improved functionality: using protein sequence comparisons to rapidly design a thermostable consensus phytase. , 2000, Protein engineering.

[11]  W. P. Russ,et al.  Evolutionary information for specifying a protein fold , 2005, Nature.

[12]  J. Richard,et al.  Role of Lys-12 in catalysis by triosephosphate isomerase: a two-part substrate approach. , 2010, Biochemistry.

[13]  A. Fersht,et al.  Semirational design of active tumor suppressor p53 DNA binding domain with enhanced stability. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[14]  M. Sudol,et al.  WW: An isolated three‐stranded antiparallel β‐sheet domain that unfolds and refolds reversibly; evidence for a structured hydrophobic cluster in urea and GdnHCl and a disordered thermal unfolded state , 2008, Protein science : a publication of the Protein Society.

[15]  W. Stemmer,et al.  Single-step assembly of a gene and entire plasmid from large numbers of oligodeoxyribonucleotides. , 1995, Gene.

[16]  E Ohage,et al.  Intrabody construction and expression. I. The critical role of VL domain stability. , 1999, Journal of molecular biology.

[17]  J. Martial,et al.  Crystal structure of recombinant human triosephosphate isomerase at 2.8 Å resolution. Triosephosphate isomerase‐related human genetic disorders and comparison with the trypanosomal enzyme , 1994, Protein science : a publication of the Protein Society.

[18]  R. Sauer,et al.  Sequence space, folding and protein design. , 1996, Current opinion in structural biology.

[19]  M. Noble,et al.  Structure of triosephosphate isomerase from Escherichia coli determined at 2.6 A resolution. , 1993, Acta crystallographica. Section D, Biological crystallography.

[20]  Yong Xiong,et al.  Design of stable alpha-helical arrays from an idealized TPR motif. , 2003, Structure.

[21]  J. Gerlt,et al.  Evolution of function in (beta/alpha)8-barrel enzymes. , 2003, Current opinion in chemical biology.

[22]  Thomas J Magliery,et al.  Protein stability by number: high-throughput and statistical approaches to one of protein science's most difficult problems. , 2011, Current opinion in chemical biology.

[23]  Sanjay B. Hari,et al.  High-throughput thermal scanning: a general, rapid dye-binding thermal shift screen for protein engineering. , 2009, Journal of the American Chemical Society.

[24]  J. Knowles,et al.  pH-dependence of the triose phosphate isomerase reaction. , 1972, The Biochemical journal.

[25]  M. Edgell,et al.  High-precision, high-throughput stability determinations facilitated by robotics and a semiautomated titrating fluorometer. , 2003, Biochemistry.

[26]  Frances H. Arnold,et al.  Combinatorial and computational challenges for biocatalyst design , 2001, Nature.

[27]  F. Arnold Combinatorial and computational challenges for biocatalyst design , 2001, Nature.

[28]  Venuka Durani,et al.  Triosephosphate isomerase by consensus design: dramatic differences in physical properties and activity of related variants. , 2011, Journal of molecular biology.

[29]  Susumu Goto,et al.  The KEGG databases at GenomeNet , 2002, Nucleic Acids Res..

[30]  F. Richards Protein stability: still an unsolved problem , 1997, Cellular and Molecular Life Sciences CMLS.

[31]  Yaoqi Zhou,et al.  Web-based toolkits for topology prediction of transmembrane helical proteins, fold recognition, structure and binding scoring, folding-kinetics analysis and comparative analysis of domain combinations , 2005, Nucleic Acids Res..

[32]  K. H. Kalk,et al.  Structure determination of the glycosomal triosephosphate isomerase from Trypanosoma brucei brucei at 2.4 A resolution. , 1987, Journal of molecular biology.

[33]  G. Senisterra,et al.  High throughput methods of assessing protein stability and aggregation. , 2009, Molecular bioSystems.

[34]  M. Billeter,et al.  MOLMOL: a program for display and analysis of macromolecular structures. , 1996, Journal of molecular graphics.

[35]  Frances H Arnold,et al.  Consensus protein design without phylogenetic bias. , 2010, Journal of molecular biology.

[36]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[37]  François Stricher,et al.  The FoldX web server: an online force field , 2005, Nucleic Acids Res..

[38]  M. Lehmann,et al.  The consensus concept for thermostability engineering of proteins. , 2000, Biochimica et biophysica acta.

[39]  Gary J Lye,et al.  High-throughput measurement of protein stability in microtiter plates. , 2005, Biotechnology and bioengineering.

[40]  F. C. Hartman,et al.  Structure of yeast triosephosphate isomerase at 1.9-A resolution. , 1990, Biochemistry.

[41]  D W Banner,et al.  On the three-dimensional structure and catalytic mechanism of triose phosphate isomerase. , 1981, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[42]  David Rodriguez-Larrea,et al.  Engineering proteins with tunable thermodynamic and kinetic stabilities , 2008, Proteins.

[43]  Keiko Watanabe,et al.  Designing thermostable proteins: ancestral mutants of 3-isopropylmalate dehydrogenase designed by using a phylogenetic tree. , 2006, Journal of molecular biology.

[44]  Andreas Plückthun,et al.  Designing repeat proteins: well-expressed, soluble and stable proteins from combinatorial libraries of consensus ankyrin repeat proteins. , 2003, Journal of molecular biology.

[45]  François Stricher,et al.  A graphical interface for the FoldX forcefield , 2011, Bioinform..

[46]  A. Fersht,et al.  Structural basis for understanding oncogenic p53 mutations and designing rescue drugs , 2006, Proceedings of the National Academy of Sciences.

[47]  P. Harbury,et al.  Reverse engineering the (β/α)8 barrel fold , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[48]  A. Yamagishi,et al.  Improvement of Bacillus circulans beta-amylase activity attained using the ancestral mutation method. , 2010, Protein engineering, design & selection : PEDS.

[49]  C. Orengo,et al.  One fold with many functions: the evolutionary relationships between TIM barrel families based on their sequences, structures and functions. , 2002, Journal of molecular biology.

[50]  J. M. Sanchez-Ruiz,et al.  A stability pattern of protein hydrophobic mutations that reflects evolutionary structural optimization. , 2005, Biophysical journal.

[51]  A. Dąbrowska,et al.  Purification, crystallization and properties of triosephosphate isomerase from human skeletal muscle. , 1978, Acta biochimica Polonica.

[52]  A. Plückthun,et al.  Fully synthetic human combinatorial antibody libraries (HuCAL) based on modular consensus frameworks and CDRs randomized with trinucleotides. , 2000, Journal of molecular biology.

[53]  Lynne Regan,et al.  Sequence variation in ligand binding sites in proteins , 2005, BMC Bioinformatics.

[54]  G. Rose,et al.  Hydrogen bonding, hydrophobicity, packing, and protein folding. , 1993, Annual review of biophysics and biomolecular structure.

[55]  Mauno Vihinen,et al.  Performance of protein stability predictors , 2010, Human mutation.

[56]  A. Yamagishi,et al.  Extremely thermophilic translation system in the common ancestor commonote: ancestral mutants of Glycyl-tRNA synthetase from the extreme thermophile Thermus thermophilus. , 2007, Journal of molecular biology.