Prediction of protein mutant stability using classification and regression tool.

Prediction of protein stability upon amino acid substitutions is an important problem in molecular biology and the solving of which would help for designing stable mutants. In this work, we have analyzed the stability of protein mutants using two different datasets of 1396 and 2204 mutants obtained from ProTherm database, respectively for free energy change due to thermal (DeltaDeltaG) and denaturant denaturations (DeltaDeltaG(H(2)O)). We have used a set of 48 physical, chemical energetic and conformational properties of amino acid residues and computed the difference of amino acid properties for each mutant in both sets of data. These differences in amino acid properties have been related to protein stability (DeltaDeltaG and DeltaDeltaG(H(2)O)) and are used to train with classification and regression tool for predicting the stability of protein mutants. Further, we have tested the method with 4 fold, 5 fold and 10 fold cross validation procedures. We found that the physical properties, shape and flexibility are important determinants of protein stability. The classification of mutants based on secondary structure (helix, strand, turn and coil) and solvent accessibility (buried, partially buried, partially exposed and exposed) distinguished the stabilizing/destabilizing mutants at an average accuracy of 81% and 80%, respectively for DeltaDeltaG and DeltaDeltaG(H(2)O). The correlation between the experimental and predicted stability change is 0.61 for DeltaDeltaG and 0.44 for DeltaDeltaG(H(2)O). Further, the free energy change due to the replacement of amino acid residue has been predicted within an average error of 1.08 kcal/mol and 1.37 kcal/mol for thermal and chemical denaturation, respectively. The relative importance of secondary structure and solvent accessibility, and the influence of the dataset on prediction of protein mutant stability have been discussed.

[1]  D Gilis,et al.  Stability changes upon mutation of solvent-accessible residues in proteins evaluated by database-derived potentials. , 1996, Journal of molecular biology.

[2]  W E Stites,et al.  Contributions of the large hydrophobic amino acids to the stability of staphylococcal nuclease. , 1990, Biochemistry.

[3]  Shandar Ahmad,et al.  RVP-net: online prediction of real valued accessible surface area of proteins from single sequences , 2003, Bioinform..

[4]  Hongyi Zhou,et al.  Stability scale and atomic solvation parameters extracted from 1023 mutation experiments , 2002, Proteins.

[5]  M. Michael Gromiha,et al.  Importance of Native-State Topology for Determining the Folding Rate of Two-State Proteins , 2003, J. Chem. Inf. Comput. Sci..

[6]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[7]  M. Michael Gromiha,et al.  Relative importance of secondary structure and solvent accessibility to the stability of protein mutants.: A case study with amino acid properties and energetics on T4 and human lysozymes , 2005, Comput. Biol. Chem..

[8]  M. Gromiha,et al.  Role of structural and sequence information in the prediction of protein stability changes: comparison between buried and partially buried mutations. , 1999, Protein engineering.

[9]  M. Michael Gromiha,et al.  A Statistical Method for Predicting Protein Unfolding Rates from Amino Acid Sequence. , 2006 .

[10]  M. Michael Gromiha,et al.  FOLD-RATE: prediction of protein folding rates from amino acid sequence , 2006, Nucleic Acids Res..

[11]  B. Matthews,et al.  Studies on protein stability with T4 lysozyme. , 1995, Advances in protein chemistry.

[12]  M. Michael Gromiha,et al.  A Statistical Model for Predicting Protein Folding Rates from Amino Acid Sequence with Structural Class Information , 2005, J. Chem. Inf. Model..

[13]  M. Gromiha,et al.  Important amino acid properties for enhanced thermostability from mesophilic to thermophilic proteins. , 1999, Biophysical chemistry.

[14]  Piero Fariselli,et al.  A neural-network-based method for predicting protein stability changes upon single point mutations , 2004, ISMB/ECCB.

[15]  R. Abagyan,et al.  Large‐scale prediction of protein geometry and stability changes for arbitrary single point mutations , 2004, Proteins.

[16]  W F van Gunsteren,et al.  Prediction of the activity and stability effects of site-directed mutagenesis on a protein core. , 1992, Journal of molecular biology.

[17]  Steinar Thorvaldsen,et al.  Property-Dependent Analysis of Aligned Proteins from Two Or More Populations , 2005, APBC.

[18]  T Tsujita,et al.  Dependence of conformational stability on hydrophobicity of the amino acid residue in a series of variant proteins substituted at a unique position of tryptophan synthase alpha subunit. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[19]  M. Kanehisa,et al.  Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. , 1996, Protein engineering.

[20]  M. Gromiha,et al.  Relationship Between Amino Acid Properties and Protein Stability: Buried Mutations , 1999, Journal of protein chemistry.

[21]  B. García-Moreno E.,et al.  Changes in stability upon charge reversal and neutralization substitution in staphylococcal nuclease are dominated by favorable electrostatic effects. , 2003, Biochemistry.

[22]  Akinori Sarai,et al.  ProTherm: Thermodynamic Database for Proteins and Mutants , 1999, Nucleic Acids Res..

[23]  Burkhard Rost,et al.  PHD - an automatic mail server for protein secondary structure prediction , 1994, Comput. Appl. Biosci..

[24]  D Gilis,et al.  Predicting protein stability changes upon mutation using database-derived potentials: solvent accessibility determines the importance of local versus non-local interactions along the sequence. , 1997, Journal of molecular biology.

[25]  Y. Yamagata,et al.  Positive Contribution of Hydration Structure on the Surface of Human Lysozyme to the Conformational Stability* , 2002, The Journal of Biological Chemistry.

[26]  Nikolay V Dokholyan,et al.  Can contact potentials reliably predict stability of proteins? , 2004, Journal of molecular biology.

[27]  M. Gromiha,et al.  Importance of Surrounding Residues for Protein Stability of Partially Buried Mutations , 2000, Journal of biomolecular structure & dynamics.

[28]  M. N. Ponnuswamy,et al.  Average assignment method for predicting the stability of protein mutants , 2006, Biopolymers.

[29]  Piero Fariselli,et al.  Predicting protein stability changes from sequences using support vector machines , 2005, ECCB/JBI.

[30]  Akinori Sarai,et al.  ProTherm, version 4.0: thermodynamic database for proteins and mutants , 2004, Nucleic Acids Res..

[31]  M Michael Gromiha,et al.  Important amino acid properties for determining the transition state structures of two‐state protein mutants , 2002, FEBS letters.

[32]  A. Fersht,et al.  Effect of cavity-creating mutations in the hydrophobic core of chymotrypsin inhibitor 2. , 1993, Biochemistry.

[33]  George I Makhatadze,et al.  Effects of charge-to-alanine substitutions on the stability of ribosomal protein L30e from Thermococcus celer. , 2005, Biochemistry.

[34]  Frank Eisenhaber,et al.  Improved strategy in analytic surface calculation for molecular systems: Handling of singularities and computational efficiency , 1993, J. Comput. Chem..

[35]  Kevin L. Shaw,et al.  Asp79 makes a large, unfavorable contribution to the stability of RNase Sa. , 2005, Journal of molecular biology.

[36]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[37]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.