Predicting protein stability changes from sequences using support vector machines

MOTIVATION The prediction of protein stability change upon mutations is key to understanding protein folding and misfolding. At present, methods are available to predict stability changes only when the atomic structure of the protein is available. Methods addressing the same task starting from the protein sequence are, however, necessary in order to complete genome annotation, especially in relation to single nucleotide polymorphisms (SNPs) and related diseases. RESULTS We develop a method based on support vector machines that, starting from the protein sequence, predicts the sign and the value of free energy stability change upon single point mutation. We show that the accuracy of our predictor is as high as 77% in the specific task of predicting the DeltaDeltaG sign related to the corresponding protein stability. When predicting the DeltaDeltaG values, a satisfactory correlation agreement with the experimental data is also found. As a final blind benchmark, the predictor is applied to proteins with a set of disease-related SNPs, for which thermodynamic data are also known. We found that our predictions corroborate the view that disease-related mutations correspond to a decrease in protein stability. AVAILABILITY http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant2.0/I-Mutant2.0.cgi

[1]  V. Shnyrov,et al.  Comparative calorimetric study of non-amyloidogenic and amyloidogenic variants of the homotetrameric protein transthyretin. , 2000, Biophysical chemistry.

[2]  D. Selkoe Folding proteins in fatal ways , 2003, Nature.

[3]  Hongyi Zhou,et al.  Distance‐scaled, finite ideal‐gas reference state improves structure‐derived potentials of mean force for structure selection and stability prediction , 2002, Protein science : a publication of the Protein Society.

[4]  Marianne Rooman,et al.  PoPMuSiC, rationally designing point mutations in protein structures , 2002, Bioinform..

[5]  A. Fersht,et al.  Is there a unifying mechanism for protein folding? , 2003, Trends in biochemical sciences.

[6]  John Moult,et al.  Three‐dimensional structural location and molecular functional effects of missense SNPs in the T cell receptor Vβ domain , 2003, Proteins.

[7]  P. Kollman,et al.  Exhaustive mutagenesis in silico: Multicoordinate free energy calculations on proteins and peptides , 2000, Proteins.

[8]  K. Takano,et al.  Are the parameters of various stabilization factors estimated from mutant human lysozymes compatible with other proteins? , 2001, Protein engineering.

[9]  D Gilis,et al.  Predicting protein stability changes upon mutation using database-derived potentials: solvent accessibility determines the importance of local versus non-local interactions along the sequence. , 1997, Journal of molecular biology.

[10]  C. Dobson Protein folding and misfolding , 2003, Nature.

[11]  J. Moult,et al.  SNPs, protein structure, and disease , 2001, Human mutation.

[12]  Akinori Sarai,et al.  ProTherm, version 4.0: thermodynamic database for proteins and mutants , 2004, Nucleic Acids Res..

[13]  R. Glockshuber,et al.  Influence of amino acid substitutions related to inherited human prion diseases on the thermodynamic stability of the cellular prion protein. , 1999, Biochemistry.

[14]  Piero Fariselli,et al.  A neural-network-based method for predicting protein stability changes upon single point mutations , 2004, ISMB/ECCB.

[15]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[16]  W. Surewicz,et al.  The Effect of Disease-associated Mutations on the Folding Pathway of Human Prion Protein* , 2004, Journal of Biological Chemistry.

[17]  S J Wodak,et al.  Contribution of the hydrophobic effect to protein stability: analysis based on simulations of the Ile-96----Ala mutation in barnase. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[18]  K Wüthrich,et al.  NMR structures of three single-residue variants of the human prion protein. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[19]  T L Blundell,et al.  Prediction of the stability of protein mutants based on structural environment-dependent amino acid substitution and propensity tables. , 1997, Protein engineering.