DUET: a server for predicting effects of mutations on protein stability using an integrated computational approach

Cancer genome and other sequencing initiatives are generating extensive data on non-synonymous single nucleotide polymorphisms (nsSNPs) in human and other genomes. In order to understand the impacts of nsSNPs on the structure and function of the proteome, as well as to guide protein engineering, accurate in silicomethodologies are required to study and predict their effects on protein stability. Despite the diversity of available computational methods in the literature, none has proven accurate and dependable on its own under all scenarios where mutation analysis is required. Here we present DUET, a web server for an integrated computational approach to study missense mutations in proteins. DUET consolidates two complementary approaches (mCSM and SDM) in a consensus prediction, obtained by combining the results of the separate methods in an optimized predictor using Support Vector Machines (SVM). We demonstrate that the proposed method improves overall accuracy of the predictions in comparison with either method individually and performs as well as or better than similar methods. The DUET web server is freely and openly available at http://structure.bioc.cam.ac.uk/duet.

[1]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[2]  Wagner Meira,et al.  Protein cutoff scanning: A comparative analysis of cutoff dependent and cutoff free methods for prospecting contacts in proteins , 2009, Proteins.

[3]  S. Sathiya Keerthi,et al.  Improvements to the SMO algorithm for SVM regression , 2000, IEEE Trans. Neural Networks Learn. Syst..

[4]  Piero Fariselli,et al.  A neural-network-based method for predicting protein stability changes upon single point mutations , 2004, ISMB/ECCB.

[5]  Emidio Capriotti,et al.  Bioinformatics for personal genome interpretation , 2012, Briefings Bioinform..

[6]  J. H. Steiger Tests for comparing elements of a correlation matrix. , 1980 .

[7]  L. Serrano,et al.  Predicting changes in the stability of proteins and protein complexes: a study of more than 1000 mutations. , 2002, Journal of molecular biology.

[8]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[9]  T L Blundell,et al.  Prediction of the stability of protein mutants based on structural environment-dependent amino acid substitution and propensity tables. , 1997, Protein engineering.

[10]  Wagner Meira,et al.  Cutoff Scanning Matrix (CSM): structural classification and function prediction by protein inter-residue distance patterns , 2011, BMC Genomics.

[11]  H. Dyson,et al.  Mechanism of coupled folding and binding of an intrinsically disordered protein , 2007, Nature.

[12]  Douglas E. V. Pires,et al.  mCSM: predicting the effects of mutations in proteins using graph-based signatures , 2013, Bioinform..

[13]  M. Michael Gromiha,et al.  CUPSAT: prediction of protein stability upon point mutations , 2006, Nucleic Acids Res..

[14]  Philippe Bogaerts,et al.  Fast and accurate predictions of protein stability changes upon mutations using statistical potentials and neural networks: PoPMuSiC-2.0 , 2009, Bioinform..

[15]  N. Tokuriki,et al.  Modulating protein stability – directed evolution strategies for improved protein function , 2013, The FEBS Journal.

[16]  Piero Fariselli,et al.  I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure , 2005, Nucleic Acids Res..

[17]  M. Stratton,et al.  The cancer genome , 2009, Nature.

[18]  Arlo Z. Randall,et al.  Prediction of protein stability changes for single‐site mutations using support vector machines , 2005, Proteins.

[19]  Bernhard Schölkopf,et al.  Comparing support vector machines with Gaussian kernels to radial basis function classifiers , 1997, IEEE Trans. Signal Process..

[20]  Wagner Meira,et al.  aCSM: noise-free graph-based signatures to large-scale receptor-based ligand prediction , 2013, Bioinform..

[21]  Piero Fariselli,et al.  Correlating disease‐related mutations to their effect on protein stability: A large‐scale analysis of the human proteome , 2011, Human mutation.

[22]  Emidio Capriotti,et al.  Bioinformatics and variability in drug response: a protein structural perspective , 2012, Journal of The Royal Society Interface.

[23]  Chi-Wei Chen,et al.  iStable: off-the-shelf predictor integration for predicting protein stability changes , 2013, BMC Bioinformatics.

[24]  Akinori Sarai,et al.  ProTherm and ProNIT: thermodynamic databases for proteins and protein–nucleic acid interactions , 2005, Nucleic Acids Res..

[25]  David F. Burke,et al.  Andante: reducing side-chain rotamer search space during comparative modeling using environment-specific substitution probabilities , 2007, Bioinform..

[26]  J. R. Quinlan Learning With Continuous Classes , 1992 .