Improved method for predicting ?-turn using support vector machine

Motivation: Numerous methods for predicting β-turns in proteins have been developed based on various computational schemes. Here, we introduce a new method of β-turn prediction that uses the support vector machine (SVM) algorithm together with predicted secondary structure information. Various parameters from the SVM have been adjusted to achieve optimal prediction performance. Results: The SVM method achieved excellent performance as measured by the Matthews correlation coefficient (MCC = 0.45) using a 7-fold cross validation on a database of 426 non-homologous protein chains. To our best knowledge, this MCC value is the highest achieved so far for predicting β-turn. The overall prediction accuracy Qtotal was 77.3%, which is the best among the existing prediction methods. Among its unique attractive features, the present SVM method avoids overtraining and compresses information and provides a predicted reliability index. Availability: The algorithm is available via a web server on: http://serine.umdnj.edu/~zhangq3/betaturn/ Contact: welshwj@umdnj.edu Supplementary information: http://serine.umdnj.edu/~zhangq3/betaturn

[1]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[2]  G P S Raghava,et al.  An evaluation of beta-turn prediction methods. , 2002, Bioinformatics.

[3]  Chun-Ting Zhang,et al.  Prediction of β‐turns in proteins by 1‐4 and 2‐3 correlation model , 1997 .

[4]  K. Takano,et al.  Role of amino acid residues at turns in the conformational stability and folding of human lysozyme. , 1999, Biochemistry.

[5]  J. Thornton,et al.  Beta-turns and their distortions: a proposed new nomenclature. , 1990, Protein engineering.

[7]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[8]  William J Welsh,et al.  Detecting hidden sequence propensity for amyloid fibril formation , 2004, Protein science : a publication of the Protein Society.

[9]  J. Thornton,et al.  PROMOTIF—A program to identify and analyze structural motifs in proteins , 1996, Protein science : a publication of the Protein Society.

[10]  K. Chou,et al.  Prediction of beta-turns. , 1979, Journal of protein chemistry.

[11]  Kuo-Chen Chou,et al.  Support vector machines for the classification and prediction of β‐turn types , 2002, Journal of peptide science : an official publication of the European Peptide Society.

[12]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[13]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[14]  G. Singh Prediction of-turns in proteins from multiple alignment using neural network , 2002 .

[15]  Quan Pan,et al.  Classification of protein quaternary structure with support vector machine , 2003, Bioinform..

[16]  Yingdong Zhao,et al.  Application of support vector machines for T-cell epitopes prediction , 2003, Bioinform..

[17]  Thy-Hou Lin,et al.  Prediction of beta-turns in proteins using the first-order Markov models. , 2002, Journal of chemical information and computer sciences.

[18]  Gajendra P. S. Raghava,et al.  An evaluation of ß-turn prediction methods , 2002, Bioinform..

[19]  Hu Chen,et al.  A novel method for protein secondary structure prediction using dual‐layer SVM and profiles , 2004, Proteins.

[20]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[21]  K. Chou,et al.  Prediction of β-turns , 2009 .

[22]  K C Chou,et al.  Prediction of tight turns and their types in proteins. , 2000, Analytical biochemistry.

[23]  G. Rose,et al.  Turns in peptides and proteins. , 1985, Advances in protein chemistry.

[24]  P. Y. Chou,et al.  Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. , 1974, Biochemistry.

[25]  Gajendra Pal Singh Raghava,et al.  Prediction of β‐turns in proteins from multiple alignment using neural network , 2003, Protein science : a publication of the Protein Society.

[26]  J. Richardson,et al.  The anatomy and taxonomy of protein structure. , 1981, Advances in protein chemistry.

[27]  K. Guruprasad,et al.  Beta-and gamma-turns in proteins revisited: a new set of amino acid turn-type dependent positional preferences and potentials. , 2000, Journal of biosciences.

[28]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[29]  Bernard F. Buxton,et al.  Secondary structure prediction with support vector machines , 2003, Bioinform..

[30]  J. Gibrat,et al.  Further developments of protein secondary structure prediction using information theory. New parameters and consideration of residue pairs. , 1987, Journal of molecular biology.

[31]  D Gorse,et al.  Prediction of the location and type of β‐turns in proteins using neural networks , 1999, Protein science : a publication of the Protein Society.

[32]  S. Hua,et al.  A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach. , 2001, Journal of molecular biology.

[33]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.