Disulphide Bridge Prediction using Fuzzy Support Vector Machines

One of the major contributors to the native form of protein is cystines forming covalent bonds in oxidized state. The prediction of such bridges from the sequence is a very challenging task given that the number of bridges rises exponentially as the number of cystines increases. We propose a novel technique for disulphide bridge prediction based on fuzzy support vector machines. We call the system dizzy. In our investigation, we look at disulphide bond connectivity given two cystines with and without a priori knowledge of the bonding state. We make use of a new encoding scheme based on physico-chemical properties and statistical features such as the probability of occurrence of each amino acid in different secondary structure states along with psiblast profiles. The performance is compared with normal support vector machines. We evaluate our method and compare it with the existing method using SPX dataset

[1]  Ram Samudrala,et al.  Ab initio protein structure prediction using a combined hierarchical approach , 1999, Proteins.

[2]  Piero Fariselli,et al.  Prediction of the disulfide bonding state of cysteines in proteins with hidden neural networks. , 2002, Protein engineering.

[3]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[4]  R. Raines,et al.  Contribution of disulfide bonds to the conformational stability and catalytic activity of ribonuclease A. , 2000, European journal of biochemistry.

[5]  Paolo Frasconi,et al.  A two-stage SVM architecture for predicting the disulfide bonding state of cysteines , 2002, Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing.

[6]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[7]  Pierre Baldi,et al.  Large-Scale Prediction of Disulphide Bond Connectivity , 2004, NIPS.

[8]  Paolo Frasconi,et al.  Disulfide connectivity prediction using recursive neural networks and evolutionary information , 2004, Bioinform..

[9]  J. Gern The Sequence of the Human Genome , 2001, Science.

[10]  Hyunsoo Kim,et al.  Protein secondary structure prediction based on an improved support vector machines approach. , 2003, Protein engineering.

[11]  P. Y. Chou,et al.  Conformational parameters for amino acids in helical, beta-sheet, and random coil regions calculated from proteins. , 1974, Biochemistry.

[12]  Piero Fariselli,et al.  Prediction of disulfide connectivity in proteins , 2001, Bioinform..

[13]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[14]  Thomas Earnest,et al.  Automation of X-ray crystallography , 2000, Nature Structural Biology.

[15]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[16]  P Fariselli,et al.  Role of evolutionary information in predicting the disulfide‐bonding state of cysteine in proteins , 1999, Proteins.

[17]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[18]  Sheng-De Wang,et al.  Fuzzy support vector machines , 2002, IEEE Trans. Neural Networks.

[19]  Peter Clote,et al.  Disulfide connectivity prediction using secondary structure information and diresidue frequencies , 2005, Bioinform..

[20]  W R Taylor,et al.  A model recognition approach to the prediction of all-helical membrane protein structure and topology. , 1994, Biochemistry.

[21]  Anastassis Perrakis,et al.  Current state of automated crystallographic data analysis , 2000, Nature Structural Biology.

[22]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[23]  Pierre Baldi,et al.  Large‐scale prediction of disulphide bridges using kernel methods, two‐dimensional recursive neural networks, and weighted graph matching , 2005, Proteins.

[24]  András Fiser,et al.  Predicting the oxidation state of cysteines by multiple sequence alignment , 2000, Bioinform..

[25]  Marimuthu Palaniswami,et al.  Incremental training of support vector machines , 2005, IEEE Transactions on Neural Networks.

[26]  R. Grantham Amino Acid Difference Formula to Help Explain Protein Evolution , 1974, Science.