A neural network based approach for protein structural class prediction

The paper provides an alternative approach to protein structural class prediction employing artificial neural network. Existing works on protein structural class prediction are computationally intensive. The method employs SOFM for extraction of representative feature vectors, for the four different structural classes and then uses Principal Component Analysis for finding optimum feature vector dimension. Nearest neighborhood classification technique is finally utilised; to classify these protein datapoints to their respective classes. The proposed work presented in this paper, maintains the same level of classification accuracy in minimum computation time, as it employs most prominent and reduced number of feature set for classification.

[1]  C. DeLisi,et al.  Prediction of protein structural class from the amino acid sequence , 1986, Biopolymers.

[2]  Chris H. Q. Ding,et al.  Multi-class protein fold recognition using support vector machines and neural networks , 2001, Bioinform..

[3]  K. Chou,et al.  An optimization approach to predicting protein structural class from amino acid composition , 1992, Protein science : a publication of the Protein Society.

[4]  G Deléage,et al.  An algorithm for protein secondary structure prediction based on class prediction. , 1987, Protein engineering.

[5]  K. Chou A novel approach to predicting protein structural classes in a (20–1)‐D amino acid composition space , 1995, Proteins.

[6]  Lindsay I. Smith,et al.  A tutorial on Principal Components Analysis , 2002 .

[7]  K. Chou,et al.  A key driving force in determination of protein structural classes. , 1999, Biochemical and biophysical research communications.

[8]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[9]  Klaus Schulten,et al.  Self-organizing maps: ordering, convergence properties and energy functions , 1992, Biological Cybernetics.

[10]  T. Kikuchi,et al.  Discrimination of folding types of globular proteins based on average distance maps constructed from their sequences , 1993, Journal of protein chemistry.

[11]  J. Richardson,et al.  The anatomy and taxonomy of protein structure. , 1981, Advances in protein chemistry.

[12]  J M Thornton,et al.  Analysis of domain structural class using an automated class assignment protocol. , 1996, Journal of molecular biology.

[13]  Y Cai,et al.  Prediction of protein structural classes by neural network. , 2000, Biochimie.

[14]  Jiang Wang,et al.  Prediction of protein structural class with Rough Sets , 2006, BMC Bioinformatics.

[15]  O. Ptitsyn,et al.  Similarities of protein topologies: evolutionary divergence, functional convergence or principles of folding? , 1980, Quarterly Reviews of Biophysics.

[16]  K. Chou,et al.  Predicting protein structural classes from amino acid composition: application of fuzzy clustering. , 1995, Protein engineering.

[17]  Z Zhang,et al.  Prediction of the Secondary Structure Contents of Globular Proteins Based on Three Structural Classes , 1998, Journal of protein chemistry.

[18]  J M Chandonia,et al.  Neural networks for secondary structure and structural class predictions , 1995, Protein science : a publication of the Protein Society.

[19]  Yu-Dong Cai,et al.  Support Vector Machines for predicting protein structural class , 2001, BMC Bioinformatics.

[20]  R. Jernigan,et al.  Understanding the recognition of protein structural classes by amino acid composition , 1997, Proteins.

[21]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[22]  K. Nishikawa,et al.  Classification of proteins into groups based on amino acid composition and other characters. I. Angular distribution. , 1983, Journal of biochemistry.

[23]  P. Y. Chou,et al.  Prediction of Protein Structural Classes from Amino Acid Compositions , 1989 .

[24]  J. Richardson,et al.  β-Sheet topology and the relatedness of proteins , 1977, Nature.

[25]  Teuvo Kohonen,et al.  The self-organizing map , 1990 .

[26]  D. Connelly,et al.  Cross‐validation of protein structural class prediction using statistical clustering and neural networks , 1993, Protein science : a publication of the Protein Society.

[27]  K. Nishikawa,et al.  Classification of proteins into groups based on amino acid composition and other characters. II. Grouping into four types. , 1983, Journal of biochemistry.

[28]  Chris Sander,et al.  Protein folds and families: sequence and structure alignments , 1999, Nucleic Acids Res..