Using a Hierarchical Classification Model to Predict Protein Tertiary Structure

To predict protein tertiary structure accurately is helpful for understanding the functions of proteins. In this study, a hierarchical classification method based on flexible neural tree was proposed to predict the structures, in which the tier classifiers were flexible neural trees due to their excellent performances. In order to classify the structures, three types of feature are used, i.e. the tripeptide composed of dimension reduction, the pseudo amino acid composition and the position information of amino acid residues. To evaluate our method, the 640 data set was used in this investigation. The experimental results suggest that our method overwhelms several representative approaches to predicting protein tertiary structure.

[1]  K Nishikawa,et al.  The folding type of a protein is relevant to the amino acid composition. , 1986, Journal of biochemistry.

[2]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[3]  K. Chou A novel approach to predicting protein structural classes in a (20–1)‐D amino acid composition space , 1995, Proteins.

[4]  Jiwen Dong,et al.  Time-series forecasting using flexible neural tree model , 2005, Inf. Sci..

[5]  K. Chou,et al.  A key driving force in determination of protein structural classes. , 1999, Biochemical and biophysical research communications.

[6]  K. Chou Prediction of protein cellular attributes using pseudo‐amino acid composition , 2001, Proteins.

[7]  C. Zhang,et al.  A joint prediction of the folding types of 1490 human proteins from their genetic codons. , 1993, Journal of theoretical biology.

[8]  Kuo-Chen Chou,et al.  Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes , 2005, Bioinform..

[9]  Yang Li,et al.  A novel protein structural classes prediction method based on predicted secondary structure. , 2012, Biochimie.

[10]  Yuehui Chen,et al.  Exchange Rate Forecasting Using Flexible Neural Trees , 2006, ISNN.

[11]  S. Brunak,et al.  Predicting subcellular localization of proteins based on their N-terminal amino acid sequence. , 2000, Journal of molecular biology.

[12]  Jürgen Schmidhuber,et al.  Probabilistic Incremental Program Evolution: Stochastic Search Through Program Space , 1997, ECML.

[13]  Rafal Salustowicz,et al.  Probabilistic Incremental Program Evolution , 1997, Evolutionary Computation.

[14]  Ying-Li Chen,et al.  Prediction of the subcellular location of apoptosis proteins. , 2007, Journal of theoretical biology.

[15]  Peter Nordin,et al.  Genetic programming - An Introduction: On the Automatic Evolution of Computer Programs and Its Applications , 1998 .

[16]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[17]  Bo Yang,et al.  Feature selection and classification using flexible neural tree , 2006, Neurocomputing.

[18]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.