Prediction of protein structural classes based on feature selection technique

The prediction of protein structural classes is beneficial to understanding folding patterns, functions and interactions of proteins. In this study, we proposed a feature selection-based method to accurately predict protein structural classes. Three datasets with sequence identity lower than 25% were used to test the prediction performance of the method. Through jackknife cross-validation, we have verified that the overall accuracies of these three datasets are 92.1%, 89.7% and 84.0%, respectively. The proposed method is more efficient and accurate than other existing methods. The present study will offer an excellent alternative to other methods for predicting protein structural classes.

[1]  Xiao-Qing Yu,et al.  Predicting protein structural class by incorporating patterns of over-represented k-mers into the general form of Chou's PseAAC. , 2012, Protein and peptide letters.

[2]  Zu-Guo Yu,et al.  Prediction of protein structural classes by recurrence quantification analysis based on chaos game representation. , 2009 .

[3]  Lukasz A. Kurgan,et al.  SCPRED: Accurate prediction of protein structural class for sequences of twilight-zone similarity with predicting sequences , 2008, BMC Bioinformatics.

[4]  Hui Ding,et al.  The prediction of protein structural class using averaged chemical shifts , 2012, Journal of biomolecular structure & dynamics.

[5]  Cangzhi Jia,et al.  A high-accuracy protein structural class prediction algorithm using predicted secondary structural information. , 2010, Journal of theoretical biology.

[6]  Xiaoqi Zheng,et al.  Prediction of protein structural class for low-similarity sequences using support vector machine and PSI-BLAST profile. , 2010, Biochimie.

[7]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[8]  Zong Dai,et al.  Prediction of protein structural classes by Chou’s pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis , 2009, Amino Acids.

[9]  V. P. Whittaker,et al.  The specificity of the human erythrocyte cholinesterase. , 1948, The Biochemical journal.

[10]  C. Chothia,et al.  Structural patterns in globular proteins , 1976, Nature.

[11]  Qianzhong Li,et al.  Using pseudo amino acid composition to predict protein structural class: Approached by incorporating 400 dipeptide components , 2007, J. Comput. Chem..

[12]  Xiaoqi Zheng,et al.  Accurate prediction of protein structural class using auto covariance transformation of PSI-BLAST profiles , 2011, Amino Acids.

[13]  Yuri N Zhuravlev Definition by Means of Indefiniteness , 2012, Journal of biomolecular structure & dynamics.

[14]  Lukasz A. Kurgan,et al.  Prediction of protein structural class using novel evolutionary collocation‐based sequence representation , 2008, J. Comput. Chem..

[15]  Chao Chen,et al.  Dual-layer wavelet SVM for predicting protein structural class via the general form of Chou's pseudo amino acid composition. , 2012, Protein and peptide letters.

[16]  Chih-Jen Lin,et al.  Working Set Selection Using Second Order Information for Training Support Vector Machines , 2005, J. Mach. Learn. Res..

[17]  Luhua Lai,et al.  Sequence preference of α-helix N-terminal tetrapeptide. , 2012, Protein and peptide letters.

[18]  Lihua Li,et al.  Improving protein structural class prediction using novel combined sequence information and predicted secondary structural features , 2011, J. Comput. Chem..

[19]  Lukasz A. Kurgan,et al.  Prediction of structural classes for protein sequences and domains - Impact of prediction algorithms, sequence representation and homology, and test procedures on accuracy , 2006, Pattern Recognit..

[20]  C. Zhang,et al.  Prediction of protein (domain) structural classes based on amino-acid index. , 1999, European journal of biochemistry.

[21]  Lukasz A. Kurgan,et al.  Modular prediction of protein structural classes from sequences of twilight-zone identity with predicting sequences , 2009, BMC Bioinformatics.

[22]  Xin Chen,et al.  Prediction of protein structural classes for low-homology sequences based on predicted secondary structure , 2010, BMC Bioinformatics.

[23]  Guo-Ping Zhou,et al.  An Intriguing Controversy over Protein Structural Class Prediction , 1998, Journal of protein chemistry.

[24]  Y D Cai,et al.  Using neural networks for prediction of domain structural classes. , 2000, Biochimica et biophysica acta.

[25]  Sheng-You Huang,et al.  Structural class tendency of polypeptide: A new conception in predicting protein structural class , 2007 .

[26]  Lukasz A. Kurgan,et al.  Secondary structure-based assignment of the protein structural classes , 2008, Amino Acids.

[27]  G. Fasman Prediction of Protein Structure and the Principles of Protein Conformation , 2012, Springer US.

[28]  Lukasz Kurgan,et al.  Prediction of protein structural class for the twilight zone sequences. , 2007, Biochemical and biophysical research communications.

[29]  Gazi Mohammad Shafiullah,et al.  Protein structural class prediction using support vector machine , 2010, International Conference on Electrical & Computer Engineering (ICECE 2010).

[30]  G. Fasman,et al.  Chou-Fasman Prediction of the Secondary Structure of Proteins , 1989 .

[31]  Yang Li,et al.  A novel protein structural classes prediction method based on predicted secondary structure. , 2012, Biochimie.

[32]  Leszek Konieczny,et al.  A tabular approach to the sequence-to-structure relation in proteins (tetrapeptide representation) for de novo protein design. , 2006, Medical science monitor : international medical journal of experimental and clinical research.

[33]  S Rackovsky On the nature of the protein folding code. , 1993, Proceedings of the National Academy of Sciences of the United States of America.

[34]  Liaofu Luo,et al.  Use of  tetrapeptide signals for protein secondary-structure prediction , 2008, Amino Acids.

[35]  Angelo M Facchiano,et al.  Prediction of the protein structural class by specific peptide frequencies. , 2009, Biochimie.