A Novel Approach to Protein Structure Prediction Using PCA Based Extreme Learning Machines and Multiple Kernels

In the area of bio-informatics, large amount of data is harvested with functional and genetic features of proteins. The structure of protein plays an important role in its biological and genetic functions. In this study, we propose a protein structure prediction scheme based novel learning algorithms --- the extreme learning machine and the Support Vector Machine using multiple kernel learning, The experimental validation of the proposed approach on a publicly available protein data set shows a significant improvement in performance of the proposed approach in terms of accuracy of classification of protein folds using multiple kernels where multiple heterogeneous feature space data are available. The proposed method provides the higher recognition ratio as compared to other methods reported in previous studies.

[1]  K. Dill,et al.  The Protein Folding Problem , 1993 .

[2]  Chun-Xia Zhang,et al.  RotBoost: A technique for combining Rotation Forest and AdaBoost , 2008, Pattern Recognit. Lett..

[3]  Loris Nanni,et al.  Ensemble of classifiers for protein fold recognition , 2006, Neurocomputing.

[4]  N.R. Pal,et al.  Prediction of Protein Folds: Extraction of New Features, Dimensionality Reduction, and Fusion of Heterogeneous Classifiers , 2009, IEEE Transactions on NanoBioscience.

[5]  S. Eddy Hidden Markov models. , 1996, Current opinion in structural biology.

[6]  Katarzyna Stapor,et al.  A hybrid discriminative/generative approach to protein fold recognition , 2012, Neurocomputing.

[7]  Xieping Gao,et al.  A novel hierarchical ensemble classifier for protein fold recognition. , 2008, Protein engineering, design & selection : PEDS.

[8]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[9]  M. Madera,et al.  A comparison of profile hidden Markov model procedures for remote homology detection. , 2002, Nucleic acids research.

[10]  Michael Kuperberg,et al.  Markov Models , 2019, Earthquake Statistical Analysis through Multi-state Modeling.

[11]  D. Serre Matrices: Theory and Applications , 2002 .

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[13]  Girija Chetty,et al.  A novel image watermarking scheme using Extreme Learning Machine , 2012, The 2012 International Joint Conference on Neural Networks (IJCNN).

[14]  Dharmendra Sharma,et al.  A Hybrid Approach to Increase the Performance of Protein Folding Recognition Using Support Vector Machines , 2012, MLDM.

[15]  Kuo-Chen Chou,et al.  Ensemble classifier for protein fold pattern recognition , 2006, Bioinform..

[16]  Narasimhan Sundararajan,et al.  Fully complex extreme learning machine , 2005, Neurocomputing.

[17]  Chee Kheong Siew,et al.  Real-time learning capability of neural networks , 2006, IEEE Trans. Neural Networks.

[18]  K. Chou,et al.  Hum-mPLoc: an ensemble classifier for large-scale human protein subcellular location prediction by incorporating samples with multiple sites. , 2007, Biochemical and biophysical research communications.

[19]  Vojislav Kecman,et al.  Protein fold recognition with adaptive local hyperplane algorithm , 2009, 2009 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology.

[20]  Infotech Oulu,et al.  Protein Fold Recognition with K-Local Hyperplane Distance Nearest Neighbor Algorithm , 2004 .

[21]  Yorgos Goletsis,et al.  Sequence-based protein structure prediction using a reduced state-space hidden Markov model , 2007, Comput. Biol. Medicine.

[22]  Dimitrios I. Fotiadis,et al.  Improving the protein fold recognition accuracy of a reduced state-space hidden Markov model , 2009, Comput. Biol. Medicine.

[23]  Chris H. Q. Ding,et al.  Multi-class protein fold recognition using support vector machines and neural networks , 2001, Bioinform..

[24]  Loris Nanni A novel ensemble of classifiers for protein fold recognition , 2006, Neurocomputing.