BAMOKNN: A novel computational method for predicting the apoptosis protein locations

In this paper, we propose a novel hybrid binary animal migration optimization (BAMO) with k-nearest neighbor approach (KNN) to predict apoptosis protein sequences using statistical factors and dipeptide composition. Binary animal migration optimization is used for selecting a near-optimal subset of informative features that is most relevant for the classification. K-nearest neighbor approach is used as the classifier with the jackknife cross-validation. Finally, BAMOKNN is tested on a dataset including 317 proteins. Our method achieves the accuracy of 92.43%. Then, our model also tests on a testing dataset including 98 apoptosis proteins and obtains the accuracy of 94.90%. High prediction accuracy and successful prediction of apoptosis proteins suggest that BAMOKNN can be a useful approach to identify apoptosis protein locations.

[1]  K. Chou,et al.  Plant-mPLoc: A Top-Down Strategy to Augment the Power for Predicting Plant Protein Subcellular Localization , 2010, PloS one.

[2]  Q. Pan,et al.  Using pseudo amino acid composition to predict protein subcellular location: approached with amino acid composition distribution , 2008, Amino Acids.

[3]  Ying-Li Chen,et al.  Prediction of apoptosis protein subcellular location using improved hybrid approach and pseudo-amino acid composition. , 2007, Journal of theoretical biology.

[4]  Qiang Cheng,et al.  The Fisher-Markov Selector: Fast Selecting Maximally Separable Feature Subset for Multiclass Classification with Applications to High-Dimensional Data , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Jian-Ding Qiu,et al.  Predicting subcellular location of apoptosis proteins based on wavelet transform and support vector machine , 2010, Amino Acids.

[6]  Paul Horton,et al.  Nucleic Acids Research Advance Access published May 21, 2007 WoLF PSORT: protein localization predictor , 2007 .

[7]  K. Chou,et al.  Recent progress in protein subcellular location prediction. , 2007, Analytical biochemistry.

[8]  Jing Huang,et al.  Support Vector Machines for Predicting Apoptosis Proteins Types , 2005, Acta biotheoretica.

[9]  Ying-Li Chen,et al.  Prediction of the subcellular location of apoptosis proteins. , 2007, Journal of theoretical biology.

[10]  Thomas Martinetz,et al.  Prediction of apoptosis protein locations with genetic algorithms and support vector machines through a new mode of pseudo amino acid composition. , 2010, Protein and peptide letters.

[11]  W. Atchley,et al.  Solving the protein sequence metric problem. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[12]  J. Koenderink Q… , 2014, Les noms officiels des communes de Wallonie, de Bruxelles-Capitale et de la communaute germanophone.

[13]  Chen Ying-li Prediction of the Subcellular Location of Apoptosis Proteins Using the Algorithm of Measure of Diversity , 2004 .

[14]  Minghao Yin,et al.  Animal migration optimization: an optimization algorithm inspired by animal migration behavior , 2014, Neural Computing and Applications.

[15]  B. Rost,et al.  Mimicking cellular sorting improves prediction of subcellular localization. , 2005, Journal of molecular biology.

[16]  Yongsheng Ding,et al.  Using Chou's pseudo amino acid composition to predict subcellular localization of apoptosis proteins: An approach with immune genetic algorithm-based ensemble classifier , 2008, Pattern Recognit. Lett..

[17]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[18]  Zhen-Hui Zhang,et al.  A novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine , 2006, FEBS letters.

[19]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[20]  K. Chou,et al.  PseAAC: a flexible web server for generating various kinds of protein pseudo amino acid composition. , 2008, Analytical biochemistry.