Binary particle swarm optimization based prediction of G-protein-coupled receptor families with feature selection

G-protein-coupled receptors (GPCRs), the largest family of membrane protein, play an important role in production of therapeutic drugs. The functions of GPCRs are closely correlated with their families. It is crucial to develop powerful tools to predict GPCRs families. In this study, Binary particle swarm optimization (BPSO) algorithm, which has a better optimization performance on discrete binary variables than particle swarm optimization (PSO), is applied to extract effective feature for amino acids pair compositions of GPCRs protein sequence. Ensemble classifier is used as prediction engine, of which the basic classifier is the fuzzy K-nearest neighbor (FKNN). Each basic classifier is trained with different feature sets. The results obtained by jackknife test are quite encouraging, indicating that the proposed method might become a potentially useful tool for GPCR prediction, or play a complimentary role to the existing methods in the relevant areas.

[1]  Yongsheng Ding,et al.  Protein Subcellular Location Prediction Based on Pseudo Amino Acid Composition and Immune Genetic Algorithm , 2006, ICIC.

[2]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[3]  G. Li,et al.  Classifying G protein-coupled receptors and nuclear receptors on the basis of protein power spectrum from fast Fourier transform , 2006, Amino Acids.

[4]  K. Chou,et al.  Recent progress in protein subcellular location prediction. , 2007, Analytical biochemistry.

[5]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[6]  Lukasz A. Kurgan,et al.  Prediction of structural classes for protein sequences and domains - Impact of prediction algorithms, sequence representation and homology, and test procedures on accuracy , 2006, Pattern Recognit..

[7]  Russell C. Eberhart,et al.  A discrete binary version of the particle swarm algorithm , 1997, 1997 IEEE International Conference on Systems, Man, and Cybernetics. Computational Cybernetics and Simulation.

[8]  Kuo-Chen Chou,et al.  Prediction of G-protein-coupled receptor classes. , 2005, Journal of proteome research.

[9]  Kuo-Chen Chou,et al.  Fuzzy KNN for predicting membrane protein types from pseudo-amino acid composition. , 2006, Journal of theoretical biology.

[10]  H. Hamm,et al.  Roles of G-protein-coupled receptor signaling in cancer biology and gene transcription. , 2007, Current opinion in genetics & development.

[11]  K. Chou,et al.  Euk-mPLoc: a fusion classifier for large-scale eukaryotic protein subcellular location prediction by incorporating multiple sites. , 2007, Journal of proteome research.

[12]  Kuo-Chen Chou,et al.  Prediction of protein structure classes with pseudo amino acid composition and fuzzy support vector machine network. , 2007, Protein and peptide letters.

[13]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  Yongsheng Ding,et al.  Prediction of protein subcellular location using hydrophobic patterns of amino acid sequence , 2006, Comput. Biol. Chem..

[15]  Y. Rahmat-Samii,et al.  Particle swarm optimization in electromagnetics , 2004, IEEE Transactions on Antennas and Propagation.

[16]  K. Chou,et al.  Using maximum entropy model to predict protein secondary structure with single sequence. , 2009, Protein and peptide letters.

[17]  Yong-Sheng Ding,et al.  Prediction of subcellular location apoptosis proteins with ensemble classifier and feature selection , 2010, Amino Acids.

[18]  Kuo-Chen Chou,et al.  Prediction protein structural classes with pseudo-amino acid composition: approximate entropy and hydrophobicity pattern. , 2008, Journal of theoretical biology.

[19]  Gert Vriend,et al.  GPCRDB information system for G protein-coupled receptors , 2003, Nucleic Acids Res..

[20]  Peixiang Cai,et al.  Predicting protein structural class with pseudo-amino acid composition and support vector machine fusion network. , 2006, Analytical biochemistry.

[21]  R. Stevens,et al.  High-Resolution Crystal Structure of an Engineered Human β2-Adrenergic G Protein–Coupled Receptor , 2007, Science.

[22]  Xavier Deupi,et al.  Conformational complexity of G-protein-coupled receptors. , 2007, Trends in pharmacological sciences.

[23]  David Haussler,et al.  Classifying G-protein coupled receptors with support vector machines , 2002, Bioinform..

[24]  Tongliang Zhang,et al.  Using pseudo amino acid composition and binary-tree support vector machines to predict protein structural classes , 2007, Amino Acids.

[25]  Gajendra P. S. Raghava,et al.  GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors , 2004, Nucleic Acids Res..

[26]  Q Gu,et al.  Prediction of G-protein-coupled receptor classes in low homology using Chou's pseudo amino acid composition with approximate entropy and hydrophobicity patterns. , 2010, Protein and peptide letters.

[27]  Xiaoying Jiang,et al.  Using the concept of Chou's pseudo amino acid composition to predict apoptosis proteins subcellular location: an approach by approximate entropy. , 2008, Protein and peptide letters.

[28]  K. Palczewski,et al.  Crystal Structure of Rhodopsin: A G‐Protein‐Coupled Receptor , 2002, Chembiochem : a European journal of chemical biology.

[29]  K. Palczewski,et al.  Crystal Structure of Rhodopsin: A G‐Protein‐Coupled Receptor , 2000, Science.

[30]  Cheol-Goo Hur,et al.  A combined approach for the classification of G protein-coupled receptors and its application to detect GPCR splice variants , 2007, Comput. Biol. Chem..

[31]  Stavros J. Hamodrakas,et al.  Bioinformatics Original Paper Prediction of the Coupling Specificity of Gpcrs to Four Families of G-proteins Using Hidden Markov Models and Artificial Neural Networks , 2022 .

[32]  K. Chou Prediction of protein cellular attributes using pseudo‐amino acid composition , 2001, Proteins.