An Efficient Approach for the Prediction of G-Protein Coupled Receptors and Their Subfamilies

G-protein coupled receptors are responsible for many physiochemical processes such as neurotransmission, metabolism, cellular growth and immune response. So it necessary to design a robust and efficient approach for the prediction of G-protein coupled receptors their subfamilies. To address the issue of efficient classification G-protein coupled receptors and their subfamilies, here in this paper we propose to use a weighted k-nearest neighbor classifier with UNION of best 50 features selected by Fisher score based feature selection, ReliefF, fast correlation based filter, minimum redundancy maximum relevancy and support vector machine based recursive feature elimination feature selection methods. The proposed method achieved an overall accuracy of 99.9, 98.3 % MCC values of 1.00, 0.98 ROC area values of 1.00, 0.998 and precision of 99.9 and 98.3 % using 10-fold cross validation to predict the G-protein coupled receptors and their subfamilies respectively.

[1]  Chris H. Q. Ding,et al.  Minimum Redundancy Feature Selection from Microarray Gene Expression Data , 2005, J. Bioinform. Comput. Biol..

[2]  Adam Godzik,et al.  Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences , 2006, Bioinform..

[3]  Xin Chen,et al.  An improved classification of G-protein-coupled receptors using sequence-derived features , 2010, BMC Bioinformatics.

[4]  Z. R. Li,et al.  Update of PROFEAT: a web server for computing structural and physicochemical features of proteins and peptides from amino acid sequence , 2006, Nucleic Acids Res..

[5]  Q Gu,et al.  Prediction of G-protein-coupled receptor classes in low homology using Chou's pseudo amino acid composition with approximate entropy and hydrophobicity patterns. , 2010, Protein and peptide letters.

[6]  Kong Yinghu Combined feature selection of ReliefF-SVM RFE used in face recognition , 2013 .

[7]  Gajendra P. S. Raghava,et al.  GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors , 2004, Nucleic Acids Res..

[8]  Zheng-Zhi Wang,et al.  Classification of G-protein coupled receptors at four levels. , 2006, Protein engineering, design & selection : PEDS.

[9]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[10]  Larry A. Rendell,et al.  The Feature Selection Problem: Traditional Methods and a New Algorithm , 1992, AAAI.

[11]  Huan Liu,et al.  Feature Selection for High-Dimensional Data: A Fast Correlation-Based Filter Solution , 2003, ICML.

[12]  Gajendra P. S. Raghava,et al.  GPCRsclass: a web tool for the classification of amine type of G-protein-coupled receptors , 2005, Nucleic Acids Res..

[13]  Yongsheng Ding,et al.  Binary particle swarm optimization based prediction of G-protein-coupled receptor families with feature selection , 2009, GEC '09.