Classifying G-Protein Coupled Receptors with Hydropathy Blocks and Support Vector Machines

This paper developes a new method for recognizing G-Protein Coupled Receptors (GPCRs) based on features generated from the hydropathy properties of the amino acid sequences. Using the hydropathy characteristics, namely hydropathy blocks, the protein sequences are converted into fixed-dimensional feature vectors. Subsequently, the Support Vector Machine (SVM) classifier is utilized to identify the GPCR proteins belonging to the same families or subfamilies. The experimental results on GPCR datasets show that the proteins belonging to the same family or subfamily can be identified using features generated based on the hydropathy blocks.

[1]  Douglas L. Brutlag,et al.  Bayesian Segmentation of Protein Secondary Structure , 2000, J. Comput. Biol..

[2]  Richard M. Everson,et al.  Intelligent Data Engineering and Automated Learning – IDEAL 2004 , 2004, Lecture Notes in Computer Science.

[3]  David Haussler,et al.  Classifying G-protein coupled receptors with support vector machines , 2002, Bioinform..

[4]  D. Brutlag,et al.  Discovering structural correlations in α‐helices , 1994 .

[5]  Jonas S. Almeida,et al.  Alignment-free sequence comparison-a review , 2003, Bioinform..

[6]  W. Taylor,et al.  Identification of protein sequence homology by consensus template alignment. , 1986, Journal of molecular biology.

[7]  D. Haussler,et al.  Hidden Markov models in computational biology. Applications to protein modeling. , 1993, Journal of molecular biology.

[8]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[9]  M. Bouvier,et al.  Structural and functional aspects of G protein-coupled receptor oligomerization. , 1998, Biochemistry and cell biology = Biochimie et biologie cellulaire.

[10]  Josef Pánek,et al.  A new method for identification of protein (sub)families in a set of proteins based on hydropathy distribution in proteins , 2005, Proteins.

[11]  D. Slotboom,et al.  Hydropathy profile alignment: a tool to search for structural homologues of membrane proteins. , 1998, FEMS microbiology reviews.

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[13]  Xing-Ming Zhao,et al.  A Novel Hybrid GA/SVM System for Protein Sequences Classification , 2004, IDEAL.

[14]  Gajendra P. S. Raghava,et al.  GPCRpred: an SVM-based method for prediction of families and subfamilies of G-protein coupled receptors , 2004, Nucleic Acids Res..

[15]  M. Degli Esposti,et al.  A critical evaluation of the hydropathy profile of membrane proteins. , 1990, European journal of biochemistry.

[16]  D. Slotboom,et al.  Estimation of structural similarity of membrane proteins by hydropathy profile alignment. , 1998, Molecular membrane biology.