Vector-G: Multi-Modular SVM-Based Heterotrimeric G Protein Prediction

Heterotrimeric G proteins interact with G protein-coupled receptors in response to stimulation by hormones, neurotransmitters, chemokines, and sensory signals to intracellular signaling cascades. Recently reported studies indicate that G protein subunits play a significant role in different eukaryotic diseases including inflammation, neurological diseases, cardiovascular diseases, endocrine disorders as well as plant pathogen response, infectious hyphae growth, differentiation and virulence of pathogenic fungi. Thus a study of their functions, signaling pathways, and protein interactions may lead to the development of various preventive approaches. The diversity of alpha, beta and gamma subunits of G proteins necessitates a prediction algorithm that helps in the identification of new proteins such as Gbeta where WD-40 repeats are not well characterized. The currently available techniques for finding G proteins are homology based search analyses and wet lab experiments, which are not very effective in finding new classes of proteins. We present here a robust computational method for finding new G proteins and their homologs using a SVM based pattern recognition algorithm. Several physicochemical and compositional properties including dipeptide, tripeptide and hydrophobicity composition are used for generating the SVM classifiers. This method has 96.17%, 95.38%, 97.6% sensitivity and 99.45%, 100%, 100% specificity on test sets for G protein alpha, beta, and gamma subunits, respectively. This algorithm correctly predicts the known alpha, beta and gamma subunits reported in literature. One important contribution of this algorithm is that it helps in improving genome annotation of several proteins as G proteins and serves as a useful tool for comparative genomic analysis of G proteins. Using this method, novel G protein subunits are predicted in 31 genomes covering plant, fungi and animal kingdom. The software is available at the website http://biomine.cs.uah.edu/bioinformatics/svm_prog/scripts/GProteins/vectorg.html. Supplementary files: The supplementary files are available on http://www.bioinfo.de/isb/2008/08/0013/supplementary_ material/.

[1]  Chittibabu Guda,et al.  TARGET: a new method for predicting protein subcellular localization in eukaryotes , 2005, Bioinform..

[2]  J. Ramírez,et al.  The Gβ(KlSte4p) subunit of the heterotrimeric G protein has a positive and essential role in the induction of mating in the yeast Kluyveromyces lactis , 2005 .

[3]  K. Miura,et al.  G‐Protein β3 Subunit Variant C825T is a Risk Factor for Hypertension in Japanese Females ‐A Prospective Cohort Study Over 5 Years , 2006 .

[4]  R. A. Baasiri,et al.  Regulation of Conidiation and Adenylyl Cyclase Levels by the Gα Protein GNA-3 in Neurospora crassa , 2000, Molecular and Cellular Biology.

[5]  J. Heitman,et al.  Cryptococcus neoformans mating and virulence are regulated by the G-protein alpha subunit GPA1 and cAMP. , 1997, Genes & development.

[6]  Wen-cheng Chung,et al.  Suramin Disrupts Receptor-G Protein Coupling by Blocking Association of G Protein α and βγ Subunits , 2005, Journal of Pharmacology and Experimental Therapeutics.

[7]  N. Ryba,et al.  A Novel GTP-binding Protein γ-Subunit, Gγ8, Is Expressed during Neurogenesis in the Olfactory and Vomeronasal Neuroepithelia (*) , 1995, The Journal of Biological Chemistry.

[8]  Jaques Reifman,et al.  Support vector machines with selective kernel scaling for protein classification and identification of key amino acid positions , 2002, Bioinform..

[9]  S Karlin,et al.  Methods and algorithms for statistical analysis of protein sequences. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[10]  M. Fujii,et al.  G-protein γ subunit GNG11 strongly regulates cellular senescence , 2006 .

[11]  H. Hamm,et al.  Heterotrimeric G proteins. , 1996, Current opinion in cell biology.

[12]  Lei Wang,et al.  Heterotrimeric G protein α subunit is involved in rice brassinosteroid response , 2006, Cell Research.

[13]  Srinivasan Ramachandran,et al.  SPAAN: a software program for prediction of adhesins and adhesin-like proteins using neural networks , 2004, Bioinform..

[14]  H. Ishimoto,et al.  G‐protein gamma subunit 1 is required for sugar reception in Drosophila , 2005, The EMBO journal.

[15]  H. Irving,et al.  Nod factors activate both heterotrimeric and monomeric G-proteins in Vigna unguiculata (L.) Walp , 2003, Planta.

[16]  J. Heitman,et al.  Signal transduction cascades regulating pseudohyphal differentiation of Saccharomyces cerevisiae. , 2000, Current opinion in microbiology.

[17]  J. Guarro,et al.  Distinct signalling pathways coordinately contribute to virulence of Fusarium oxysporum on mammalian hosts. , 2006, Microbes and infection.

[18]  O. Melien Heterotrimeric G proteins and disease. , 2007, Methods in molecular biology.

[19]  N. Fuse,et al.  Heterotrimeric G Proteins Regulate Daughter Cell Size Asymmetry in Drosophila Neuroblast Divisions , 2003, Current Biology.

[20]  H. Rootwelt,et al.  The G protein ?3 subunit 825C allele is associated with sudden infant death due to infection , 2006, Acta paediatrica.

[21]  Jill K Thompson,et al.  Gib2, A Novel Gβ-like/RACK1 Homolog, Functions as a Gβ Subunit in cAMP Signaling and Is Essential in Cryptococcus neoformans* , 2006, Journal of Biological Chemistry.

[22]  H. Otani,et al.  G protein signaling mediates developmental processes and pathogenesis of Alternaria alternata. , 2006, Molecular plant-microbe interactions : MPMI.

[23]  J. Robishaw,et al.  Runx-dependent regulation of G-protein γ3 expression in T-cells , 2006 .

[24]  Pantelis G Bagos,et al.  A database for G proteins and their interaction with GPCRs , 2004, BMC Bioinformatics.

[25]  A. Spiegel,et al.  Defects in G protein-coupled signal transduction in human disease. , 1996, Annual review of physiology.

[26]  G. Roman,et al.  G(o) signaling is required for Drosophila associative learning , 2006, Nature Neuroscience.

[27]  K. Borkovich,et al.  A G-Protein β Subunit Required for Sexual and Vegetative Development and Maintenance of Normal Gα Protein Levels in Neurospora crassa , 2002, Eukaryotic Cell.

[28]  J. Robishaw,et al.  Cloning and sequencing of a rat heart cDNA encoding a G-protein beta subunit related to the human retinal beta 3 subunit. , 1994, Gene.

[29]  Kuo-Chen Chou,et al.  Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes , 2005, Bioinform..

[30]  Gajendra P S Raghava,et al.  Prediction of Mitochondrial Proteins Using Support Vector Machine and Hidden Markov Model* , 2006, Journal of Biological Chemistry.

[31]  C. Bardin,et al.  Reduced expression of multiple forms of the alpha subunit of the stimulatory GTP-binding protein in pseudohypoparathyroidism type Ia. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[32]  R. Plasterk,et al.  The G‐protein γ subunit gpc‐1 of the nematode C.elegans is involved in taste adaptation , 2002 .

[33]  Michael J. Wise,et al.  The POPPs: clustering and searching using peptide probability profiles , 2002, ISMB.

[34]  H. Bourne,et al.  The expanding spectrum of G protein diseases. , 1999, The New England journal of medicine.

[35]  B. Zinman,et al.  G protein beta3 subunit gene variant and blood pressure variation in Canadian Oji-Cree. , 1998, HYPERTENSION.

[36]  K. Borkovich,et al.  The Heterotrimeric G-Protein Subunits GNG-1 and GNB-1 Form a Gβγ Dimer Required for Normal Female Fertility, Asexual Development, and Gα Protein Levels in Neurospora crassa , 2005, Eukaryotic Cell.

[37]  Rolf Apweiler,et al.  InterProScan: protein domains identifier , 2005, Nucleic Acids Res..

[39]  Alan M. Jones,et al.  Differential Roles of Arabidopsis Heterotrimeric G-Protein Subunits in Modulating Cell Division in Roots1[W] , 2006, Plant Physiology.

[40]  M. Hahn,et al.  Different signalling pathways involving a Gα protein, cAMP and a MAP kinase control germination of Botrytis cinerea conidia , 2006, Molecular microbiology.

[41]  D. Zaharevitz,et al.  Anticancer Activity of BIM-46174, a New Inhibitor of the Heterotrimeric Gα/Gβγ Protein Complex , 2006 .

[42]  Amos Bairoch,et al.  ScanProsite: a reference implementation of a PROSITE scanning tool. , 2002, Applied bioinformatics.

[43]  E. Cuppen,et al.  The G-protein beta-subunit GPB-2 in Caenorhabditis elegans regulates the G(o)alpha-G(q)alpha signaling network through interactions with the regulator of G-protein signaling proteins EGL-10 and EAT-16. , 2001, Genetics.

[44]  R. Roberts,et al.  Human Genome and Diseases:¶WD-repeat proteins: structure characteristics, biological function, and their involvement in human diseases , 2001, Cellular and Molecular Life Sciences CMLS.

[45]  J. Heitman,et al.  The G-Protein β Subunit GPB1 Is Required for Mating and Haploid Fruiting in Cryptococcus neoformans , 2000, Molecular and Cellular Biology.

[46]  Yanay Ofran,et al.  Proteins of the same fold and unrelated sequences have similar amino acid composition , 2006, Proteins.

[47]  David Haussler,et al.  Classifying G-protein coupled receptors with support vector machines , 2002, Bioinform..

[48]  Arun Krishnan,et al.  pSLIP: SVM based protein subcellular localization prediction using multiple physicochemical properties , 2005, BMC Bioinformatics.

[49]  S. Assmann G Protein Regulation of Disease Resistance During Infection of Rice with Rice Blast Fungus , 2005, Science's STKE.

[50]  Yuichi Iino,et al.  Goα regulates olfactory adaptation by antagonizing Gqα-DAG signaling in Caenorhabditis elegans , 2006 .

[51]  J. Heitman,et al.  The Gα Protein Gpa2 Controls Yeast Differentiation by Interacting with Kelch Repeat Proteins that Mimic Gβ Subunits , 2002 .

[52]  P. Schenk,et al.  Heterotrimeric G Proteins Facilitate Arabidopsis Resistance to Necrotrophic Pathogens and Are Involved in Jasmonate Signaling1 , 2005, Plant Physiology.

[53]  Thorsten Joachims,et al.  Making large scale SVM learning practical , 1998 .

[54]  Marie E Burns,et al.  Prolonged Photoresponses and Defective Adaptation in Rods of Gβ5-/- Mice , 2003, The Journal of Neuroscience.

[55]  Yuji Imaizumi,et al.  Functional Interaction between T2R Taste Receptors and G-Protein α Subunits Expressed in Taste Receptor Cells , 2003, The Journal of Neuroscience.

[56]  J. Esko,et al.  Amino acid determinants that drive heparan sulfate assembly in a proteoglycan. , 1994, The Journal of biological chemistry.

[57]  J Zuegge,et al.  Deciphering apicoplast targeting signals--feature extraction from nuclear-encoded precursors of Plasmodium falciparum apicoplast proteins. , 2001, Gene.

[58]  li her,et al.  Neurospora crassa , 2013 .

[59]  Temple F. Smith,et al.  The WD repeat: a common architecture for diverse functions. , 1999, Trends in biochemical sciences.

[60]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[61]  Alan M. Jones,et al.  A Golgi-localized hexose transporter is involved in heterotrimeric G protein-mediated early development in Arabidopsis. , 2006, Molecular biology of the cell.

[62]  O. Hiort,et al.  Determination of Gsα a Protein Activity in Albright's Hereditary Osteodystrophy , 2006, Journal of pediatric endocrinology & metabolism : JPEM.

[63]  Enrique García Jordá,et al.  Mus musculus. , 2020, Clinical & translational oncology : official publication of the Federation of Spanish Oncology Societies and of the National Cancer Institute of Mexico.

[64]  S. Offermanns,et al.  G12/G13 Family G Proteins Regulate Marginal Zone B Cell Maturation, Migration, and Polarization1 , 2006, The Journal of Immunology.

[65]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[66]  H. Dohlman,et al.  Activation of the Phosphatidylinositol 3-Kinase Vps34 by a G Protein α Subunit at the Endosome , 2006, Cell.

[67]  G. Schneider,et al.  How many potentially secreted proteins are contained in a bacterial genome? , 1999, Gene.