Incorporating efficient radial basis function networks and significant amino acid pairs for predicting GTP binding sites in transport proteins

BackgroundGuanonine-protein (G-protein) is known as molecular switches inside cells, and is very important in signals transmission from outside to inside cell. Especially in transport protein, most of G-proteins play an important role in membrane trafficking; necessary for transferring proteins and other molecules to a variety of destinations outside and inside of the cell. The function of membrane trafficking is controlled by G-proteins via Guanosine triphosphate (GTP) binding sites. The GTP binding sites active G-proteins initiated to membrane vesicles by interacting with specific effector proteins. Without the interaction from GTP binding sites, G-proteins could not be active in membrane trafficking and consequently cause many diseases, i.e., cancer, Parkinson… Thus it is very important to identify GTP binding sites in membrane trafficking, in particular, and in transport protein, in general.ResultsWe developed the proposed model with a cross-validation and examined with an independent dataset. We achieved an accuracy of 95.6% for evaluating with cross-validation and 98.7% for examining the performance with the independent data set. For newly discovered transport protein sequences, our approach performed remarkably better than similar methods such as GTPBinder, NsitePred and TargetSOS. Moreover, a friendly web server was developed for identifying GTP binding sites in transport proteins available for all users.ConclusionsWe approached a computational technique using PSSM profiles and SAAPs for identifying GTP binding residues in transport proteins. When we included SAAPs into PSSM profiles, the predictive performance achieved a significant improvement in all measurement metrics. Furthermore, the proposed method could be a power tool for determining new proteins that belongs into GTP binding sites in transport proteins and can provide useful information for biologists.

[1]  Ming Zhang,et al.  Rab7: roles in membrane trafficking and disease. , 2009, Bioscience reports.

[2]  P. Novick,et al.  The role of GTP-binding proteins in transport along the exocytic pathway. , 1993, Annual review of cell biology.

[3]  Mark Johnson,et al.  NCBI BLAST: a better web interface , 2008, Nucleic Acids Res..

[4]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[5]  M. O. Dayhoff A model of evolutionary change in protein , 1978 .

[6]  Yu-Yen Ou,et al.  Incorporating significant amino acid pairs to identify O-linked glycosylation sites on transmembrane proteins and non-transmembrane proteins , 2010, BMC Bioinformatics.

[7]  G. Crooks,et al.  WebLogo: a sequence logo generator. , 2004, Genome research.

[8]  Shu Yang,et al.  The Roles of Monomeric GTP-Binding Proteins in Macroautophagy in Saccharomyces cerevisiae , 2014, International journal of molecular sciences.

[9]  Yu-Yen Ou,et al.  Prediction of FAD binding sites in electron transport proteins according to efficient radial basis function networks and significant amino acid pairs , 2016, BMC Bioinformatics.

[10]  M. Strong,et al.  The emerging role of guanine nucleotide exchange factors in ALS and other neurodegenerative diseases , 2014, Front. Cell. Neurosci..

[11]  K. Mullis,et al.  Specific synthesis of DNA in vitro via a polymerase-catalyzed chain reaction. , 1987, Methods in enzymology.

[12]  Ian H. Witten,et al.  Data mining in bioinformatics using Weka , 2004, Bioinform..

[13]  S. Henikoff,et al.  Amino acid substitution matrices from protein blocks. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Yu-Yen Ou,et al.  Protein disorder prediction by condensed PSSM considering propensity for order or disorder , 2006, BMC Bioinformatics.

[15]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[16]  Zheng Rong Yang,et al.  Bio-basis function neural network for prediction of protease cleavage sites in proteins , 2005, IEEE Transactions on Neural Networks.

[17]  Gajendra P. S. Raghava,et al.  Open Access Research Article Prediction of Gtp Interacting Residues, Dipeptides and Tripeptides in a Protein from Its Evolutionary Information , 2022 .

[18]  Jing-Yu Yang,et al.  A New Supervised Over-Sampling Algorithm with Application to Protein-Nucleotide Binding Residue Prediction , 2014, PloS one.

[19]  Andrew P. Bradley,et al.  The use of the area under the ROC curve in the evaluation of machine learning algorithms , 1997, Pattern Recognit..

[20]  Ian T. Paulsen,et al.  TransportDB: a relational database of cellular membrane transport systems , 2004, Nucleic Acids Res..

[21]  Yu-Yen Ou,et al.  Using Efficient RBF Networks to Classify Transport Proteins Based on PSSM Profiles and Biochemical Properties , 2009, IWANN.

[22]  M. Gromiha,et al.  Classification of transporters using efficient radial basis function networks with position‐specific scoring matrices and biochemical properties , 2010, Proteins.

[23]  Lukasz A. Kurgan,et al.  Prediction and analysis of nucleotide-binding residues using sequence and sequence-derived structural descriptors , 2012, Bioinform..

[24]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[25]  Milton H. Saier,et al.  TCDB: the Transporter Classification Database for membrane transport protein analyses and information , 2005, Nucleic Acids Res..

[26]  Tzong-Yi Lee,et al.  Incorporating Distant Sequence Features and Radial Basis Function Networks to Identify Ubiquitin Conjugation Sites , 2011, PloS one.

[27]  J. Hanley,et al.  The meaning and use of the area under a receiver operating characteristic (ROC) curve. , 1982, Radiology.

[28]  Yu-Yen Ou,et al.  Prediction of transporter targets using efficient RBF networks with PSSM profiles and biochemical properties , 2011, Bioinform..

[29]  K. Mullis,et al.  9 – Specific Synthesis of DNA in Vitro via a Polymerase-Catalyzed Chain Reaction , 1989 .

[30]  Yen-Jen Oyang,et al.  A novel radial basis function network classifier with centers set by hierarchical clustering , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[31]  Yu-Yen Ou,et al.  TMBETADISC-RBF: Discrimination of beta-barrel membrane proteins using RBF networks and PSSM profiles , 2008, Comput. Biol. Chem..

[32]  Hao Lin,et al.  High prevalence of genital human papillomavirus type 52 and 58 infection in women attending gynecologic practitioners in South Taiwan. , 2006, Gynecologic oncology.

[33]  P. Novick,et al.  Role of Rab GTPases in membrane traffic and cell physiology. , 2011, Physiological reviews.

[34]  De-Shuang Huang,et al.  Prediction of inter-residue contacts map based on genetic algorithm optimized radial basis function neural network and binary input encoding scheme , 2004, J. Comput. Aided Mol. Des..

[35]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.