TMBETADISC-RBF: Discrimination of beta-barrel membrane proteins using RBF networks and PSSM profiles

Discriminating outer membrane proteins (OMPs) from other folding types of globular and membrane proteins is an important task both for identifying OMPs from genomic sequences and for the successful prediction of their secondary and tertiary structures. We have developed a method based on radial basis function networks and position specific scoring matrix (PSSM) profiles generated by PSI-BLAST and non-redundant protein database. Our approach with PSSM profiles has correctly predicted the OMPs with a cross-validated accuracy of 96.4% in a set of 1251 proteins, which contain 206 OMPs, 667 globular proteins and 378 alpha-helical inner membrane proteins. Furthermore, we applied our method on a dataset containing 114 OMPs, 187 TMH proteins and 195 globular proteins obtained with less than 20% sequence identity and obtained the cross-validated accuracy of 95%. This accuracy of discriminating OMPs is higher than other methods in the literature and our method could be used as an effective tool for dissecting OMPs from genomic sequences. We have developed a prediction server, TMBETADISC-RBF, which is available at http://rbf.bioinfo.tw/~sachen/OMP.html.

[1]  Harpreet Kaur,et al.  Prediction of transmembrane regions of beta-barrel proteins using ANN- and SVM-based methods. , 2004, Proteins.

[2]  Ingvar Eidhammer,et al.  BOMP: a program to predict integral ?barrel outer membrane proteins encoded within genomes of Gram-negative bacteria , 2004, Nucleic Acids Res..

[3]  Stavros J. Hamodrakas,et al.  A Hidden Markov Model method, capable of predicting and discriminating β-barrel outer membrane proteins , 2004, BMC Bioinformatics.

[4]  De-Shuang Huang,et al.  Prediction of inter-residue contacts map based on genetic algorithm optimized radial basis function neural network and binary input encoding scheme , 2004, J. Comput. Aided Mol. Des..

[5]  D T Jones,et al.  Protein secondary structure prediction based on position-specific scoring matrices. , 1999, Journal of molecular biology.

[6]  Patrice Koehl,et al.  The ASTRAL Compendium in 2004 , 2003, Nucleic Acids Res..

[7]  M. Michael Gromiha,et al.  A simple statistical method for discriminating outer membrane proteins with better accuracy , 2005, Bioinform..

[8]  William H. Press,et al.  Numerical Recipes in C, 2nd Edition , 1992 .

[9]  Makiko Suwa,et al.  Discrimination of outer membrane proteins using machine learning algorithms , 2006, Proteins.

[10]  M Michael Gromiha,et al.  Motifs in outer membrane protein sequences: applications for discrimination. , 2005, Biophysical chemistry.

[11]  Yu-Yen Ou,et al.  Protein disorder prediction by condensed PSSM considering propensity for order or disorder , 2006, BMC Bioinformatics.

[12]  S. Krishnaswamy,et al.  Profiles from structure based sequence alignment of porins can identify ß stranded integral membrane proteins , 2000, Bioinform..

[13]  Burkhard Rost,et al.  PROFtmb: a web server for predicting bacterial transmembrane beta barrel proteins , 2006, Nucleic Acids Res..

[14]  Qi Liu,et al.  Identification of -barrel membrane proteins based on amino acid composition properties and predicted secondary structure , 2003, Comput. Biol. Chem..

[15]  Paul Horton,et al.  Discrimination of outer membrane proteins using support vector machines , 2005, Bioinform..

[16]  Makiko Suwa,et al.  Current developments on beta-barrel membrane proteins: sequence and structure analysis, discrimination and prediction. , 2007, Current protein & peptide science.

[17]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[18]  Adam Godzik,et al.  Clustering of highly homologous sequences to reduce the size of large protein databases , 2001, Bioinform..

[19]  Ao Li,et al.  LOCSVMPSI: a web server for subcellular localization of eukaryotic proteins using SVM and profile of PSI-BLAST , 2005, Nucleic Acids Res..

[20]  William H. Press,et al.  Numerical recipes in C , 2002 .

[21]  David R. Westhead,et al.  TMB-Hunt: a web server to screen sequence sets for transmembrane β-barrel proteins , 2005, Nucleic Acids Res..

[22]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[23]  Shandar Ahmad,et al.  Application of residue distribution along the sequence for discriminating outer membrane proteins , 2005, Comput. Biol. Chem..

[24]  Zsuzsanna Dosztányi,et al.  PDB_TM: selection and membrane localization of transmembrane proteins in the protein data bank , 2004, Nucleic Acids Res..

[25]  Andrew G. Garrow,et al.  A consensus algorithm to screen genomes for novel families of transmembrane β barrel proteins , 2007, Proteins.

[26]  Piero Fariselli,et al.  A sequence-profile-based HMM for predicting and discriminating beta barrel membrane proteins , 2002, ISMB.

[27]  Zheng Rong Yang,et al.  Bio-basis function neural network for prediction of protease cleavage sites in proteins , 2005, IEEE Transactions on Neural Networks.

[28]  Yen-Jen Oyang,et al.  A novel radial basis function network classifier with centers set by hierarchical clustering , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[29]  Akihiko Konagaya,et al.  Selecting effective siRNA sequences by using radial basis function network and decision tree learning , 2006, BMC Bioinformatics.

[30]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.

[31]  Milton H. Saier,et al.  TCDB: the Transporter Classification Database for membrane transport protein analyses and information , 2005, Nucleic Acids Res..