EBGW_OMP: A sequence-based method for accurate prediction of outer membrane proteins

Outer membrane proteins (OMPs) play important roles in bacterial cellular processes. Discriminating OMPs from different fold types of proteins is helpful for successful prediction of their structures and for exact designs of OMP-targeted drugs. In this paper, we developed a novel prediction method based on primary sequence features and support vector machine (SVM) algorithms. For protein sequences, discriminative features were extracted by the combination of sequence encoding based on grouped weights (EBGW), amino acid compositions and biochemical properties. Feature subsets were screened using F-score algorithm for training a SVM-based classifier, namely EBGW_OMP. The performance of EBGW_OMP was examined on a benchmark dataset of 1087 proteins. The results show that EBGW_OMP can discriminate OMPs from globular proteins, α-helical membrane proteins or non-OMPs with cross-validated accuracy of 98.0%, 97.6% or 97.9%, respectively, which outperformed existing sequence-based methods. EBGW_OMP also successfully distinguished 681 out of 722 OMPs with 97.0% accuracy in another benchmark dataset of 2657 proteins. Genome-wide tests show that EBGW_OMP has excellent capability of correctly detecting OMPs and is considerable for genomic OMPs prediction. The web server implements EBGW_OMP is freely accessible at http://bioinfo.tmmu.edu.cn/EBGW_ OMP.

[1]  William C. Wimley,et al.  The versatile β-barrel membrane protein , 2003 .

[2]  P. van Gelder,et al.  Structure and function of bacterial outer membrane proteins: barrels in a nutshell , 2000, Molecular microbiology.

[3]  Ingvar Eidhammer,et al.  BOMP: a program to predict integral ?barrel outer membrane proteins encoded within genomes of Gram-negative bacteria , 2004, Nucleic Acids Res..

[4]  Jing Hu,et al.  Discrimination of outer membrane proteins with improved performance , 2008, BMC Bioinformatics.

[5]  Jia He,et al.  Improving discrimination of outer membrane proteins by fusing different forms of pseudo amino acid composition. , 2010, Analytical biochemistry.

[6]  Pedro de Alcântara dos Santos Neto,et al.  LabSystem Gen, a tool for structuring and analyzing genetic data in histocompatibility laboratories , 2012, Comput. Biol. Medicine.

[7]  David R. Westhead,et al.  TMB-Hunt: An amino acid composition based method to screen proteomes for beta-barrel transmembrane proteins , 2005, BMC Bioinformatics.

[8]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[9]  Arne Elofsson,et al.  BOCTOPUS: improved topology prediction of transmembrane β barrel proteins , 2012, Bioinform..

[10]  Luonan Chen,et al.  Discrimination of outer membrane proteins by a new measure of information discrepancy. , 2007, Protein and peptide letters.

[11]  Zhen-Hui Zhang,et al.  A novel method for apoptosis protein subcellular localization prediction combining encoding based on grouped weight and support vector machine , 2006, FEBS letters.

[12]  Johannes Söding,et al.  HHomp—prediction and classification of outer membrane proteins , 2009, Nucleic Acids Res..

[13]  Burkhard Rost,et al.  PROFtmb: a web server for predicting bacterial transmembrane beta barrel proteins , 2006, Nucleic Acids Res..

[14]  Qi Liu,et al.  Identification of -barrel membrane proteins based on amino acid composition properties and predicted secondary structure , 2003, Comput. Biol. Chem..

[15]  Thomas C. Freeman,et al.  A highly accurate statistical approach for the prediction of transmembrane beta-barrels , 2010, Bioinform..

[16]  M. Michael Gromiha,et al.  A simple statistical method for discriminating outer membrane proteins with better accuracy , 2005, Bioinform..

[17]  Makiko Suwa,et al.  Discrimination of outer membrane proteins using machine learning algorithms , 2006, Proteins.

[18]  Piero Fariselli,et al.  A sequence-profile-based HMM for predicting and discriminating beta barrel membrane proteins , 2002, ISMB.

[19]  Ian T. Paulsen,et al.  TransportDB: a comprehensive database resource for cytoplasmic membrane transport systems and outer membrane channels , 2006, Nucleic Acids Res..

[20]  Zhen Chen,et al.  Outer membrane proteins can be simply identified using secondary structure element alignment , 2011, BMC Bioinformatics.

[21]  Thomas C. Freeman,et al.  A Highly Accurate Statistical Approach for the Prediction of Transmembrane β-Barrels , 2010 .

[22]  William C Wimley,et al.  The versatile beta-barrel membrane protein. , 2003, Current opinion in structural biology.

[23]  Paul Horton,et al.  Discrimination of outer membrane proteins using support vector machines , 2005, Bioinform..

[24]  C. Yan,et al.  Discrimination of outer membrane proteins using a K-nearest neighbor method , 2007, Amino Acids.

[25]  M. Gromiha,et al.  Classification of transporters using efficient radial basis function networks with position‐specific scoring matrices and biochemical properties , 2010, Proteins.

[26]  Li Yang,et al.  Toward an improved discrimination of outer membrane proteins using a sequence-based approach , 2011, Biosyst..

[27]  M Michael Gromiha,et al.  Motifs in outer membrane protein sequences: applications for discrimination. , 2005, Biophysical chemistry.

[28]  Piero Fariselli,et al.  Improving the detection of transmembrane β-barrel chains with N-to-1 extreme learning machines , 2011, Bioinform..

[29]  Shigeki Mitaku,et al.  Amphiphilicity index of polar amino acids as an aid in the characterization of amino acid preference at membrane-water interfaces , 2002, Bioinform..

[30]  Jing Hu,et al.  A method for discovering transmembrane beta-barrel proteins in Gram-negative bacterial proteomes , 2008, Comput. Biol. Chem..