A method for discovering transmembrane beta-barrel proteins in Gram-negative bacterial proteomes

Transmembrane beta-barrel (TMB) proteins play pivotal roles in many aspects of bacterial functions. This paper presents a k-nearest neighbor (K-NN) method for discriminating TMB and non-TMB proteins. We start with a method that makes predictions based on a distance computed from residue composition and gradually improve the prediction performance by including homologous sequences and searching for a set of residues and di-peptides for calculating the distance. The final method achieves an accuracy of 97.1%, with 0.876 MCC, 86.4% sensitivity and 98.8% specificity. A web server based on the proposed method is available at http://yanbioinformatics.cs.usu.edu:8080/TMBKNNsubmit.

[1]  David R. Westhead,et al.  TMB-Hunt: An amino acid composition based method to screen proteomes for beta-barrel transmembrane proteins , 2005, BMC Bioinformatics.

[2]  S. Krishnaswamy,et al.  Profiles from structure based sequence alignment of porins can identify ß stranded integral membrane proteins , 2000, Bioinform..

[3]  Burkhard Rost,et al.  PROFtmb: a web server for predicting bacterial transmembrane beta barrel proteins , 2006, Nucleic Acids Res..

[4]  Qi Liu,et al.  Identification of -barrel membrane proteins based on amino acid composition properties and predicted secondary structure , 2003, Comput. Biol. Chem..

[5]  P. van Gelder,et al.  Structure and function of bacterial outer membrane proteins: barrels in a nutshell , 2000, Molecular microbiology.

[6]  Ingvar Eidhammer,et al.  BOMP: a program to predict integral ?barrel outer membrane proteins encoded within genomes of Gram-negative bacteria , 2004, Nucleic Acids Res..

[7]  T. Becker,et al.  Prediction of the plant β‐barrel proteome: A case study of the chloroplast outer envelope , 2003, Protein science : a publication of the Protein Society.

[8]  David R. Westhead,et al.  TMB-Hunt: a web server to screen sequence sets for transmembrane β-barrel proteins , 2005, Nucleic Acids Res..

[9]  William C. Wimley,et al.  The versatile β-barrel membrane protein , 2003 .

[10]  Christophe G. Lambert,et al.  PSORTdb: a protein subcellular localization database for bacteria , 2004, Nucleic Acids Res..

[11]  Stavros J. Hamodrakas,et al.  Evaluation of methods for predicting the topology of β-barrel outer membrane proteins and a consensus prediction method , 2005, BMC Bioinformatics.

[12]  M. Saier,et al.  The β‐barrel finder (BBF) program, allowing identification of outer membrane β‐barrel proteins encoded within prokaryotic genomes , 2002 .

[13]  G. Schulz β-Barrel membrane proteins , 2000 .

[14]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[15]  Stavros J. Hamodrakas,et al.  PRED-TMBB: a web server for predicting the topology of ?barrel outer membrane proteins , 2004, Nucleic Acids Res..

[16]  Milton H. Saier,et al.  TCDB: the Transporter Classification Database for membrane transport protein analyses and information , 2005, Nucleic Acids Res..

[17]  Martin Ester,et al.  Sequence analysis PSORTb v . 2 . 0 : Expanded prediction of bacterial protein subcellular localization and insights gained from comparative proteome analysis , 2004 .

[18]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[19]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[20]  Pierre Baldi,et al.  Assessing the accuracy of prediction algorithms for classification: an overview , 2000, Bioinform..

[21]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[22]  A G Murzin,et al.  SCOP: a structural classification of proteins database for the investigation of sequences and structures. , 1995, Journal of molecular biology.