On filtering false positive transmembrane protein predictions.

While helical transmembrane (TM) region prediction tools achieve high (>90%) success rates for real integral membrane proteins, they produce a considerable number of false positive hits in sequences of known nontransmembrane queries. We propose a modification of the dense alignment surface (DAS) method that achieves a substantial decrease in the false positive error rate. Essentially, a sequence that includes possible transmembrane regions is compared in a second step with TM segments in a sequence library of documented transmembrane proteins. If the performance of the query sequence against the library of documented TM segment-containing sequences in this test is lower than an empirical threshold, it is classified as a non-transmembrane protein. The probability of false positive prediction for trusted TM region hits is expressed in terms of E-values. The modified DAS method, the DAS-TMfilter algorithm, has an unchanged high sensitivity for TM segments ( approximately 95% detected in a learning set of 128 documented transmembrane proteins). At the same time, the selectivity measured over a non-redundant set of 526 soluble proteins with known 3D structure is approximately 99%, mainly because a large number of falsely predicted single membrane-pass proteins are eliminated by the DAS-TMfilter algorithm.

[1]  G von Heijne,et al.  Consensus predictions of membrane protein topology , 2000, FEBS letters.

[2]  Regularities in the primary structure of proteins. , 2009, International journal of peptide and protein research.

[3]  P. Tompa,et al.  Prion protein: Evolution caught en route , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Rolf Apweiler,et al.  Evaluation of methods for the prediction of membrane spanning regions , 2001, Bioinform..

[5]  G. Nikiforovich A novel, non-statistical method for predicting breaks in transmembrane helices. , 1998, Protein engineering.

[6]  G. Schulz,et al.  Structure of porin refined at 1.8 A resolution. , 1992, Journal of molecular biology.

[7]  B. Rost,et al.  Transmembrane helices predicted at 95% accuracy , 1995, Protein science : a publication of the Protein Society.

[8]  I Simon,et al.  New alignment strategy for transmembrane proteins. , 1994, Journal of molecular biology.

[9]  Shigeki Mitaku,et al.  SOSUI: classification and secondary structure prediction system for membrane proteins , 1998, Bioinform..

[10]  István Simon,et al.  Topology of Membrane Proteins , 2001, J. Chem. Inf. Comput. Sci..

[11]  S H White,et al.  Energetics, stability, and prediction of transmembrane helices. , 2001, Journal of molecular biology.

[12]  P. Bork,et al.  Prediction of potential GPI-modification sites in proprotein sequences. , 1999, Journal of molecular biology.

[13]  I. Simon,et al.  Predicting isomorphic residue replacements for protein design. , 2009, International journal of peptide and protein research.

[14]  Thomas L. Madden,et al.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. , 1997, Nucleic acids research.

[15]  J. Andreadis,et al.  Use of immobilized PCR primers to generate covalently immobilized DNAs for in vitro transcription/translation reactions. , 2000, Nucleic acids research.

[16]  P. Argos,et al.  Protein structure prediction: recognition of primary, secondary, and tertiary structural features from amino acid sequence. , 1995, Critical reviews in biochemistry and molecular biology.

[17]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[18]  Yoshinori Fujiyoshi,et al.  Atomic model of plant light-harvesting complex by electron crystallography , 1994, Nature.

[19]  D A Parry,et al.  Quantitative comparison of the ability of hydropathy scales to recognize surface β‐strands in proteins , 2001, Proteins.

[20]  W R Taylor,et al.  A model recognition approach to the prediction of all-helical membrane protein structure and topology. , 1994, Biochemistry.

[21]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[22]  S J Hamodrakas,et al.  A novel method for predicting transmembrane segments in proteins based on a statistical analysis of the SwissProt database: the PRED-TMR algorithm. , 1999, Protein engineering.

[23]  A Elofsson,et al.  Prediction of transmembrane alpha-helices in prokaryotic membrane proteins: the dense alignment surface method. , 1997, Protein engineering.

[24]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[25]  Liam J. McGuffin,et al.  The PSIPRED protein structure prediction server , 2000, Bioinform..

[26]  S J Hamodrakas,et al.  An hierarchical artificial neural network system for the classification of transmembrane proteins. , 1999, Protein engineering.

[27]  J. Lakey,et al.  Pore-forming colicins and their relatives. , 2001, Current topics in microbiology and immunology.

[28]  A. Fiser,et al.  Predicting protein conformation by statistical methods. , 2001, Biochimica et biophysica acta.

[29]  M. Kanehisa,et al.  Cluster analysis of amino acid indices for prediction of protein structure and function. , 1988, Protein engineering.

[30]  P Bork,et al.  Post-translational GPI lipid anchor modification of proteins in kingdoms of life: analysis of protein sequence data from complete genomes. , 2001, Protein engineering.

[31]  U. Hobohm,et al.  Enlarged representative set of protein structures , 1994, Protein science : a publication of the Protein Society.

[32]  G. Heijne Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. , 1992, Journal of molecular biology.

[33]  P. Loll,et al.  The X-ray crystal structure of the membrane protein prostaglandin H2 synthase-1 , 1994, Nature.

[34]  G. Tusnády,et al.  Principles governing amino acid composition of integral membrane proteins: application to topology prediction. , 1998, Journal of molecular biology.

[35]  Davor Juretic,et al.  Preference Functions for Prediction of Membrane-buried Helices in Integral Membrane Proteins , 1998, Comput. Chem..

[36]  F. Hucho,et al.  Beta-structure in the membrane-spanning part of the nicotinic acetylcholine receptor (or how helical are transmembrane helices?). , 1994, Trends in biochemical sciences.

[37]  István Simon,et al.  The HMMTOP transmembrane topology prediction server , 2001, Bioinform..

[38]  T. Steitz,et al.  Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. , 1986, Annual review of biophysics and biophysical chemistry.

[39]  Rolf Apweiler,et al.  A collection of well characterised integral membrane proteins , 2000, Bioinform..