Transmembrane helix prediction in proteins using hydrophobicity properties and higher-order statistics

Prediction of transmembrane (TM) helices is important in the study of membrane proteins. A novel method to predict the location and length of both single and multiple TM helices in human proteins is presented. The proposed method combines hydrophobicity with higher-order statistics, resulting in a TM prediction tool named K4HTM. A training dataset of 117 human single-TM proteins and two test datasets, containing 499 human single-TM proteins and 484 human multiple-TM proteins, respectively, were drawn from the SWISS-PROT public database and used for the optimisation and evaluation of K4HTM. Validation results showed that K4HTM correctly predicts the entire topology for 99.68% of the sequences in the single-TM test dataset and 93.08% of those in the multiple-TM test dataset. These results compare favourably with existing methods, such as SPLIT4, TMHMM2, WAVETM and SOSUI, making K4HTM an alternative approach to the TM helix prediction problem. © 2008 Elsevier Ltd. All rights reserved.
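To make the two ingredients named in the abstract concrete, the sketch below computes a sliding-window hydropathy profile and a sliding-window excess kurtosis (a fourth-order statistic) for a protein sequence, then marks hydrophobic stretches as candidate TM segments. It is not the K4HTM algorithm: the abstract does not state which hydrophobicity scale, window length, threshold, or combination rule K4HTM uses. The Kyte-Doolittle scale, the 19-residue window, the 1.6 threshold, the 15-residue minimum length, and all function names here are assumptions chosen only for illustration.

```python
# Illustrative sketch only; NOT the published K4HTM method.
# Assumed: Kyte-Doolittle scale, window=19, threshold=1.6, min_len=15.
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy(seq):
    """Map each residue to its Kyte-Doolittle hydropathy value."""
    return [KD[aa] for aa in seq.upper()]


def mean(xs):
    return sum(xs) / len(xs)


def excess_kurtosis(xs):
    """Fourth central moment over squared variance, minus 3."""
    m = mean(xs)
    m2 = mean([(x - m) ** 2 for x in xs])
    if m2 == 0:
        return 0.0  # constant window: define kurtosis as 0
    m4 = mean([(x - m) ** 4 for x in xs])
    return m4 / (m2 * m2) - 3.0


def sliding(values, window, fn):
    """Apply fn to a centred window at every position (truncated at the ends)."""
    half = window // 2
    n = len(values)
    return [fn(values[max(0, i - half):min(n, i + half + 1)]) for i in range(n)]


def kurtosis_profile(seq, window=19):
    """Windowed excess kurtosis of the hydropathy signal."""
    return sliding(hydropathy(seq), window, excess_kurtosis)


def candidate_segments(seq, window=19, threshold=1.6, min_len=15):
    """Return (start, end) half-open index pairs where the smoothed
    hydropathy stays at or above the threshold for at least min_len residues."""
    smooth = sliding(hydropathy(seq), window, mean)
    segments, start = [], None
    # A sentinel below any threshold closes a run that reaches the sequence end.
    for i, v in enumerate(smooth + [float("-inf")]):
        if v >= threshold and start is None:
            start = i
        elif v < threshold and start is not None:
            if i - start >= min_len:
                segments.append((start, i))
            start = None
    return segments
```

For a toy sequence such as `"D" * 30 + "L" * 25 + "D" * 30`, `candidate_segments` reports a single stretch inside the leucine run, while `kurtosis_profile` gives a per-residue fourth-order statistic that a real predictor could use alongside the hydropathy profile. How K4HTM actually weighs or combines the two signals is described in the paper itself, not here.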
