Estimation of Membrane Proteins in the Human Proteome

Genomics and proteomics have added valuable information to our knowledgebase of the human biological system including the discovery of therapeutic targets and disease biomarkers. However, molecular profiling studies commonly result in the identification of novel proteins of unknown localization. A class of proteins of special interest is membrane proteins, in particular plasma membrane proteins. Despite their biological and medical significance, the 3-dimensional structures of less than 1% of plasma membrane proteins have been determined. In order to aid in identification of membrane proteins, a number of computational methods have been developed. These tools operate by predicting the presence of transmembrane segments. Here, we utilized five topology prediction methods (TMHMM, SOSUI, waveTM, HMMTOP, and TopPred II) in order to estimate the ratio of integral membrane proteins in the human proteome. These methods employ different algorithms and include a newly-developed method (waveTM) that has yet to be tested on a large proteome database. Since these tools are prone for error mainly as a result of falsely predicting signal peptides as transmembrane segments, we have utilized an additional method, SignalP. Based on our analyses, the ratio of human proteins with transmembrane segments is estimated to fall between 15% and 39% with a consensus of 13%. Agreement among the programs is reduced further when both a positive identification of a membrane protein and the number of transmembrane segments per protein are considered. Such a broad range of prediction depends on the selectivity of the individual method in predicting integral membrane proteins. These methods can play a critical role in determining protein structure and, hence, identifying suitable drug targets in humans.

[1]  Shigeki Mitaku,et al.  SOSUI: classification and secondary structure prediction system for membrane proteins , 1998, Bioinform..

[2]  T. Stevens,et al.  Do more complex organisms have a greater proportion of membrane proteins in their genomes? , 2000, Proteins.

[3]  J. Baselga,et al.  HER-Targeted Tyrosine-Kinase Inhibitors , 2002, Oncology.

[4]  Manuel G. Claros,et al.  TopPred II: an improved software for membrane protein structure predictions , 1994, Comput. Appl. Biosci..

[5]  G. Heijne,et al.  Genome‐wide analysis of integral membrane proteins from eubacterial, archaean, and eukaryotic organisms , 1998, Protein science : a publication of the Protein Society.

[6]  Toshio Shimizu,et al.  Evaluating transmembrane topology prediction methods for the effect of signal peptide in topology prediction , 2002, Silico Biol..

[7]  A. Krogh,et al.  A combined transmembrane topology and signal peptide prediction method. , 2004, Journal of molecular biology.

[8]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[9]  David C. Jones,et al.  A mutation data matrix for transmembrane proteins , 1994, FEBS letters.

[10]  G. von Heijne,et al.  Predicting the topology of eukaryotic membrane proteins. , 1993, European journal of biochemistry.

[11]  Masami Ikeda,et al.  Transmembrane topology prediction methods: A re-assessment and improvement by a consensus method using a dataset of experimentally-characterized transmembrane topology , 2001, Silico Biol..

[12]  G. Tusnády,et al.  Principles governing amino acid composition of integral membrane proteins: application to topology prediction. , 1998, Journal of molecular biology.

[13]  A. Krogh,et al.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes. , 2001, Journal of molecular biology.

[14]  N. Blom,et al.  Feature-based prediction of non-classical and leaderless protein secretion. , 2004, Protein engineering, design & selection : PEDS.

[15]  Rolf Apweiler,et al.  Evaluation of methods for the prediction of membrane spanning regions , 2001, Bioinform..

[16]  E. Rowinsky,et al.  The ErbB receptor family: a therapeutic target for cancer. , 2002, Trends in molecular medicine.

[17]  David T. Jones,et al.  A method for α‐helical integral membrane protein fold prediction , 1994 .

[18]  Lukas Käll,et al.  Reliability of transmembrane predictions in whole‐genome data , 2002, FEBS letters.

[19]  G. Heijne Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. , 1992, Journal of molecular biology.

[20]  Stavros J. Hamodrakas,et al.  waveTM: Wavelet-based transmembrane segment prediction , 2004, Silico Biol..

[21]  István Simon,et al.  The HMMTOP transmembrane topology prediction server , 2001, Bioinform..

[22]  G. von Heijne,et al.  Prediction of partial membrane protein topologies using a consensus approach , 2002, Protein science : a publication of the Protein Society.

[23]  A. Krogh,et al.  Reliability measures for membrane protein topology prediction algorithms. , 2003, Journal of molecular biology.

[24]  Masami Ikeda,et al.  The presence of signal peptide significantly affects transmembrane topology prediction , 2002, Bioinform..

[25]  A. Hopkins,et al.  The druggable genome , 2002, Nature Reviews Drug Discovery.

[26]  A. Elofsson,et al.  Best α‐helical transmembrane protein topology predictions are achieved using hidden Markov models and evolutionary information , 2004 .

[27]  J. Drews Drug discovery: a historical perspective. , 2000, Science.