Refining Neural Network Predictions for Helical Transmembrane Proteins by Dynamic Programming

For transmembrane proteins experimental determination of three-dimensional structure is problematic. However, membrane proteins have important impact for molecular biology in general, and for drug design in particular. Thus, prediction method are needed. Here we introduce a method that started from the output of the profile-based neural network system PHDhtm (Rost, et al. 1995). Instead of choosing the neural network output unit with maximal value as prediction, we implemented a dynamic programming-like refinement procedure that aimed at producing the best model for all transmembrane helices compatible with the neural network output. The refined prediction was used successfully to predict transmembrane topology based on an empirical rule for the charge difference between extra- and intra-cytoplasmic regions (positive-inside rule). Preliminary results suggest that the refinement was clearly superior to the initial neural network system; and that the method predicted all transmembrane helices correctly for more proteins than a previously applied empirical filter. The resulting accuracy in predicting topology was better than 80%. Although a more thorough evaluation of the method on a larger data set will be required, the results compared favourably with alternative methods. The results reflected the strength of the refinement procedure which was the successful incorporation of global information: whereas the residue preferences output by the neural network were derived from stretches of 17 adjacent residues, the refinement procedure involved constraints on the level of the entire protein.

[1]  G. Heijne Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. , 1992, Journal of molecular biology.

[2]  Søren Brunak,et al.  Protein Folds: A Distance-Based Approach , 1995 .

[3]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[4]  T. Steitz,et al.  Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. , 1986, Annual review of biophysics and biophysical chemistry.

[5]  G. Heijne The distribution of positively charged residues in bacterial inner membrane proteins correlates with the trans‐membrane topology , 1986, The EMBO journal.

[6]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[7]  B. Rost,et al.  Redefining the goals of protein secondary structure prediction. , 1994, Journal of molecular biology.

[8]  D. T. Jones,et al.  A method for alpha-helical integral membrane protein fold prediction. , 1994, Proteins.

[9]  B. Rost,et al.  Transmembrane helices predicted at 95% accuracy , 1995, Protein science : a publication of the Protein Society.

[10]  W. Cramer,et al.  Membrane protein structure prediction: cytochrome b. , 1991, Trends in biochemical sciences.

[11]  B. Matthews Comparison of the predicted and observed secondary structure of T4 phage lysozyme. , 1975, Biochimica et biophysica acta.

[12]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[13]  A. Bairoch,et al.  The SWISS-PROT protein sequence data bank: current status. , 1994, Nucleic acids research.

[14]  B. Rost,et al.  Prediction of protein secondary structure at better than 70% accuracy. , 1993, Journal of molecular biology.

[15]  P Argos,et al.  Prediction of transmembrane segments in proteins utilising multiple sequence alignments. , 1994, Journal of molecular biology.

[16]  J. Beckwith,et al.  A genetic approach to analyzing membrane protein topology. , 1986, Science.

[17]  J. Broome-Smith,et al.  Gene-fusion techniques for determining membrane-protein topology , 1993 .

[18]  David T. Jones,et al.  A method for α‐helical integral membrane protein fold prediction , 1994 .

[19]  B Rost,et al.  Bridging the protein sequence-structure gap by structure predictions. , 1996, Annual review of biophysics and biomolecular structure.

[20]  T A Rapoport,et al.  Predicting the orientation of eukaryotic membrane-spanning proteins. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[21]  B Rost,et al.  Progress of 1D protein structure prediction at last , 1995, Proteins.

[22]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[23]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[24]  G. Vonheijne,et al.  Control of topology and mode of assembly of a polytopic membrane protein by positively charged residues , 1989, Nature.

[25]  C. Sander,et al.  Database of homology‐derived protein structures and the structural meaning of sequence alignment , 1991, Proteins.

[26]  W R Taylor,et al.  A model recognition approach to the prediction of all-helical membrane protein structure and topology. , 1994, Biochemistry.

[27]  G. von Heijne,et al.  Topogenic signals in integral membrane proteins. , 1988, European journal of biochemistry.

[28]  E E Lattman,et al.  Protein crystallography for all , 1994, Proteins.

[29]  R. Fleischmann,et al.  Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. , 1995, Science.

[30]  Piero Fariselli,et al.  HTP: a neural network-based method for predicting the topology of helical transmembrane domains in proteins , 1996, Comput. Appl. Biosci..

[31]  J Edelman,et al.  Quadratic minimization of predictors for protein secondary structure. Application to transmembrane alpha-helices. , 1993, Journal of molecular biology.

[32]  Jon Beckwith,et al.  The role of charged amino acids in the localization of secreted and membrane proteins , 1990, Cell.

[33]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1978, Archives of biochemistry and biophysics.

[34]  B. Rost,et al.  Topology prediction for helical transmembrane proteins at 86% accuracy–Topology prediction at 86% accuracy , 1996, Protein science : a publication of the Protein Society.

[35]  G. von Heijne,et al.  Predicting the topology of eukaryotic membrane proteins. , 1993, European journal of biochemistry.

[36]  B. Rost PHD: predicting one-dimensional protein structure by profile-based neural networks. , 1996, Methods in enzymology.