Structure optimization of an artificial neural filter detecting membrane-spanning amino acid sequences.

An artificial neural network has been developed for the recognition and prediction of transmembrane regions in the amino acid sequences of human integral membrane proteins. It provides an additional prediction method besides the common hydrophobicity analysis by statistical means. Membrane/nonmembrane transition regions are predicted with 92% accuracy in both training and independent test data. The method used for the development of the neural filter is the algorithm of structure evolution. It subjects both the architecture and parameters of the system to a systematical optimization process and carries out local search in the respective structure and parameter spaces. The training technique of incomplete induction as part of the structure evolution provides for a comparatively general solution of the problem that is described by input-output relations only. Seven physiochemical side-chain properties were used to encode the amino acid sequences. It was found that geometric parameters like side-chain volume, bulkiness, or surface area are of minor importance. The properties polarity, refractivity, and hydrophobicity, however, turned out to support feature extraction. It is concluded that membrane transition regions in proteins are encoded in sequences as a characteristic feature based on the respective side-chain properties. The method of structure evolution is described in detail for this particular application and suggestions for further development of amino acid sequence filters are made.

[1]  W. Vent,et al.  Rechenberg, Ingo, Evolutionsstrategie — Optimierung technischer Systeme nach Prinzipien der biologischen Evolution. 170 S. mit 36 Abb. Frommann‐Holzboog‐Verlag. Stuttgart 1973. Broschiert , 1975 .

[2]  G. Schneider,et al.  PROFI: a tool for the analysis of protein sequence features using a simple artificial neural network , 1993 .

[3]  G. von Heijne,et al.  Topogenic signals in integral membrane proteins. , 1988, European journal of biochemistry.

[4]  G. Dombi,et al.  Analysis of protein transmembrane helical regions by a neural network , 1994, Protein science : a publication of the Protein Society.

[5]  Georg E. Schulz,et al.  Principles of Protein Structure , 1979 .

[6]  G. Rose,et al.  Hydrophobicity of amino acid residues in globular proteins. , 1985, Science.

[7]  M J Sternberg,et al.  Prediction of structural and functional features of protein and nucleic acid sequences by artificial neural networks. , 1992, Biochemistry.

[8]  D. D. Jones,et al.  Amino acid properties and side-chain orientation in proteins: a cross correlation appraoch. , 1975, Journal of theoretical biology.

[9]  G. von Heijne,et al.  Signal sequences: The limits of variation , 1985 .

[10]  J. Beckwith,et al.  Determinants of membrane protein topology. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[11]  G Vriend,et al.  Modeling of transmembrane seven helix bundles. , 1993, Protein engineering.

[12]  R. Doolittle,et al.  A simple method for displaying the hydropathic character of a protein. , 1982, Journal of molecular biology.

[13]  F. Jähnig,et al.  Structure predictions of membrane proteins are not that bad. , 1990, Trends in biochemical sciences.

[14]  D. Eisenberg,et al.  The hydrophobic moment detects periodicity in protein hydrophobicity. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Gunnar von Heijne,et al.  Fine-tuning the topology of a polytopic membrane protein: Role of positively and negatively charged amino acids , 1990, Cell.

[16]  H. P. Schwefel,et al.  Numerische Optimierung von Computermodellen mittels der Evo-lutionsstrategie , 1977 .

[17]  F. Jähnig,et al.  The structure of the lactose permease derived from Raman spectroscopy and prediction methods. , 1985, The EMBO journal.

[18]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[19]  G Schneider,et al.  The rational design of amino acid sequences by artificial neural networks and simulated molecular evolution: de novo design of an idealized leader peptidase cleavage site. , 1994, Biophysical journal.

[20]  K. R. Woods,et al.  Prediction of protein antigenic determinants from amino acid sequences. , 1981, Proceedings of the National Academy of Sciences of the United States of America.

[21]  David T. Jones,et al.  A method for α‐helical integral membrane protein fold prediction , 1994 .

[22]  R. Lohmann,et al.  A neural network model for the prediction of membrane‐spanning amino acid sequences , 1994, Protein science : a publication of the Protein Society.

[23]  William T. Katz,et al.  Artificial Neural Networks , 2018, Encyclopedia of Image Processing.

[24]  T. Steitz,et al.  Identifying nonpolar transbilayer helices in amino acid sequences of membrane proteins. , 1986, Annual review of biophysics and biophysical chemistry.

[25]  H. Lodish,et al.  Sequence of human asialoglycoprotein receptor cDNA. An internal signal sequence for membrane insertion. , 1985, The Journal of biological chemistry.

[26]  C DeLisi,et al.  The detection and classification of membrane-spanning proteins. , 1985, Biochimica et biophysica acta.

[27]  J A Bangham,et al.  Data-sieving hydrophobicity plots. , 1988, Analytical biochemistry.

[28]  G. Heijne The distribution of positively charged residues in bacterial inner membrane proteins correlates with the trans‐membrane topology , 1986, The EMBO journal.

[29]  John R. Koza,et al.  Genetic programming - on the programming of computers by means of natural selection , 1993, Complex adaptive systems.

[30]  G. Vonheijne,et al.  Control of topology and mode of assembly of a polytopic membrane protein by positively charged residues , 1989, Nature.

[31]  T. Sejnowski,et al.  Predicting the secondary structure of globular proteins using neural network models. , 1988, Journal of molecular biology.

[32]  J. Nathans,et al.  Isolation and nucleotide sequence of the gene encoding human rhodopsin. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[33]  M. Degli Esposti,et al.  A critical evaluation of the hydropathy profile of membrane proteins. , 1990, European journal of biochemistry.

[34]  A. Zamyatnin,et al.  Protein volume in solution. , 1972, Progress in biophysics and molecular biology.

[35]  Y. Matsuo,et al.  Three novel molecular forms of biliary glycoprotein deduced from cDNA clones from a human leukocyte library. , 1991, Biochemical and biophysical research communications.

[36]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[37]  R M Stroud,et al.  Amphipathic analysis and possible formation of the ion channel in an acetylcholine receptor. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[38]  C. Chothia The nature of the accessible and buried surfaces in proteins. , 1976, Journal of molecular biology.

[39]  W A Gilbert,et al.  The prediction of transmembrane protein sequences and their conformation: an evaluation. , 1990, Trends in biochemical sciences.