A new method for building protein conformations from sequence alignments with homologues of known structure.

We describe a largely automatic procedure for building protein structures from sequence alignments with homologues of known structure. This procedure uses simple rules by which multiple sequence alignments can be translated into distance and chirality constraints, which are then used as input for distance geometry calculations. By this means one obtains an ensemble of conformations for the unknown structure that are compatible with the rules employed, and the differences among these conformations provide an indication of the reliability of the structure prediction. The overall approach is demonstrated here by applying it to several Kazal-type trypsin inhibitors, for which experimentally determined structures are available. On the basis of our experience with these test problems, we have further predicted the conformation of the human pancreatic secretory trypsin inhibitor, for which no experimentally determined structure is presently available.

[1]  Jay W. Ponder,et al.  Tertiary Templates for Proteins Use of Packing Criteria in the Enumeration of Allowed Different Structural Classes Sequences , 1987 .

[2]  Timothy F. Havel,et al.  Solution conformation of proteinase inhibitor IIA from bull seminal plasma by 1H nuclear magnetic resonance and distance geometry. , 1985, Journal of molecular biology.

[3]  S. B. Needleman,et al.  A general method applicable to the search for similarities in the amino acid sequence of two proteins. , 1970, Journal of molecular biology.

[4]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[5]  Timothy F. Havel,et al.  A distance geometry program for determining the structures of small proteins and other macromolecules from nuclear magnetic resonance measurements of intramolecular1H−1H proximities in solution , 1984 .

[6]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[7]  M. Karplus,et al.  Prediction of the folding of short polypeptide segments by uniform conformational sampling , 1987, Biopolymers.

[8]  J F Riordan,et al.  A preliminary three-dimensional structure of angiogenin. , 1986, Proceedings of the National Academy of Sciences of the United States of America.

[9]  M Bolognesi,et al.  Three-dimensional structure of the complex between pancreatic secretory trypsin inhibitor (Kazal type) and trypsinogen at 1.8 A resolution. Structure solution, crystallographic refinement and preliminary structural interpretation. , 1982, Journal of molecular biology.

[10]  Christus,et al.  A General Method Applicable to the Search for Similarities in the Amino Acid Sequence of Two Proteins , 2022 .

[11]  J. Devereux,et al.  A comprehensive set of sequence analysis programs for the VAX , 1984, Nucleic Acids Res..

[12]  Protein Structure and Function by Comparative Model Building , 1985 .

[13]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[14]  T. Blundell,et al.  Knowledge based modelling of homologous proteins, Part I: Three-dimensional frameworks derived from the simultaneous superposition of multiple structures. , 1987, Protein engineering.

[15]  M Levitt,et al.  The predicted structure of immunoglobulin D1.3 and its comparison with the crystal structure , 1986, Science.

[16]  L M Amzel,et al.  Calculating three‐dimensional changes in protein structure due to amino‐acid substitutions: The variable region of immunoglobulins , 1986, Proteins.

[17]  A M Lesk,et al.  Evolution of proteins formed by beta-sheets. I. Plastocyanin and azurin. , 1982, Journal of molecular biology.

[18]  J. Moult,et al.  An algorithm for determining the conformation of polypeptide segments in proteins by systematic search , 1986, Proteins.

[19]  M Karplus,et al.  Analysis of side-chain orientations in homologous proteins. , 1987, Journal of molecular biology.

[20]  Timothy F. Havel,et al.  The sampling properties of some distance geometry algorithms applied to unconstrained polypeptide chains: A study of 1830 independently computed conformations , 1990, Biopolymers.

[21]  B. Furie,et al.  PART II. COMPUTER‐ASSISTED MACROMOLECULAR STRUCTURE GENERATION: EXTENSION OF EXISTING INFORMATION: On the Construction of Computer Models of Proteins by the Extension of Crystallographic Structures , 1985, Annals of the New York Academy of Sciences.

[22]  R. Huber,et al.  The crystal and molecular structure of the third domain of silver pheasant ovomucoid (OMSVP3). , 1985, European journal of biochemistry.

[23]  R. Huber,et al.  Crystallographic refinement of Japanese quail ovomucoid, a Kazal-type inhibitor, and model building studies of complexes with serine proteases. , 1982, Journal of molecular biology.

[24]  T L Blundell,et al.  Comparison of solvent-inaccessible cores of homologous proteins: definitions useful for protein modelling. , 1987, Protein engineering.

[25]  John P. Overington,et al.  Knowledge‐based protein modelling and design , 1988 .

[26]  H. Scheraga,et al.  4 – Conformational Analysis of Polypeptides: Application to Homologous Proteins , 1978 .

[27]  Timothy F. Havel,et al.  The theory and practice of distance geometry , 1983, Bulletin of Mathematical Biology.

[28]  M Karplus,et al.  Construction of side-chains in homology modelling. Application to the C-terminal lobe of rhizopuspepsin. , 1989, Journal of molecular biology.

[29]  B. L. Sibanda,et al.  Three-dimensional structure, specificity and catalytic mechanism of renin , 1983, Nature.

[30]  J. Greer,et al.  Model structure for the inflammatory protein C5a. , 1985, Science.

[31]  P. Kollman,et al.  An all atom force field for simulations of proteins and nucleic acids , 1986, Journal of computational chemistry.