Large-Scale Comparative Protein Modelling

Proteins are the working molecules in most biological processes and a large portion of experimental biology is aimed at understanding their function. To understand function it is necessary to know the shape of the protein and where or how it interacts with other proteins or substrates. This is approached by first solving the three-dimensional (3-D) structure and then changing the protein sequence to see how the structure and/or function is altered. Site-directed mutagenesis experiments are generally designed to provide information about the involvement of a protein’s specific residues in enzyme-based reactions, molecular recognition events, protein stability and during drug design projects. The experimental elucidation of 3-D structures by X-ray crystallography or NMR is, however, often hampered by difficulties in obtaining sufficient pure protein, diffracting crystals, the size of the protein or its subunits for NMR studies and many other technical aspects. This is illustrated by the fact that the SWISS-PROT/TrEMBL database (Bairoch and Apweiler 1997) contains around 175,000 sequences, while the Brookhaven Protein Data Bank (PDB) (Abola et al. 1996) contains about 6,000 3-D structures from 1,700 different proteins. The number of solved 3-D structures is increasing very slowly compared to the rate of sequencing of novel cDNAs, and no structural information is available for the vast majority of known protein sequences. This gap will of course, further increase as genome sequencing projects continue to yield large numbers of novel sequences.

[1]  D. Lipman,et al.  Improved tools for biological sequence comparison. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Jaime Prilusky,et al.  The Protein Data Bank: Current Status and Future Challenges , 1996, Journal of research of the National Institute of Standards and Technology.

[3]  W. Miller,et al.  A time-efficient, linear-space local similarity algorithm , 1991 .

[4]  Manuel C. Peitsch,et al.  Large Scale Protein Modeling and Model Repository , 1997, ISMB.

[5]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its new supplement TREMBL , 1996, Nucleic Acids Res..

[6]  D Chatterjee,et al.  Analysis of six protein structures predicted by comparative modeling techniques , 1995, Proteins.

[7]  Rolf Apweiler,et al.  The SWISS-PROT protein sequence data bank and its supplement TrEMBL , 1997, Nucleic Acids Res..

[8]  Manuel C. Peitsch,et al.  Protein Modeling by E-mail , 1995, Bio/Technology.

[9]  R D Appel,et al.  A new generation of information retrieval tools for biologists: the example of the ExPASy WWW server. , 1994, Trends in biochemical sciences.

[10]  J Bajorath,et al.  Knowledge‐based model building of proteins: Concepts and examples , 1993, Protein science : a publication of the Protein Society.

[11]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[12]  A. Lesk,et al.  The relation between the divergence of sequence and structure in proteins. , 1986, The EMBO journal.

[13]  R D Appel,et al.  Large‐scale protein modelling and integration with the SWISS‐PROT and SWISS‐2DPAGE databases: The example of Escherichia coli , 1997, Electrophoresis.

[14]  M. Sippl Recognition of errors in three‐dimensional structures of proteins , 1993, Proteins.

[15]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[16]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .

[17]  G. Petsko,et al.  Effects of temperature on protein structure and dynamics: X-ray crystallographic studies of the protein ribonuclease-A at nine different temperatures from 98 to 320 K. , 1993, Biochemistry.

[18]  M C Peitsch,et al.  ProMod and Swiss-Model: Internet-based tools for automated comparative protein modelling. , 1996, Biochemical Society transactions.

[19]  R A Sayle,et al.  RASMOL: biomolecular graphics for all. , 1995, Trends in biochemical sciences.