A molecular dynamics approach for the generation of complete protein structures from limited coordinate data

Generation of full protein coordinates from limited information, e.g., the Cα coordinates, is an important step in protein homology modeling and structure determination, and molecular dynamics (MD) simulations may prove to be important in this task. We describe a new method, in which the protein backbone is built quickly in a rather crude way and then refined by minimization techniques. Subsequently, the side chains are positioned using extensive MD calculations. The method is tested on two proteins, and results compared to proteins constructed using two other MD‐based methods. In the first method, we supplemented an existing backbone building method with a new procedure to add side chains. The second one largely consists of available methodology. The constructed proteins are compared to the corresponding X‐ray structures, which became available during this study, and they are in good agreement (backbone RMS values of 0.5–0.7 Å, and all‐atom RMS values of 1.5–1.9 Å). This comparative study indicates that extensive MD simulations are able, to some extent, to generate details of the native protein structure, and may contribute to the development of a standardized methodology to predict reliably (parts of) protein structures when only partial coordinate data are available. © 1994 John Wiley & Sons, Inc.

[1]  J J Wendoloski,et al.  PROBIT: a statistical approach to modeling proteins from partial coordinate data using substructure libraries. , 1992, Journal of molecular graphics.

[2]  B. Kastner,et al.  Structure of spliceosomal snRNPs and their role in pre-mRNA splicing. , 1990, Biochimica et biophysica acta.

[3]  J. Ponder,et al.  Tertiary templates for proteins. Use of packing criteria in the enumeration of allowed sequences for different structural classes. , 1987, Journal of molecular biology.

[4]  M. Levitt Accurate modeling of protein conformation by automatic segment matching. , 1992, Journal of molecular biology.

[5]  S. Wodak,et al.  Modelling the polypeptide backbone with 'spare parts' from known protein structures. , 1989, Protein engineering.

[6]  W. Kabsch,et al.  Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features , 1983, Biopolymers.

[7]  P. Evans,et al.  Crystal structure of the RNA-binding domain of the U1 small nuclear ribonucleoprotein A , 1990, Nature.

[8]  B. Lee,et al.  The interpretation of protein structures: estimation of static accessibility. , 1971, Journal of molecular biology.

[9]  H. Berendsen,et al.  ALGORITHMS FOR MACROMOLECULAR DYNAMICS AND CONSTRAINT DYNAMICS , 1977 .

[10]  M. Sternberg,et al.  Analysis of the relationship between side-chain conformation and secondary structure in globular proteins. , 1987, Journal of molecular biology.

[11]  Paul E. Correa,et al.  The building of protein structures form α‐carbon coordinates , 1990 .

[12]  S. Harvey Treatment of electrostatic effects in macromolecular modeling , 1989, Proteins.

[13]  R. Bruccoleri,et al.  Criteria that discriminate between native proteins and incorrectly folded models , 1988, Proteins.

[14]  R. Lavery,et al.  A new approach to the rapid determination of protein side chain conformations. , 1991, Journal of biomolecular structure & dynamics.

[15]  L. Lebioda,et al.  Refined structure of yeast apo-enolase at 2.25 A resolution. , 1990, Journal of molecular biology.

[16]  J. M. Thornton,et al.  Prediction of super-secondary structure in proteins , 1983, Nature.

[17]  D. Scherly,et al.  Identification of the RNA binding segment of human U1 A protein and definition of its binding site on U1 snRNA. , 1989, The EMBO journal.

[18]  J Moult,et al.  Molecular dynamics study of the structure and dynamics of a protein molecule in a crystalline ionic environment, Streptomyces griseus protease A. , 1990, Biochemistry.

[19]  C. Sander,et al.  Database algorithm for generating protein backbone and side-chain co-ordinates from a C alpha trace application to model building and detection of co-ordinate errors. , 1991, Journal of molecular biology.

[20]  D. Osguthorpe,et al.  Structure and energetics of ligand binding to proteins: Escherichia coli dihydrofolate reductase‐trimethoprim, a drug‐receptor system , 1988, Proteins.

[21]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[22]  Janet M. Thornton,et al.  Rebuilding flavodoxin from Cα coordinates: A test study , 1989 .

[23]  T. L. Blundell,et al.  Knowledge-based prediction of protein structures and the design of novel molecules , 1987, Nature.

[24]  R. Bruccoleri,et al.  Application of a directed conformational search for generating 3‐D coordinates for protein structures from α‐carbon coordinates , 1992, Proteins.

[25]  T. A. Jones,et al.  Using known substructures in protein model building and crystallography. , 1986, The EMBO journal.

[26]  M. Karplus,et al.  CHARMM: A program for macromolecular energy, minimization, and dynamics calculations , 1983 .