Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA.

Combined automated NOE assignment and structure determination module (CANDID) is a new software for efficient NMR structure determination of proteins by automated assignment of the NOESY spectra. CANDID uses an iterative approach with multiple cycles of NOE cross-peak assignment and protein structure calculation using the fast DYANA torsion angle dynamics algorithm, so that the result from each CANDID cycle consists of exhaustive, possibly ambiguous NOE cross-peak assignments in all available spectra and a three-dimensional protein structure represented by a bundle of conformers. The input for the first CANDID cycle consists of the amino acid sequence, the chemical shift list from the sequence-specific resonance assignment, and listings of the cross-peak positions and volumes in one or several two, three or four-dimensional NOESY spectra. The input for the second and subsequent CANDID cycles contains the three-dimensional protein structure from the previous cycle, in addition to the complete input used for the first cycle. CANDID includes two new elements that make it robust with respect to the presence of artifacts in the input data, i.e. network-anchoring and constraint-combination, which have a key role in de novo protein structure determinations for the successful generation of the correct polypeptide fold by the first CANDID cycle. Network-anchoring makes use of the fact that any network of correct NOE cross-peak assignments forms a self-consistent set; the initial, chemical shift-based assignments for each individual NOE cross-peak are therefore weighted by the extent to which they can be embedded into the network formed by all other NOE cross-peak assignments. Constraint-combination reduces the deleterious impact of artifact NOE upper distance constraints in the input for a protein structure calculation by combining the assignments for two or several peaks into a single upper limit distance constraint, which lowers the probability that the presence of an artifact peak will influence the outcome of the structure calculation. CANDID test calculations were performed with NMR data sets of four proteins for which high-quality structures had previously been solved by interactive protocols, and they yielded comparable results to these reference structure determinations with regard to both the residual constraint violations, and the precision and accuracy of the atomic coordinates. The CANDID approach has further been validated by de novo NMR structure determinations of four additional proteins. The experience gained in these calculations shows that once nearly complete sequence-specific resonance assignments are available, the automated CANDID approach results in greatly enhanced efficiency of the NOESY spectral analysis. The fact that the correct fold is obtained in cycle 1 of a de novo structure calculation is the single most important advance achieved with CANDID, when compared with previously proposed automated NOESY assignment methods that do not use network-anchoring and constraint-combination.

[1]  K Wüthrich,et al.  Improved efficiency of protein structure calculations from NMR data using the program DIANA with redundant dihedral angle constraints , 1991, Journal of biomolecular NMR.

[2]  Stephen W. Fesik,et al.  A computer-based protocol for semiautomated assignments and 3D structure determination of proteins , 1994, Journal of biomolecular NMR.

[3]  Richard R. Ernst,et al.  Elucidation of cross relaxation in liquids by two-dimensional N.M.R. spectroscopy , 1980 .

[4]  Kurt Wüthrich,et al.  The program ASNO for computer-supported collection of NOE upper distance constraints as input for protein structure determination , 1993 .

[5]  Werner Braun,et al.  Automated combined assignment of NOESY spectra and three-dimensional protein structure determination , 1997, Journal of biomolecular NMR.

[6]  I. Solomon Relaxation Processes in a System of Two Spins , 1955 .

[7]  Martin Billeter,et al.  Point-centered domain decomposition for parallel molecular dynamics simulation , 2000 .

[8]  K Wüthrich,et al.  A two-dimensional nuclear Overhauser enhancement (2D NOE) experiment for the elucidation of complete proton-proton cross-relaxation networks in biological macromolecules. , 1980, Biochemical and biophysical research communications.

[9]  K. Wüthrich NMR of proteins and nucleic acids , 1988 .

[10]  M. Billeter,et al.  The new program OPAL for molecular dynamics simulations and energy refinements of biological macromolecules , 1996, Journal of biomolecular NMR.

[11]  Michael Nilges,et al.  Floating stereospecific assignment revisited: Application to an 18 kDa protein and comparison with J-coupling data , 1997, Journal of biomolecular NMR.

[12]  Axel T. Brunger,et al.  X-PLOR Version 3.1: A System for X-ray Crystallography and NMR , 1992 .

[13]  R. Diamond,et al.  Treatment of NOE constraints involving equivalent or nonstereoassigned protons in calculations of biomacromolecular structures , 1996, Journal of biomolecular NMR.

[14]  K. Wüthrich,et al.  NMR solution structure of the pathogenesis-related protein P14a. , 1997, Journal of molecular biology.

[15]  R J Read,et al.  Crystallography & NMR system: A new software suite for macromolecular structure determination. , 1998, Acta crystallographica. Section D, Biological crystallography.

[16]  P Luginbühl,et al.  NMR structure reveals intramolecular regulation mechanism for pheromone binding and release , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[17]  H. Jane Dyson,et al.  SANE (Structure Assisted NOE Evaluation): An automated model-based approach for NOE assignment , 2001, Journal of biomolecular NMR.

[18]  G. Montelione,et al.  Solution NMR structure and folding dynamics of the N terminus of a rat non-muscle alpha-tropomyosin in an engineered chimeric protein. , 2001, Journal of molecular biology.

[19]  W. Braun,et al.  Automated assignment of simulated and experimental NOESY spectra of proteins by feedback filtering and self-correcting distance geometry. , 1995, Journal of molecular biology.

[20]  M. Billeter,et al.  MOLMOL: a program for display and analysis of macromolecular structures. , 1996, Journal of molecular graphics.

[21]  K. Wüthrich,et al.  Torsion angle dynamics for NMR structure calculation with the new program DYANA. , 1997, Journal of molecular biology.

[22]  M Nilges,et al.  A calculation strategy for the structure determination of symmetric demers by 1H NMR , 1993, Proteins.

[23]  K Wüthrich,et al.  Efficient computation of three-dimensional protein structures in solution from nuclear magnetic resonance data using the program DIANA and the supporting programs CALIBA, HABAS and GLOMSA. , 1991, Journal of molecular biology.

[24]  H. Scheraga,et al.  Energy parameters in polypeptides. 9. Updating of geometrical parameters, nonbonded interactions, and hydrogen bond interactions for the naturally occurring amino acids , 1983 .

[25]  S. Zinn-Justin,et al.  Variability in automated assignment of NOESY spectra and three-dimensional structure determination: A test case on three small disulfide-bonded proteins , 2001, Journal of biomolecular NMR.

[26]  M Nilges,et al.  Calculation of protein structures with ambiguous distance restraints. Automated assignment of ambiguous NOE crosspeaks and disulphide connectivities. , 1995, Journal of molecular biology.

[27]  H Oschkinat,et al.  Automated NOESY interpretation with ambiguous distance restraints: the refined NMR solution structure of the pleckstrin homology domain from beta-spectrin. , 1997, Journal of molecular biology.

[28]  M. Billeter,et al.  Automated peak picking and peak integration in macromolecular NMR spectra using AUTOPSY. , 1998, Journal of magnetic resonance.

[29]  K Wüthrich,et al.  NMR structure of the bovine prion protein. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Torsten Herrmann,et al.  NMR Structure and Metal Interactions of the CopZ Copper Chaperone* , 1999, The Journal of Biological Chemistry.

[31]  K Wüthrich,et al.  Pseudo-structures for the 20 common amino acids for use in studies of protein conformations by measurements of intramolecular proton-proton distance constraints with nuclear magnetic resonance. , 1983, Journal of molecular biology.

[32]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules , 1995 .

[33]  K Wüthrich,et al.  NMR structures of three single-residue variants of the human prion protein. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[34]  J. Thornton,et al.  Stereochemical quality of protein structure coordinates , 1992, Proteins.

[35]  P. Kollman,et al.  A Second Generation Force Field for the Simulation of Proteins, Nucleic Acids, and Organic Molecules J. Am. Chem. Soc. 1995, 117, 5179−5197 , 1996 .

[36]  Michael Nilges,et al.  Ambiguous NOEs and automated NOE assignment , 1998 .

[37]  Timothy F. Havel,et al.  Solution conformation of proteinase inhibitor IIA from bull seminal plasma by 1H nuclear magnetic resonance and distance geometry. , 1985, Journal of molecular biology.

[38]  K Wüthrich,et al.  Sequential resonance assignments as a basis for determination of spatial protein structures by high resolution proton nuclear magnetic resonance. , 1982, Journal of molecular biology.

[39]  R. Riek,et al.  NMR structure of the calreticulin P-domain , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[40]  H N Moseley,et al.  Automated analysis of NMR assignments and structures for proteins. , 1999, Current opinion in structural biology.

[41]  K Wüthrich,et al.  Conformational analysis of protein and nucleic acid fragments with the new grid search algorithm FOUND , 1998, Journal of biomolecular NMR.

[42]  Kurt Wüthrich,et al.  Ancestral βγ-crystallin precursor structure in a yeast killer toxin , 1996, Nature Structural Biology.

[43]  K Wüthrich,et al.  The program XEASY for computer-supported NMR spectral analysis of biological macromolecules , 1995, Journal of biomolecular NMR.