Free R value: cross-validation in crystallography.

Publisher Summary This chapter focuses on the free R value and other applications of cross-validation in crystallography. Powerful methods have been developed to lower the chances of misinterpreting or overinterpreting diffraction data. Some of the more important methods measure the agreement of the structure with empirical rules about protein folds, comprehensive conformational analyses, the real-space correlation coefficient, and the free R value. The protein-folding rules and conformational analyses depend on empirical knowledge of protein structure. They validate the model regardless of the fit to the diffraction data. In contrast, the real-space correlation coefficient and the free R value are entirely diffraction data based and are applicable to any macromolecule. They validate the extent to which the model explains the diffraction data. Even the best bulk solvent and multiconformer models result in free R values of around or slightly above 20% for the penicillopepsin crystal structure. A free R value of 20% is significantly higher than expected from the estimated high statistical quality of the diffraction data. Thus, it is conceivable that the present models for solvation and thermal motion are incomplete. Alternatively, the intensity data might be affected by systematic errors of unknown origin. These facts point to the need for the solution of benchmark macromolecular structures at high resolution and with accurate experimental phases.

[1]  M. Sippl Calculation of conformational ensembles from potentials of mean force. An approach to the knowledge-based prediction of local structures in globular proteins. , 1990, Journal of molecular biology.

[2]  P Gros,et al.  Inclusion of thermal motion in crystallographic structures by restrained molecular dynamics. , 1990, Science.

[3]  Hans Frauenfelder,et al.  Temperature-dependent X-ray diffraction as a probe of protein structural dynamics , 1979, Nature.

[4]  V. Luzzati,et al.  Traitement statistique des erreurs dans la determination des structures cristallines , 1952 .

[5]  G. Bricogne Maximum entropy and the foundations of direct methods , 1984 .

[6]  B Efron,et al.  Statistical Data Analysis in the Computer Age , 1991, Science.

[7]  M. Karplus,et al.  Anisotropy and anharmonicity of atomic fluctuations in proteins: implications for X-ray analysis. , 1988, Biochemistry.

[8]  D. Eisenberg,et al.  Assessment of protein models with three-dimensional profiles , 1992, Nature.

[9]  R Diamond,et al.  On the use of normal modes in thermal parameter refinement: theory and application to the bovine pancreatic trypsin inhibitor. , 1990, Acta crystallographica. Section A, Foundations of crystallography.

[10]  J. Zou,et al.  Improved methods for building protein models in electron density maps and the location of errors in these models. , 1991, Acta crystallographica. Section A, Foundations of crystallography.

[11]  Olga Kennard,et al.  Systematic analysis of structural data as a research technique in organic chemistry , 1983 .

[12]  W. Hendrickson Stereochemically restrained refinement of macromolecular structures. , 1985, Methods in enzymology.

[13]  M. James,et al.  Structure and refinement of penicillopepsin at 1.8 A resolution. , 1983, Journal of molecular biology.

[14]  R.D.B. Fraser,et al.  An improved method for calculating the contribution of solvent to the X‐ray diffraction pattern of biological molecules , 1978 .

[15]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[16]  Axel T. Brunger,et al.  Thermal Motion and Conformational Disorder in Protein Crystal Structures: Comparison of Multi‐Conformer and Time‐Averaging Models , 1994 .

[17]  T. Jones,et al.  Between objectivity and subjectivity , 1990, Nature.

[18]  J. Skehel,et al.  Refinement of the influenza virus hemagglutinin by simulated annealing. , 1991, Journal of molecular biology.

[19]  D. Eisenberg,et al.  Crystal structure of the unactivated form of ribulose-1,5-bisphosphate carboxylase/oxygenase from tobacco refined at 2.0-A resolution. , 1992, The Journal of biological chemistry.

[20]  R. Kretsinger,et al.  Refinement of the structure of carp muscle calcium-binding parvalbumin by model building and difference Fourier analysis. , 1976, Journal of molecular biology.

[21]  M. Karplus,et al.  Anisotropy and anharmonicity of atomic fluctuations in proteins: Analysis of a molecular dynamics simulation , 1987, Proteins.

[22]  G J Williams,et al.  The Protein Data Bank: a computer-based archival file for macromolecular structures. , 1977, Journal of molecular biology.

[23]  B. Matthews Solvent content of protein crystals. , 1968, Journal of molecular biology.

[24]  A T Brünger,et al.  Slow-cooling protocols for crystallographic refinement by simulated annealing. , 1990, Acta crystallographica. Section A, Foundations of crystallography.

[25]  J. Badger Multiple hydration layers in cubic insulin crystals. , 1993, Biophysical journal.

[26]  B. C. Wang Resolution of phase ambiguity in macromolecular crystallography. , 1985, Methods in enzymology.

[27]  M. Karplus,et al.  Crystallographic refinement by simulated annealing: application to crambin , 1989 .

[28]  S. Phillips,et al.  Structure and refinement of oxymyoglobin at 1.6 A resolution. , 1980, Journal of molecular biology.

[29]  Axel T. Brunger,et al.  Model bias in macromolecular crystal structures , 1992 .

[30]  John Kuriyan,et al.  Exploration of disorder in protein structures by X‐ray restrained molecular dynamics , 1991, Proteins.

[31]  M Karplus,et al.  Effect of anisotropy and anharmonicity on protein crystallographic refinement. An evaluation by molecular dynamics. , 1986, Journal of molecular biology.

[32]  A. Brünger,et al.  Torsion angle dynamics: Reduced variable conformational sampling enhances crystallographic structure refinement , 1994, Proteins.

[33]  A. Brunger Free R value: a novel statistical quantity for assessing the accuracy of crystal structures. , 1992 .

[34]  G J Kleywegt,et al.  Where freedom is given, liberties are taken. , 1995, Structure.

[35]  A T Brünger,et al.  Direct Observation of Protein Solvation and Discrete Disorder with Experimental Crystallographic Phases , 1996, Science.

[36]  William H. Press,et al.  Numerical recipes , 1990 .

[37]  M. James,et al.  Penicillopepsin from Penicillium janthinellum crystal structure at 2.8 Å and sequence homology with porcine pepsin , 1977, Nature.

[38]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[39]  W. Härdle,et al.  How Far are Automatically Chosen Regression Smoothing Parameters from their Optimum , 1988 .

[40]  Axel T. Brunger,et al.  X-PLOR Version 3.1: A System for X-ray Crystallography and NMR , 1992 .

[41]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[42]  Janet M. Thornton,et al.  Protein fold recognition , 1993, J. Comput. Aided Mol. Des..

[43]  T. A. Jones,et al.  Crystal structures of cellular retinoic acid binding proteins I and II in complex with all-trans-retinoic acid and a synthetic retinoid. , 1995, Structure.

[44]  M S Chapman,et al.  Tertiary structure of plant RuBisCO: domains and their contacts. , 1988, Science.

[45]  R. Read Improved Fourier Coefficients for Maps Using Phases from Partial Structures with Errors , 1986 .

[46]  W. C. Hamilton Significance tests on the crystallographic R factor , 1965 .

[47]  A T Brünger,et al.  Protein hydration observed by X-ray diffraction. Solvation properties of penicillopepsin and neuraminidase crystal structures. , 1994, Journal of molecular biology.

[48]  Benno P. Schoenborn,et al.  Hydration in protein crystals. A neutron diffraction analysis of carbonmonoxymyoglobin , 1990 .

[49]  Michael Levitt,et al.  Refinement of Large Structures by Simultaneous Minimization of Energy and R Factor , 1978 .

[50]  A. Gronenborn,et al.  Assessing the quality of solution nuclear magnetic resonance structures by complete cross-validation. , 1993, Science.

[51]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .