The quality and validation of structures from structural genomics.

Quality control of three-dimensional structures of macromolecules is a critical step to ensure the integrity of structural biology data, especially those produced by structural genomics centers. Whereas the Protein Data Bank (PDB) has proven to be a remarkable success overall, the inconsistent quality of structures reveals a lack of universal standards for structure/deposit validation. Here, we review the state-of-the-art methods used in macromolecular structure validation, focusing on validation of structures determined by X-ray crystallography. We describe some general protocols used in the rebuilding and re-refinement of problematic structural models. We also briefly discuss some frontier areas of structure validation, including refinement of protein-ligand complexes, automation of structure redetermination, and the use of NMR structures and computational models to solve X-ray crystal structures by molecular replacement.

[1]  K Wüthrich,et al.  Hydration of proteins. A comparison of experimental residence times of water molecules solvating the bovine pancreatic trypsin inhibitor with theoretical model calculations. , 1993, Journal of molecular biology.

[2]  Alexander Wlodawer,et al.  Unmet challenges of structural genomics. , 2010, Current opinion in structural biology.

[3]  Seth Cooper,et al.  High-resolution structure of a retroviral protease folded as a monomer , 2011 .

[4]  F. Allen The Cambridge Structural Database: a quarter of a million crystal structures and rising. , 2002, Acta crystallographica. Section B, Structural science.

[5]  David Baker,et al.  proteins STRUCTURE O FUNCTION O BIOINFORMATICS Improving NMR protein structure quality by Rosetta refinement: A molecular , 2022 .

[6]  N. Colloc'h,et al.  Comparison of three algorithms for the assignment of secondary structure in proteins: the advantages of a consensus assignment. , 1993, Protein engineering.

[7]  V. Lamzin,et al.  Fragmentation-tree density representation for crystallographic modelling of bound ligands. , 2012, Journal of molecular biology.

[8]  Thomas C. Terwilliger,et al.  Improving macromolecular atomic models at moderate resolution by automated iterative model building, statistical density modification and refinement , 2003, Acta crystallographica. Section D, Biological crystallography.

[9]  Vincent B. Chen,et al.  Correspondence e-mail: , 2000 .

[10]  C. Sander,et al.  Errors in protein structures , 1996, Nature.

[11]  Z. Popovic,et al.  Crystal structure of a monomeric retroviral protease solved by protein folding game players , 2011, Nature Structural &Molecular Biology.

[12]  Dominik Gront,et al.  Assessing the accuracy of template-based structure prediction metaservers by comparison with structural genomics structures , 2012, Journal of Structural and Functional Genomics.

[13]  Garib N. Murshudov,et al.  JLigand: a graphical tool for the CCP4 template-restraint library , 2012, Acta crystallographica. Section D, Biological crystallography.

[14]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[15]  G. Otting,et al.  Proton exchange with internal water molecules in the protein BPTI in aqueous solution , 1991 .

[16]  P Andrew Karplus,et al.  Mapping the active site helix-to-strand conversion of CxxxxC peroxiredoxin Q enzymes. , 2012, Biochemistry.

[17]  S J Wodak,et al.  SFCHECK: a unified set of procedures for evaluating the quality of macromolecular structure-factor data and their agreement with the atomic model. , 1999, Acta crystallographica. Section D, Biological crystallography.

[18]  S. Wodak,et al.  Deviations from standard atomic volumes as a quality measure for protein crystal structures. , 1996, Journal of molecular biology.

[19]  G. N. Ramachandran,et al.  Stereochemistry of polypeptide chain configurations. , 1963, Journal of molecular biology.

[20]  Wladek Minor,et al.  HKL-3000: the integration of data reduction and structure solution--from diffraction images to an initial model in minutes. , 2006, Acta crystallographica. Section D, Biological crystallography.

[21]  R. Huber,et al.  Accurate Bond and Angle Parameters for X-ray Protein Structure Refinement , 1991 .

[22]  Thomas Terwilliger,et al.  SOLVE and RESOLVE: automated structure solution, density modification and model building. , 2004, Journal of synchrotron radiation.

[23]  M. Jaskólski,et al.  Protein crystallography for non‐crystallographers, or how to get the best (but not more) from published macromolecular structures , 2008, The FEBS journal.

[24]  P. Andrew Karplus,et al.  Linking Crystallographic Model and Data Quality , 2012, Science.

[25]  G Vriend,et al.  WHAT IF: a molecular modeling and drug design program. , 1990, Journal of molecular graphics.

[26]  A. Gronenborn,et al.  Crystal structure of interleukin 8: symbiosis of NMR and crystallography. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[27]  T J Oldfield,et al.  SQUID: a program for the analysis and display of data from crystallography and molecular dynamics. , 1992, Journal of molecular graphics.

[28]  Edwin Pozharski,et al.  Techniques, tools and best practices for ligand electron-density analysis and results from their application to deposited crystal structures. , 2013, Acta crystallographica. Section D, Biological crystallography.

[29]  O. Carugo,et al.  How many water molecules can be detected by protein crystallography? , 1999, Acta crystallographica. Section D, Biological crystallography.

[30]  Zukang Feng,et al.  Automated and accurate deposition of structures solved by X-ray diffraction to the Protein Data Bank. , 2004, Acta crystallographica. Section D, Biological crystallography.

[31]  Anastassis Perrakis,et al.  Automated protein model building combined with iterative structure refinement , 1999, Nature Structural Biology.

[32]  J. Thornton,et al.  PROCHECK: a program to check the stereochemical quality of protein structures , 1993 .

[33]  Maksymilian Chruszcz,et al.  Benefits of structural genomics for drug discovery research. , 2009, Infectious disorders drug targets.

[34]  Jure Pražnikar,et al.  PURY: a database of geometric restraints of hetero compounds for refinement in complexes with macromolecular structures , 2008, Acta crystallographica. Section D, Biological crystallography.

[35]  Alan E. Mark,et al.  Challenges in the determination of the binding modes of non-standard ligands in X-ray crystal complexes , 2011, J. Comput. Aided Mol. Des..

[36]  Randy J. Read,et al.  Improved molecular replacement by density- and energy-guided protein structure optimization , 2011, Nature.

[37]  Adrien Treuille,et al.  Predicting protein structures with a multiplayer online game , 2010, Nature.

[38]  David S. Wishart,et al.  PROSESS: a protein structure evaluation suite and server , 2010, Nucleic Acids Res..

[39]  Randy J. Read,et al.  Acta Crystallographica Section D Biological , 2003 .

[40]  Krista Joosten,et al.  PDB_REDO: constructive validation, more than just looking for errors , 2012, Acta crystallographica. Section D, Biological crystallography.

[41]  R. Bryant,et al.  The dynamics of water-protein interactions. , 1996, Annual review of biophysics and biomolecular structure.