The good, the bad and the dubious: VHELIBS, a validation helper for ligands and binding sites

BackgroundMany Protein Data Bank (PDB) users assume that the deposited structural models are of high quality but forget that these models are derived from the interpretation of experimental data. The accuracy of atom coordinates is not homogeneous between models or throughout the same model. To avoid basing a research project on a flawed model, we present a tool for assessing the quality of ligands and binding sites in crystallographic models from the PDB.ResultsThe Validation HElper for LIgands and Binding Sites (VHELIBS) is software that aims to ease the validation of binding site and ligand coordinates for non-crystallographers (i.e., users with little or no crystallography knowledge). Using a convenient graphical user interface, it allows one to check how ligand and binding site coordinates fit to the electron density map. VHELIBS can use models from either the PDB or the PDB_REDO databank of re-refined and re-built crystallographic models. The user can specify threshold values for a series of properties related to the fit of coordinates to electron density (Real Space R, Real Space Correlation Coefficient and average occupancy are used by default). VHELIBS will automatically classify residues and ligands as Good, Dubious or Bad based on the specified limits. The user is also able to visually check the quality of the fit of residues and ligands to the electron density map and reclassify them if needed.ConclusionsVHELIBS allows inexperienced users to examine the binding site and the ligand coordinates in relation to the experimental data. This is an important step to evaluate models for their fitness for drug discovery purposes such as structure-based pharmacophore development and protein-ligand docking experiments.

[1]  Paul N. Mortenson,et al.  Diverse, high-quality test set for the validation of protein-ligand docking performance. , 2007, Journal of medicinal chemistry.

[2]  Gert Vriend,et al.  PDB Improvement Starts with Data Deposition , 2007, Science.

[3]  J. Zou,et al.  Improved methods for building protein models in electron density maps and the location of errors in these models. , 1991, Acta crystallographica. Section A, Foundations of crystallography.

[4]  Gale Rhodes Model and Molecule , 2006 .

[5]  Gale Rhodes,et al.  Crystallography made crystal clear : a guide for users ofmacromolecular models , 1993 .

[6]  J. Martin,et al.  Structures of free and complexed forms of Escherichia coli xanthine-guanine phosphoribosyltransferase. , 1998, Journal of Molecular Biology.

[7]  W. Delano The PyMOL Molecular Graphics System , 2002 .

[8]  Keehyoung Joo,et al.  Improving physical realism, stereochemistry, and side‐chain accuracy in homology modeling: Four approaches that performed well in CASP8 , 2009, Proteins.

[9]  T. N. Bhat,et al.  The Protein Data Bank , 2000, Nucleic Acids Res..

[10]  Bernhard Rupp,et al.  Visualizing ligand molecules in Twilight electron density. , 2013, Acta crystallographica. Section F, Structural biology and crystallization communications.

[11]  Randy J. Read,et al.  A New Generation of Crystallographic Validation Tools for the Protein Data Bank , 2011, Structure.

[12]  Haruki Nakamura,et al.  Announcing the worldwide Protein Data Bank , 2003, Nature Structural Biology.

[13]  G J Kleywegt,et al.  xdlMAPMAN and xdlDATAMAN - programs for reformatting, analysis and manipulation of biomacromolecular electron-density maps and reflection data sets. , 1996, Acta crystallographica. Section D, Biological crystallography.

[14]  Michele Magrane,et al.  UniProt Knowledgebase: a hub of integrated protein data , 2011, Database J. Biol. Databases Curation.

[15]  Gert Vriend,et al.  Increasing the precision of comparative models with YASARA NOVA—a self‐parameterizing force field , 2002, Proteins.

[16]  Eugeny V. Alexandrov,et al.  A topological method for the classification of entanglements in crystal networks , 2012 .

[17]  Randy J. Read,et al.  Overview of the CCP4 suite and current developments , 2011, Acta crystallographica. Section D, Biological crystallography.

[18]  C. Anfinsen Principles that govern the folding of protein chains. , 1973, Science.

[19]  Anthony Nicholls,et al.  Essential considerations for using protein-ligand structures in drug discovery. , 2012, Drug discovery today.

[20]  Zbigniew Dauter,et al.  Expectation bias and information content , 2013, Acta crystallographica. Section F, Structural biology and crystallization communications.

[21]  T. Jones,et al.  Between objectivity and subjectivity , 1990, Nature.

[22]  Robert M. Hanson,et al.  Jmol – a paradigm shift in crystallographic visualization , 2010 .

[23]  Sameer Velankar,et al.  Implementing an X-ray validation pipeline for the Protein Data Bank , 2012, Acta crystallographica. Section D, Biological crystallography.

[24]  Ian J. Tickle,et al.  Statistical quality indicators for electron-density maps , 2012, Acta crystallographica. Section D, Biological crystallography.

[25]  Yang Zhang,et al.  BioLiP: a semi-manually curated database for biologically relevant ligand–protein interactions , 2012, Nucleic Acids Res..

[26]  G. Scapin,et al.  (2S,3S)-3-Amino-4-(3,3-difluoropyrrolidin-1-yl)-N,N-dimethyl-4-oxo-2-(4-[1,2,4]triazolo[1,5-a]-pyridin-6-ylphenyl)butanamide: a selective alpha-amino amide dipeptidyl peptidase IV inhibitor for the treatment of type 2 diabetes. , 2006, Journal of medicinal chemistry.

[27]  Krista Joosten,et al.  PDB_REDO: constructive validation, more than just looking for errors , 2012, Acta crystallographica. Section D, Biological crystallography.

[28]  P. Bradley,et al.  Toward High-Resolution de Novo Structure Prediction for Small Proteins , 2005, Science.

[29]  T. A. Jones,et al.  The Uppsala Electron-Density Server. , 2004, Acta crystallographica. Section D, Biological crystallography.

[30]  Edwin Pozharski,et al.  Techniques, tools and best practices for ligand electron-density analysis and results from their application to deposited crystal structures. , 2013, Acta crystallographica. Section D, Biological crystallography.

[31]  Anastassis Perrakis,et al.  Automatic rebuilding and optimization of crystallographic structures in the Protein Data Bank , 2011, Bioinform..

[32]  David C Richardson,et al.  Studying and polishing the PDB's macromolecules. , 2013, Biopolymers.