Estimation of Uncertainties in the Global Distance Test (GDT_TS) for CASP Models

The Critical Assessment of techniques for protein Structure Prediction (CASP) is a community-wide blind experiment designed to reveal the best achievements in structure modeling. Assessors have used the Global Distance Test (GDT_TS) measure to quantify prediction performance since CASP3 in 1998. However, it is difficult to tell whether score differences between close models are significant, because no uncertainty estimates exist for this measure. Here, we used the atomic fluctuations caused by structural flexibility to estimate the uncertainty of GDT_TS scores. Structures determined by nuclear magnetic resonance (NMR) are deposited as ensembles of alternative conformers that reflect structural flexibility, whereas standard X-ray refinement produces a single static structure that averages the dynamic ensemble over time and space. To recapitulate the structurally heterogeneous ensemble in the crystal lattice, we performed time-averaged refinement on X-ray datasets to generate structural ensembles for our GDT_TS uncertainty analysis. Our study demonstrates that time-averaged refinement produces structure ensembles that agree better with the experimental datasets than the averaged X-ray structures with B-factors. The uncertainty of the GDT_TS scores, quantified by their standard deviations (SDs), increases for scores lower than 50 and 70, with maximum SDs of 0.3 and 1.23, for X-ray and NMR structures, respectively. We also applied our procedure to the high-accuracy version of the GDT-based score and obtained similar results with slightly higher SDs. To facilitate score comparisons by the community, we developed a user-friendly web server that produces structure ensembles for NMR and X-ray structures and is accessible at http://prodata.swmed.edu/SEnCS. Our work helps to identify the significance of GDT_TS score differences and provides structure ensembles for estimating the SDs of any score.
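The idea of scoring one model against every member of a reference ensemble and reporting the spread can be illustrated with a minimal sketch. This is not the authors' pipeline: the function names `gdt_score` and `gdt_uncertainty` are hypothetical, the inputs are assumed to be equal-length lists of C-alpha coordinates that have already been superimposed, and the score is computed from that single superposition. The standard GDT_TS, as computed by programs such as LGA, searches over many superpositions to maximize the residue count at each cutoff, so the values below are at best a lower bound on it.

```python
import math
import statistics

# Distance cutoffs in angstroms that define GDT_TS.
CUTOFFS_TS = (1.0, 2.0, 4.0, 8.0)


def gdt_score(model, reference, cutoffs=CUTOFFS_TS):
    """Simplified GDT-style score (0-100) for pre-superimposed coordinates.

    model, reference: equal-length sequences of (x, y, z) tuples, one per
    aligned residue. Assumption: no superposition search is performed here.
    """
    if not model or len(model) != len(reference):
        raise ValueError("model and reference must be non-empty and equal length")
    # Per-residue distance between model and reference positions.
    dists = [math.dist(m, r) for m, r in zip(model, reference)]
    # Fraction of residues within each cutoff, averaged over the cutoffs.
    fractions = [sum(d <= c for d in dists) / len(dists) for c in cutoffs]
    return 100.0 * sum(fractions) / len(cutoffs)


def gdt_uncertainty(model, ensemble, cutoffs=CUTOFFS_TS):
    """Score the model against each conformer; return (mean, sample SD).

    ensemble: sequence of at least two reference conformers.
    """
    scores = [gdt_score(model, conformer, cutoffs) for conformer in ensemble]
    return statistics.mean(scores), statistics.stdev(scores)


if __name__ == "__main__":
    # Toy four-residue example with two reference conformers.
    model = [(0.0, 0.0, 0.0)] * 4
    conf_a = [(0.5, 0.0, 0.0), (1.5, 0.0, 0.0), (3.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
    conf_b = [(0.0, 0.0, 0.0)] * 4
    mean, sd = gdt_uncertainty(model, [conf_a, conf_b])
    print(f"mean = {mean:.2f}, SD = {sd:.2f}")
```

Passing `cutoffs=(0.5, 1.0, 2.0, 4.0)` gives the corresponding sketch for the high-accuracy variant, which uses halved distance thresholds. The sample standard deviation is used here as one reasonable choice; the abstract does not state which estimator the authors applied.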
