Experimental Inferential Structure Determination of Ensembles for Intrinsically Disordered Proteins.

We develop a Bayesian approach to determine the most probable structural ensemble model from candidate structures for intrinsically disordered proteins (IDPs). The approach takes full advantage of NMR chemical shift and J-coupling data, their known errors and variances, and the quality of the theoretical back-calculation from structure to experimental observables. It differs from previous formulations in that it optimizes experimental and back-calculation nuisance parameters, which are treated as random variables with known distributions, rather than optimizing structures or ensemble weights or relying on a reference ensemble. The resulting experimental inferential structure determination (EISD) method is size extensive with O(N) scaling, where N is the number of structures, which allows for the rapid ranking of large ensembles comprising tens of thousands of conformations. We apply the EISD approach to singular folded proteins and a corresponding set of ∼25 000 misfolded states to illustrate the problems that can arise when using Boltzmann-weighted priors. We then apply the EISD method to rank the IDP ensembles most consistent with the NMR data and show that the primary source of error in ranking or creating good IDP ensembles is the poor back-calculation from structure to simulated experimental observable. We show that reducing the uncertainty of the back-calculation error by a factor of 3 can improve the discrimination among qualitatively different IDP ensembles for the amyloid-beta peptide.
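To make the scaling and the role of back-calculation uncertainty concrete, the following is a minimal sketch rather than the EISD implementation. It assumes that both the experimental error and the back-calculation error are zero-mean Gaussians whose variances add, and it scores an ensemble by the log-likelihood of its ensemble-averaged observables. The real method optimizes the nuisance parameters instead of combining them in this closed form. The function name `ensemble_log_likelihood`, the toy ensembles, and all numerical values are hypothetical.

```python
import math


def ensemble_log_likelihood(back_calc, d_exp, sigma_exp, sigma_bc):
    """Gaussian log-likelihood of experimental data given an ensemble.

    back_calc : list of N structures, each a list of M back-calculated observables
    d_exp     : list of M experimental values
    sigma_exp : assumed experimental standard deviation (hypothetical value)
    sigma_bc  : assumed back-calculation standard deviation (hypothetical value)
    """
    n = len(back_calc)
    m = len(d_exp)

    # Single pass over the structures: the cost grows linearly with N.
    sums = [0.0] * m
    for structure in back_calc:
        for j, value in enumerate(structure):
            sums[j] += value

    # Assumption: independent Gaussian errors, so the variances add.
    var = sigma_exp ** 2 + sigma_bc ** 2

    logp = 0.0
    for j in range(m):
        resid = d_exp[j] - sums[j] / n
        logp += -0.5 * resid ** 2 / var - 0.5 * math.log(2.0 * math.pi * var)
    return logp


# Toy data (hypothetical): ensemble_a is close to experiment, ensemble_b is not.
d_exp = [1.0, 2.0]
ensemble_a = [[1.0, 2.0], [1.2, 2.2]]
ensemble_b = [[2.0, 3.0], [2.0, 3.0]]


def gap(sigma_bc, sigma_exp=0.1):
    """Log-likelihood difference between the two toy ensembles."""
    return (ensemble_log_likelihood(ensemble_a, d_exp, sigma_exp, sigma_bc)
            - ensemble_log_likelihood(ensemble_b, d_exp, sigma_exp, sigma_bc))


gap_large_error = gap(sigma_bc=0.9)
gap_small_error = gap(sigma_bc=0.3)  # back-calculation uncertainty reduced 3-fold
print(gap_large_error, gap_small_error)
```

Under these assumptions the normalization terms cancel in the difference, so the gap between the two ensembles is inversely proportional to the combined variance. Reducing the back-calculation uncertainty therefore widens the gap, which is the qualitative effect described above for discriminating among ensembles.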
