Bayesian ensemble refinement by replica simulations and reweighting.

We describe different Bayesian ensemble refinement methods, examine their interrelation, and discuss their practical application. With ensemble refinement, the properties of dynamic and partially disordered (bio)molecular structures can be characterized by integrating a wide range of experimental data, including measurements of ensemble-averaged observables. We start from a Bayesian formulation in which the posterior is a functional that ranks different configuration space distributions. By maximizing this posterior, we derive an optimal Bayesian ensemble distribution. For discrete configurations, this optimal distribution is identical to that obtained by the maximum entropy "ensemble refinement of SAXS" (EROS) formulation. Bayesian replica ensemble refinement enhances the sampling of relevant configurations by imposing restraints on averages of observables in coupled replica molecular dynamics simulations. We show that the strength of the restraints should scale linearly with the number of replicas to ensure convergence to the optimal Bayesian result in the limit of infinitely many replicas. In the "Bayesian inference of ensembles" method, we combine the replica and EROS approaches to accelerate the convergence. An adaptive algorithm can be used to sample directly from the optimal ensemble, without replicas. We discuss the incorporation of single-molecule measurements and dynamic observables such as relaxation parameters. The theoretical analysis of different Bayesian ensemble refinement approaches provides a basis for practical applications and a starting point for further investigations.

[1]  Benoît Roux,et al.  Extension to the weighted histogram analysis method: combining umbrella sampling with free energy calculations , 2001 .

[2]  Fabrizio Marinelli,et al.  Ensemble-Biased Metadynamics: A Molecular Simulation Method to Sample Experimental Distributions. , 2015, Biophysical journal.

[3]  Michael Habeck,et al.  Bayesian approach to inverse statistical mechanics. , 2014, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  John Kuriyan,et al.  Exploration of disorder in protein structures by X‐ray restrained molecular dynamics , 1991, Proteins.

[5]  M. Vendruscolo,et al.  Statistical mechanics of the denatured state of a protein using replica-averaged metadynamics. , 2014, Journal of the American Chemical Society.

[6]  Carlo Camilloni,et al.  Molecular dynamics simulations with replica-averaged structural restraints generate structural ensembles according to the maximum entropy principle. , 2013, The Journal of chemical physics.

[7]  Oliver F. Lange,et al.  Recognition Dynamics Up to Microseconds Revealed from an RDC-Derived Ubiquitin Ensemble in Solution , 2008, Science.

[8]  Angel E García,et al.  High-resolution reversible folding of hyperstable RNA tetraloops using molecular dynamics simulations , 2013, Proceedings of the National Academy of Sciences.

[9]  Massimiliano Bonomi,et al.  Metainference: A Bayesian inference method for heterogeneous systems , 2015, Science Advances.

[10]  Kresten Lindorff-Larsen,et al.  Experimental parameterization of an energy function for the simulation of unfolded proteins. , 2008, Biophysical journal.

[11]  Carlo Camilloni,et al.  Replica-Averaged Metadynamics. , 2013, Journal of chemical theory and computation.

[12]  Kresten Lindorff-Larsen,et al.  Combining Experiments and Simulations Using the Maximum Entropy Principle , 2014, PLoS Comput. Biol..

[13]  M. DePristo,et al.  Simultaneous determination of protein structure and dynamics , 2005, Nature.

[14]  G. Crooks Beyond Boltzmann-Gibbs statistics: maximum entropy hyperensembles out of equilibrium. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[15]  Collin M. Stultz,et al.  Modeling Intrinsically Disordered Proteins with Bayesian Statistics , 2010, Journal of the American Chemical Society.

[16]  Carlo Camilloni,et al.  Erratum: “Molecular dynamics simulations with replica-averaged structural restraints generate structural ensembles according to the maximum entropy principle” [J. Chem. Phys. 138, 094112 (2013)] , 2013 .

[17]  Lee-Ping Wang,et al.  Systematic Parametrization of Polarizable Force Fields from Quantum Chemistry Data. , 2013, Journal of chemical theory and computation.

[18]  Greg L. Hura,et al.  Structure and flexibility within proteins as identified through small angle X-ray scattering. , 2009, General physiology and biophysics.

[19]  Nathan Schmid,et al.  Time-averaged order parameter restraints in molecular dynamics simulations , 2014, Journal of biomolecular NMR.

[20]  Dianne P. O'Leary,et al.  The Use of the L-Curve in the Regularization of Discrete Ill-Posed Problems , 1993, SIAM J. Sci. Comput..

[21]  Gerhard Hummer,et al.  Solution structure of the ESCRT-I and -II supercomplex: implications for membrane budding and scission. , 2012, Structure.

[22]  R. Crehuet,et al.  Application of the maximum entropy principle to determine ensembles of intrinsically disordered proteins from residual dipolar couplings. , 2014, Physical chemistry chemical physics : PCCP.

[23]  John D Chodera,et al.  On the Use of Experimental Observations to Bias Simulated Ensembles. , 2012, Journal of chemical theory and computation.

[24]  G. Hummer,et al.  Optimized molecular dynamics force fields applied to the helix-coil transition of polypeptides. , 2009, The journal of physical chemistry. B.

[25]  E. Jaynes On the rationale of maximum-entropy methods , 1982, Proceedings of the IEEE.

[26]  Michele Vendruscolo,et al.  Determination of the structures of distinct transition state ensembles for a β-sheet peptide with parallel folding pathways , 2002 .

[27]  Collin M. Stultz,et al.  Constructing ensembles for intrinsically disordered proteins. , 2011, Current opinion in structural biology.

[28]  Pilar Cossio,et al.  Bayesian analysis of individual electron microscopy images: towards structures of dynamic and heterogeneous biomolecular assemblies. , 2013, Journal of structural biology.

[29]  G. Hummer,et al.  SAXS ensemble refinement of ESCRT-III CHMP3 conformational transitions. , 2011, Structure.

[30]  Rhiju Das,et al.  Bayesian Energy Landscape Tilting: Towards Concordant Models of Molecular Ensembles , 2014, bioRxiv.

[31]  Michael Nilges,et al.  Materials and Methods Som Text Figs. S1 to S6 References Movies S1 to S5 Inferential Structure Determination , 2022 .

[32]  Gerhard Hummer,et al.  A maximum entropy approach to the study of residue‐specific backbone angle distributions in α‐synuclein, an intrinsically disordered protein , 2014, Protein science : a publication of the Protein Society.

[33]  Gerhard Hummer,et al.  Structural basis of p38α regulation by hematopoietic tyrosine phosphatase. , 2011, Nature chemical biology.

[34]  Gerhard Hummer,et al.  MERA: a webserver for evaluating backbone torsion angle distributions in dynamic and disordered proteins from NMR data , 2015, Journal of Biomolecular NMR.

[35]  Wei Yang,et al.  Catalytic mechanism of RNA backbone cleavage by ribonuclease H from quantum mechanics/molecular mechanics simulations. , 2011, Journal of the American Chemical Society.

[36]  S. Gull,et al.  Image reconstruction from incomplete and noisy data , 1978, Nature.

[37]  Gunnar F Schröder,et al.  Hybrid methods for macromolecular structure determination: experiment with expectations. , 2015, Current opinion in structural biology.

[38]  Kresten Lindorff-Larsen,et al.  Probabilistic Determination of Native State Ensembles of Proteins. , 2014, Journal of chemical theory and computation.

[39]  Peter G Wolynes,et al.  Simulation studies of the fidelity of biomolecular structure ensemble recreation. , 2006, The Journal of chemical physics.

[40]  Gerhard Hummer,et al.  Solution structure of the ESCRT-I complex by small-angle X-ray scattering, EPR, and FRET spectroscopy , 2011, Proceedings of the National Academy of Sciences.

[41]  Michael R. Shirts,et al.  Statistically optimal analysis of samples from multiple equilibrium states. , 2008, The Journal of chemical physics.

[42]  W F van Gunsteren,et al.  Calculation of NMR-relaxation parameters for flexible molecules from molecular dynamics simulations , 2001, Journal of biomolecular NMR.

[43]  C. Dobson,et al.  Mapping long-range interactions in alpha-synuclein using spin-label NMR and ensemble molecular dynamics simulations. , 2005, Journal of the American Chemical Society.

[44]  Andrej Sali,et al.  Integrative Structural Biology , 2013, Science.

[45]  J H Prestegard,et al.  A dynamic model for the structure of acyl carrier protein in solution. , 1989, Biochemistry.

[46]  Gerhard Hummer,et al.  Slow protein conformational dynamics from multiple experimental structures: the helix/sheet transition of arc repressor. , 2005, Structure.

[47]  Andrej Sali,et al.  Uncertainty in integrative structural modeling. , 2014, Current opinion in structural biology.

[48]  Haruki Nakamura,et al.  Outcome of the First wwPDB Hybrid/Integrative Methods Task Force Workshop. , 2015, Structure.

[49]  M. Vendruscolo,et al.  Determination of protein structures consistent with NMR order parameters. , 2004, Journal of the American Chemical Society.

[50]  Andrej Sali,et al.  Recovering a representative conformational ensemble from underdetermined macromolecular structural data. , 2013, Journal of the American Chemical Society.

[51]  R. Brüschweiler,et al.  Iterative Optimization of Molecular Mechanics Force Fields from NMR Data of Full-Length Proteins. , 2011, Journal of chemical theory and computation.

[52]  Gregory A Voth,et al.  Designing free energy surfaces that match experimental data with metadynamics. , 2015, Journal of chemical theory and computation.

[53]  Benoît Roux,et al.  On the statistical equivalence of restrained-ensemble simulations with the maximum entropy method. , 2013, The Journal of chemical physics.

[54]  Gregory A Voth,et al.  Efficient and Minimal Method to Bias Molecular Simulations with Experimental Data. , 2014, Journal of chemical theory and computation.