Bayesian calibration of force-fields from experimental data: TIP4P water.

Molecular dynamics (MD) simulations give access to equilibrium structures and dynamic properties given an ergodic sampling and an accurate force-field. The force-field parameters are calibrated to reproduce properties measured by experiments or simulations. The main contribution of this paper is an approximate Bayesian framework for the calibration and uncertainty quantification of the force-field parameters, without assuming parameter uncertainty to be Gaussian. To this aim, since the likelihood function of the MD simulation models is intractable in the absence of Gaussianity assumption, we use a likelihood-free inference scheme known as approximate Bayesian computation (ABC) and propose an adaptive population Monte Carlo ABC algorithm, which is illustrated to converge faster and scales better than the previously used ABCsubsim algorithm for the calibration of the force-field of a helium system. The second contribution is the adaptation of ABC algorithms for High Performance Computing to MD simulations within the Python ecosystem ABCpy. This adaptation includes a novel use of a dynamic allocation scheme for Message Passing Interface (MPI). We illustrate the performance of the developed methodology to learn posterior distribution and Bayesian estimates of Lennard-Jones force-field parameters of helium and the TIP4P system of water implemented for both simulated and experimental datasets collected using neutron and X-ray diffraction. For simulated data, the Bayesian estimate is in close agreement with the true parameter value used to generate the dataset. For experimental as well as for simulated data, the Bayesian posterior distribution shows a strong correlation pattern between the force-field parameters. Providing an estimate of the entire posterior distribution, our methodology also allows us to perform the uncertainty quantification of model prediction. This research opens up the possibility to rigorously calibrate force-fields from available experimental datasets of any structural and dynamic property.

[1]  C. Vega,et al.  A general purpose model for the condensed phases of water: TIP4P/2005. , 2005, The Journal of chemical physics.

[2]  H. Ohtaki,et al.  Structure and dynamics of hydrated ions , 1993 .

[3]  J. Oden,et al.  Adaptive selection and validation of models of complex systems in the presence of uncertainty , 2017, Research in the Mathematical Sciences.

[4]  Jukka-Pekka Onnela,et al.  ABCpy: A High-Performance Computing Perspective to Approximate Bayesian Computation. , 2017 .

[5]  Fabien Cailliez,et al.  A critical review of statistical calibration/prediction models handling data inconsistency and model inadequacy , 2016, 1611.04376.

[6]  Guillaume Deffuant,et al.  Adaptive approximate Bayesian computation for complex models , 2011, Computational Statistics.

[7]  Thomas E. Currie,et al.  War, space, and the evolution of Old World complex societies , 2013, Proceedings of the National Academy of Sciences.

[8]  Carlos Vega,et al.  Simulating water with rigid non-polarizable models: a general perspective. , 2011, Physical chemistry chemical physics : PCCP.

[9]  W. L. Jorgensen,et al.  Comparison of simple potential functions for simulating liquid water , 1983 .

[10]  C. Brooks Computer simulation of liquids , 1989 .

[11]  T. Darden,et al.  Particle mesh Ewald: An N⋅log(N) method for Ewald sums in large systems , 1993 .

[12]  F. RIZZI,et al.  Uncertainty Quantification in MD Simulations. Part I: Forward Propagation , 2012, Multiscale Model. Simul..

[13]  Peter G Bolhuis,et al.  A one-way shooting algorithm for transition path sampling of asymmetric barriers. , 2016, The Journal of chemical physics.

[14]  Ronald L. Graham,et al.  Bounds for certain multiprocessing anomalies , 1966 .

[15]  M. Parrinello,et al.  Polymorphic transitions in single crystals: A new molecular dynamics method , 1981 .

[16]  Guillermo Rus-Carlborg,et al.  Approximate Bayesian Computation by Subset Simulation , 2014, SIAM J. Sci. Comput..

[17]  Costas Papadimitriou,et al.  Bayesian uncertainty quantification and propagation in molecular dynamics simulations: a high performance computing framework. , 2012, The Journal of chemical physics.

[18]  Paul Fearnhead,et al.  Constructing summary statistics for approximate Bayesian computation: semi‐automatic approximate Bayesian computation , 2012 .

[19]  J. Tinsley Oden,et al.  Selection, calibration, and validation of coarse-grained models of atomistic systems , 2015 .

[20]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[21]  Pascal Pernot,et al.  Calibration of forcefields for molecular simulation: Sequential design of computer experiments for building cost‐efficient kriging metamodels , 2014, J. Comput. Chem..

[22]  Alan K. Soper,et al.  The radial distribution functions of water and ice from 220 to 673 K and at pressures up to 400 MPa , 2000 .

[23]  Jean-Michel Marin,et al.  Approximate Bayesian computational methods , 2011, Statistics and Computing.

[24]  Bai Jiang,et al.  Learning Summary Statistic for Approximate Bayesian Computation via Deep Neural Network , 2015, 1510.02175.

[25]  B. Berne,et al.  Role of the active-site solvent in the thermodynamics of factor Xa ligand binding. , 2008, Journal of the American Chemical Society.

[26]  Richard A. Messerly,et al.  Uncertainty quantification and propagation of errors of the Lennard-Jones 12-6 parameters for n-alkanes. , 2017, The Journal of chemical physics.

[27]  M. Shiga,et al.  Rapid estimation of elastic constants by molecular dynamics simulation under constant stress , 2004 .

[28]  Richard Lavery,et al.  Significance of Molecular Dynamics Simulations for Life Sciences , 2014 .

[29]  J. Møller Discussion on the paper by Feranhead and Prangle , 2012 .

[30]  G. Torrie,et al.  Nonphysical sampling distributions in Monte Carlo free-energy estimation: Umbrella sampling , 1977 .

[31]  M. Gutmann,et al.  Fundamentals and Recent Developments in Approximate Bayesian Computation , 2016, Systematic biology.

[32]  Michael W. Mahoney,et al.  A five-site model for liquid water and the reproduction of the density anomaly by rigid, nonpolarizable potential functions , 2000 .

[33]  A. Laio,et al.  Free-energy landscape for beta hairpin folding from combined parallel tempering and metadynamics. , 2006, Journal of the American Chemical Society.

[34]  A. Laio,et al.  Escaping free-energy minima , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[35]  Pascal Pernot,et al.  Statistical approaches to forcefield calibration and prediction uncertainty in molecular simulation. , 2011, The Journal of chemical physics.

[36]  Berk Hess,et al.  LINCS: A linear constraint solver for molecular simulations , 1997, J. Comput. Chem..

[37]  T. Monz,et al.  Real-time dynamics of lattice gauge theories with a few-qubit quantum computer , 2016, Nature.

[38]  Anders Nilsson,et al.  Benchmark oxygen-oxygen pair-distribution function of ambient water from x-ray diffraction measurements with a wide Q-range. , 2013, The Journal of chemical physics.

[39]  LarssonPer,et al.  GROMACS 4.5 , 2013 .

[40]  P G Bolhuis,et al.  Dynamics of Hydration Water around Native and Misfolded α-Lactalbumin. , 2016, The journal of physical chemistry. B.

[41]  Max Welling,et al.  GPS-ABC: Gaussian Process Surrogate Approximate Bayesian Computation , 2014, UAI.

[42]  S. White,et al.  The EAGLE project: Simulating the evolution and assembly of galaxies and their environments , 2014, 1407.7040.

[43]  Berend Smit,et al.  Understanding Molecular Simulation , 2001 .

[44]  Jukka-Pekka Onnela,et al.  ABCpy: A User-Friendly, Extensible, and Parallel Library for Approximate Bayesian Computation , 2017, PASC.

[45]  Hoon Kim,et al.  Monte Carlo Statistical Methods , 2000, Technometrics.

[46]  H. Najm,et al.  On the Statistical Calibration of Physical Models: STATISTICAL CALIBRATION OF PHYSICAL MODELS , 2015 .

[47]  Ritabrata Dutta,et al.  Likelihood-free inference via classification , 2014, Stat. Comput..

[48]  P. Bolhuis,et al.  Water structure and dynamics in the hydration layer of a type III anti-freeze protein. , 2018, Physical chemistry chemical physics : PCCP.

[49]  Costas Papadimitriou,et al.  Data driven, predictive molecular dynamics for nanoscale flow simulations under uncertainty. , 2013, The journal of physical chemistry. B.

[50]  Simon R. Phillpot,et al.  Uncertainty Quantification in Multiscale Simulation of Materials: A Prospective , 2013 .

[51]  Peter M. Kasson,et al.  GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit , 2013, Bioinform..

[52]  Jean-Marie Cornuet,et al.  ABC model choice via random forests , 2014, 1406.6288.