Estimating error in diffusion coefficients derived from molecular dynamics simulations.

The computationally expensive nature of molecular dynamics simulation limits the access to length (nanometer) and time scales (nanosecond) that are orders of magnitude smaller than the experiment it models. This limitation warrants a careful estimation of statistical uncertainty associated with the properties calculated from these simulations. The assumption that a simulation is long enough so that the ergodic hypothesis applies is often invoked in the literature for the computation of properties of interest from a single molecular dynamics simulation. Here, we demonstrate that making this assumption without validation results in poor estimates of the self-diffusion coefficient from a single molecular dynamics simulation of Lennard-Jones fluid. This problem is shown to be even more severe when the diffusion coefficient of macromolecules is calculated from a single molecular dynamics simulation. We have shown that conducting multiple independent simulations is necessary to obtain reliable estimates of diffusion coefficients and their associated statistical uncertainties. We show that even a “routine” calculation of the self-diffusion coefficient for a Lennard-Jones fluid, as determined from a linear fit of the mean squared displacement of particles as a function of time, violates the key assumptions of linear regression. A rigorous approach for addressing these issues is presented.

[1]  Richard F. Gunst,et al.  Applied Regression Analysis , 1999, Technometrics.

[2]  H. G. Petersen,et al.  Error estimates on averages of correlated data , 1989 .

[3]  Berend Smit,et al.  Understanding molecular simulation: from algorithms to applications , 1996 .

[4]  Robert Zwanzig,et al.  STATISTICAL ERROR DUE TO FINITE TIME AVERAGING IN COMPUTER EXPERIMENTS. , 1969 .

[5]  Christopher P Calderon,et al.  Extracting Kinetic and Stationary Distribution Information from Short MD Trajectories via a Collection of Surrogate Diffusion Models. , 2009, Journal of chemical theory and computation.

[6]  Gerhard Hummer,et al.  System-Size Dependence of Diffusion Coefficients and Viscosities from Molecular Dynamics Simulations with Periodic Boundary Conditions , 2004 .

[7]  M. McGrath,et al.  Time-Dependent Properties of Liquid Water:  A Comparison of Car-Parrinello and Born-Oppenheimer Molecular Dynamics Simulations. , 2005, Journal of chemical theory and computation.

[8]  Ken A Dill,et al.  Use of the Weighted Histogram Analysis Method for the Analysis of Simulated and Parallel Tempering Simulations. , 2007, Journal of chemical theory and computation.

[9]  C. Brooks,et al.  Large-scale allosteric conformational transitions of adenylate kinase appear to involve a population-shift mechanism , 2007, Proceedings of the National Academy of Sciences.

[10]  H. Qian,et al.  Single particle tracking. Analysis of diffusion and flow in two-dimensional systems. , 1991, Biophysical journal.

[11]  Burkhard Dünweg,et al.  Molecular dynamics algorithms and hydrodynamic screening , 1993 .

[12]  Molecular dynamics simulation of fractal aggregate diffusion. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  From Ornstein-Uhlenbeck dynamics to long-memory processes and fractional Brownian motion. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  R. Jullien,et al.  A cluster-cluster aggregation model with tunable fractal dimension , 1994 .

[15]  Steve Plimpton,et al.  Fast parallel algorithms for short-range molecular dynamics , 1993 .

[16]  C. Calderon Estimation and Inference of Diffusion Coefficients in Complex Biomolecular Environments. , 2011, Journal of chemical theory and computation.

[17]  Kurt Kremer,et al.  Molecular dynamics simulation of a polymer chain in solution , 1993 .