WESTPA: an interoperable, highly scalable software package for weighted ensemble simulation and analysis.

The weighted ensemble (WE) path sampling approach orchestrates an ensemble of parallel calculations with intermittent communication to enhance the sampling of rare events, such as molecular associations or conformational changes in proteins or peptides. Trajectories are replicated and pruned in a way that focuses computational effort on underexplored regions of configuration space while maintaining rigorous kinetics. To enable the simulation of rare events at any scale (e.g., atomistic, cellular), we have developed an open-source, interoperable, and highly scalable software package for the execution and analysis of WE simulations: WESTPA (The Weighted Ensemble Simulation Toolkit with Parallelization and Analysis). WESTPA scales to thousands of CPU cores and includes a suite of analysis tools that have been implemented in a massively parallel fashion. The software has been designed to interface conveniently with any dynamics engine and has already been used with a variety of molecular dynamics (e.g., GROMACS, NAMD, OpenMM, AMBER) and cell-modeling packages (e.g., BioNetGen, MCell). WESTPA has been in production use for over a year, and its utility has been demonstrated for a broad set of problems, ranging from atomically detailed host–guest associations to nonspatial chemical kinetics of cellular signaling networks. The following describes the design and features of WESTPA, including the facilities it provides for running WE simulations and storing and analyzing WE simulation data, as well as examples of input and output.
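The replication and pruning of trajectories described above can be illustrated with a minimal sketch of a bin-based split/merge resampling step. This is not WESTPA's API or its actual implementation. The names `resample_bin`, `resample`, `bin_of`, and `target` are hypothetical, and the sketch assumes strictly positive weights, a fixed target number of walkers per occupied bin, and the simplest split (heaviest first) and merge (two lightest) policies.

```python
import random


def resample_bin(walkers, target, rng=random):
    """Resample one bin to `target` walkers while conserving total weight.

    `walkers` is a list of (state, weight) pairs with positive weights.
    All names here are illustrative, not part of any WESTPA interface.
    """
    if target < 1:
        raise ValueError("target must be at least 1")
    walkers = list(walkers)
    if not walkers:
        return walkers  # an unoccupied bin stays unoccupied

    # Split (replicate): replace the heaviest walker with two copies,
    # each carrying half of its weight.
    while len(walkers) < target:
        i = max(range(len(walkers)), key=lambda k: walkers[k][1])
        state, w = walkers.pop(i)
        walkers += [(state, w / 2), (state, w / 2)]

    # Merge (prune): combine the two lightest walkers. The survivor is
    # chosen with probability proportional to weight and inherits the sum.
    while len(walkers) > target:
        walkers.sort(key=lambda sw: sw[1])
        (s1, w1), (s2, w2) = walkers[0], walkers[1]
        survivor = s1 if rng.random() < w1 / (w1 + w2) else s2
        walkers = [(survivor, w1 + w2)] + walkers[2:]

    return walkers


def resample(walkers, bin_of, target):
    """Group walkers by bin, then resample each occupied bin independently."""
    bins = {}
    for state, w in walkers:
        bins.setdefault(bin_of(state), []).append((state, w))
    out = []
    for members in bins.values():
        out.extend(resample_bin(members, target))
    return out
```

Because splitting divides a weight and merging sums weights, the total probability is unchanged by resampling, which is the property that lets the ensemble concentrate walkers in sparsely populated bins without biasing the kinetics.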
