Simulation of reaction diffusion processes over biologically relevant size and time scales using multi-GPU workstations

Simulation of in vivo cellular processes with the reaction-diffusion master equation (RDME) is a computationally expensive task. Our previous software enabled simulation of inhomogeneous biochemical systems for small bacteria over long time scales using the MPD-RDME method on a single GPU. Simulations of larger eukaryotic systems exceed the on-board memory capacity of individual GPUs, and long time simulations of modest-sized cells such as yeast are impractical on a single GPU. We present a new multi-GPU parallel implementation of the MPD-RDME method based on a spatial decomposition approach that supports dynamic load balancing for workstations containing GPUs of varying performance and memory capacity. We take advantage of high-performance features of CUDA for peer-to-peer GPU memory transfers and evaluate the performance of our algorithms on state-of-the-art GPU devices. We present parallel e ciency and performance results for simulations using multiple GPUs as system size, particle counts, and number of reactions grow. We also demonstrate multi-GPU performance in simulations of the Min protein system in E. coli. Moreover, our multi-GPU decomposition and load balancing approach can be generalized to other lattice-based problems.

[1]  A. van Oudenaarden,et al.  Using Gene Expression Noise to Understand Gene Regulation , 2012, Science.

[2]  John E. Stone,et al.  Long time-scale simulations of in vivo diffusion using GPU hardware , 2009, 2009 IEEE International Symposium on Parallel & Distributed Processing.

[3]  P A de Boer,et al.  Rapid pole-to-pole oscillation of a protein required for directing division to the middle of Escherichia coli. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Satoshi Matsuoka,et al.  Physis: An implicitly parallel programming model for stencil computations on large-scale GPU-accelerated supercomputers , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[5]  K. McNeil,et al.  Correlations in stochastic theories of chemical reactions , 1976 .

[6]  John E. Stone,et al.  GPU-accelerated computation and interactive display of molecular orbitals , 2010 .

[7]  Filipe Tostevin,et al.  A stochastic model of Min oscillations in Escherichia coli and Min protein segregation during cell division , 2005, Physical biology.

[8]  Klaus Schulten,et al.  Adapting a message-driven parallel application to GPU-accelerated clusters , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.

[9]  Bastien Chopard,et al.  Cellular Automata Modeling of Physical Systems: Index , 1998 .

[10]  David Fange,et al.  Noise-Induced Min Phenotypes in E. coli , 2006, PLoS Comput. Biol..

[11]  Julio O. Ortiz,et al.  Noise Contributions in an Inducible Genetic Switch: A Whole-Cell Simulation Study , 2011, PLoS Comput. Biol..

[12]  Daniel T Gillespie,et al.  Stochastic simulation of chemical kinetics. , 2007, Annual review of physical chemistry.

[13]  Klaus Schulten,et al.  Multilevel summation of electrostatic potentials using graphics processing units , 2009, Parallel Comput..

[14]  Michal Kierzynka,et al.  CaKernel --A parallel application programming framework for heterogenous computing architectures , 2011 .

[15]  Bing Wang,et al.  Abstract Next Subvolume Method: A logical process-based approach for spatial stochastic simulation of chemical reactions , 2011, Comput. Biol. Chem..

[16]  Samuel A. Isaacson,et al.  The Reaction-Diffusion Master Equation as an Asymptotic Approximation of Diffusion to a Small Target , 2009, SIAM J. Appl. Math..

[17]  John E. Stone,et al.  GPU clusters for high-performance computing , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[18]  S. Isaacson,et al.  Reaction-diffusion master equation, diffusion-limited reactions, and singular potentials. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Klaus Schulten,et al.  GPU acceleration of cutoff pair potentials for molecular modeling applications , 2008, CF '08.

[20]  Jeffrey S. Vetter,et al.  Quantifying NUMA and contention effects in multi-GPU systems , 2011, GPGPU-4.

[21]  Jaap A. Kaandorp,et al.  Spatial stochastic modelling of the phosphoenolpyruvate-dependent phosphotransferase (PTS) pathway in Escherichia coli , 2006, Bioinform..

[22]  Kevin R. Sanft,et al.  Legitimacy of the stochastic Michaelis-Menten approximation. , 2011, IET systems biology.

[23]  John E. Stone,et al.  Lattice microbes: High‐performance stochastic simulation method for the reaction‐diffusion master equation , 2013, J. Comput. Chem..

[24]  Paulius Micikevicius,et al.  3D finite difference computation on GPUs using CUDA , 2009, GPGPU-2.

[25]  Murray Cole,et al.  PARTANS: An autotuning framework for stencil computation on multi-GPU systems , 2013, TACO.