Performance and accuracy of criticality calculations performed using WARP – A framework for continuous energy Monte Carlo neutron transport in general 3D geometries on GPUs

© 2017 Elsevier Ltd In this companion paper to “Algorithmic Choices in WARP – A Framework for Continuous Energy Monte Carlo Neutron Transport in General 3D Geometries on GPUs” (doi:http://dx.doi.org/10.1016/j.anucene.2014.10.039), the WARP Monte Carlo neutron transport framework for graphics processing units (GPUs) is benchmarked against production-level central processing unit (CPU) Monte Carlo neutron transport codes for both performance and accuracy. Neutron flux spectra, multiplication factors, runtimes, speedup factors, and costs of various GPU and CPU platforms running either WARP, Serpent 2.1.24, or MCNP 6.1 are compared. WARP compares well with the results of the production-level codes, and it is shown that on the newest hardware considered, GPU platforms running WARP are between 0.8 and 7.6 times as fast as CPU platforms running production codes. The GPU platforms running WARP are also shown to be between 15% and 50% as expensive to purchase and between 80% and 90% as expensive to operate as equivalent CPU platforms performing at an equal simulation rate.

[1]  N. M. Larson,et al.  ENDF/B-VII.1 Nuclear Data for Science and Technology: Cross Sections, Covariances, Fission Product Yields and Decay Data , 2011 .

[2]  Gaël Varoquaux,et al.  The NumPy Array: A Structure for Efficient Numerical Computation , 2011, Computing in Science & Engineering.

[3]  Jack J. Purdum,et al.  C programming guide , 1983 .

[4]  C. Suciu,et al.  COMPARISON OF SINGLE AND DOUBLE FLOATING POINT PRECISION PERFORMANCE FOR TESLA ARCHITECTURE GPUs , 2011 .

[5]  David M. Beazley,et al.  Automated scientific software scripting with SWIG , 2003, Future Gener. Comput. Syst..

[6]  Forrest B. Brown,et al.  Monte Carlo methods for radiation transport analysis on vector computers , 1984 .

[7]  Jack J. Dongarra,et al.  Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy , 2008, TOMS.

[8]  J. Dongarra,et al.  Exploiting the Performance of 32 bit Floating Point Arithmetic in Obtaining 64 bit Accuracy (Revisiting Iterative Refinement for Linear Systems) , 2006, ACM/IEEE SC 2006 Conference (SC'06).

[9]  Forrest B. Brown,et al.  Stationarity Modeling and Informatics-Based Diagnostics in Monte Carlo Criticality Calculations , 2005 .

[10]  Mark J. Harris,et al.  Parallel Prefix Sum (Scan) with CUDA , 2011 .

[11]  Margot Gerritsen,et al.  A CUDA Monte Carlo simulator for radiation therapy dosimetry based on Geant4 , 2014, ICS 2014.

[12]  Jasmina L. Vujic,et al.  Algorithmic choices in WARP – A framework for continuous energy Monte Carlo neutron transport in general 3D geometries on GPUs , 2015 .

[13]  William R. Martin,et al.  VECTORIZATION AND PARALLELIZATION OF A PRODUCTION REACTOR ASSEMBLY CODE , 1991 .

[14]  Christopher D. Carothers,et al.  A Monte Carlo Neutron Transport Code for Eigenvalue Calculations on a Dual-GPU System and CUDA Environment , 2012 .

[15]  Paul K. Romano,et al.  Using Fractional Cascading to Accelerate Cross Section Lookups in Monte Carlo Neutron Transport Calculations , 2015 .

[16]  Emprise Y.K. Chan Benchmarks for Intel MIC Architecture , 2014 .

[17]  J. F. Briesmeister MCNP-A General Monte Carlo N-Particle Transport Code , 1993 .

[18]  Xipeng Shen,et al.  On-the-fly elimination of dynamic irregularities for GPU computing , 2011, ASPLOS XVI.

[19]  Jaakko Leppänen,et al.  Performance of Woodcock delta-tracking in lattice physics applications using the Serpent Monte Carlo reactor physics burnup calculation code , 2010 .

[20]  J. Duderstadt,et al.  Nuclear reactor analysis , 1976 .

[21]  J. Leppänen Development of a New Monte Carlo reactor physics code , 2007 .