Parallel Algorithms for Monte Carlo Particle Transport Simulation on Exascale Computing Architectures

Monte Carlo particle transport methods are being considered as a viable option for high-fidelity simulation of nuclear reactors. While Monte Carlo methods offer several potential advantages over deterministic methods, there are a number of algorithmic shortcomings that would prevent their immediate adoption for full-core analyses. In this thesis, algorithms are proposed both to ameliorate the degradation in parallal efficiency typically observed for large numbers of processors and to offer a means of decomposing large tally data that will be needed for reactor analysis. A nearest-neighbor fission bank algorithm was proposed and subsequently implemented in the OpenMC Monte Carlo code. A theoretical analysis of the communication pattern shows that the expected cost is O( p N) whereas traditional fission bank algorithms are O(N) at best. The algorithm was tested on two supercomputers, the Intrepid Blue Gene/P and the Titan Cray XK7, and demonstrated nearly linear parallel scaling up to 163,840 processor cores on a full-core benchmark problem. An algorithm for reducing network communication arising from tally reduction was analyzed and implemented in OpenMC. The proposed algorithm groups only particle histories on a single processor into batches for tally purposes — in doing so it prevents all network communication for tallies until the very end of the simulation. The algorithm was tested, again on a full-core benchmark, and shown to reduce network communication substantially. A model was developed to predict the impact of load imbalances on the performance of domain decomposed simulations. The analysis demonstrated that load imbalances in domain decomposed simulations arise from two distinct phenomena: non-uniform particle densities and non-uniform spatial leakage. The dominant performance penalty for domain decomposition was shown to come from these physical effects rather than insufficient network bandwidth or high latency. The model predictions were verified with measured data from simulations in OpenMC on a full-core benchmark problem. Finally, a novel algorithm for decomposing large tally data was proposed, analyzed, and implemented/tested in OpenMC. The algorithm relies on disjoint sets of compute 3 processes and tally servers. The analysis showed that for a range of parameters relevant to LWR analysis, the tally server algorithm should perform with minimal overhead. Tests were performed on Intrepid and Titan and demonstrated that the algorithm did indeed perform well over a wide range of parameters. Thesis Supervisor: Benoit Forget Title: Assistant Professor of Nuclear Science and Engineering Thesis Reader: Kord S. Smith Title: KEPCO Professor of the Practice of Nuclear Science and Engineering

[1]  J. F. Briesmeister MCNP-A General Monte Carlo N-Particle Transport Code , 1993 .

[2]  R. Roy,et al.  Reactor Core Methods , 2010 .

[3]  T. M. Sutton,et al.  Comparison of Some Monte Carlo Models for Bound Hydrogen Scattering , 2009 .

[4]  F. B. Brown,et al.  Implementation of the probability table method in a continuous-energy Monte Carlo code system , 1998 .

[5]  Thomas M. Evans,et al.  Comparison of four parallel algorithms for domain decomposed implicit Monte Carlo , 2006, J. Comput. Phys..

[6]  G. L. Squires,et al.  Introduction to the Theory of Thermal Neutron Scattering: Neutron optics , 1978 .

[7]  R J Procassini,et al.  ENHANCEMENTS TO THE COMBINATORIAL GEOMETRY PARTICLE TRACKER IN THE MERCURY MONTE CARLO TRANSPORT CODE: EMBEDDED MESHES AND DOMAIN DECOMPOSITION , 2009 .

[8]  Forrest B. Brown,et al.  INSTRUCTIVE CONCEPTS ABOUT THE MONTE CARLO FISSION SOURCE DISTRIBUTION , 2009 .

[9]  T. H. Trumbull Treatment of Nuclear Data for Transport Problems Containing Detailed Temperature Distributions , 2006 .

[10]  Forrest B. Brown,et al.  High performance computing and Monte Carlo , 2004 .

[11]  William Gropp,et al.  Skjellum using mpi: portable parallel programming with the message-passing interface , 1994 .

[12]  Bryan E. Toth,et al.  A NOVEL SOURCE CONVERGENCE ACCELERATION SCHEME FOR MONTE CARLO CRITICALITY CALCULATIONS, PART II: IMPLEMENTATION & RESULTS , 2007 .

[13]  Bojan Petrovic,et al.  CONVERGENCE DIAGNOSTICS FOR EIGENVALUE PROBLEMS WITH LINEAR REGRESSION MODEL , 2011 .

[14]  Forrest B. Brown Stationarity Diagnostics Using Shannon Entropy in Monte Carlo Criticality Calculation I : F test , 2002 .

[15]  R. Dagan,et al.  Proof and implementation of the stochastic formula for ideal gas, energy dependent scattering kernel , 2009 .

[16]  Taro Ueki Information Theory and Undersampling Diagnostics for Monte Carlo Simulation of Nuclear Criticality , 2005 .

[17]  Jack Dongarra,et al.  A User''s Guide to PVM Parallel Virtual Machine , 1991 .

[18]  B. E. Watt,et al.  ENERGY SPECTRUM OF NEUTRONS FROM THERMAL FISSION OF U$sup 23$$sup 5$ , 1952 .

[19]  Pierre L'Ecuyer,et al.  Tables of linear congruential generators of different sizes and good lattice structure , 1999, Math. Comput..

[20]  Paul K. Romano APPLICATION OF THE STOCHASTIC OSCILLATOR TO ASSESS SOURCE CONVERGENCE IN MONTE CARLO CRITICALITY CALCULATIONS , 2009 .

[21]  Yu. G. Pashkin Accuracy of the Wigner approximation , 1970 .

[22]  E. D. Cashwell,et al.  A THIRD MONTE CARLO SAMPLER (A REVISION AND EXTENSION OF SAMPLERS 1 AND 2) , 1983 .

[23]  G. I. Bell,et al.  Nuclear Reactor Theory , 1952 .

[24]  E. Olusegun George,et al.  A modification of the Fisher-Cornish approximation for the student t percentiles , 1987 .

[25]  Forrest B. Brown,et al.  On-the-Fly Doppler Broadening for Monte Carlo Codes , 2012 .

[26]  E. Fridman,et al.  On the use of the Serpent Monte Carlo code for few-group cross section generation , 2011 .

[27]  Andrew R. Siegel,et al.  Analysis of communication costs for domain decomposed Monte Carlo methods in nuclear reactor analysis , 2012, J. Comput. Phys..

[28]  Toshihiro Yamamoto,et al.  Reliable Method for Fission Source Convergence of Monte Carlo Criticality Calculation with Wielandt's Method , 2004 .

[29]  Waclaw Gudowski,et al.  Fission matrix based Monte Carlo criticality calculations , 2009 .

[30]  Patrick S. Brantley,et al.  An efficient, robust, domain-decomposition algorithm for particle Monte Carlo , 2009, J. Comput. Phys..

[31]  Rajeev Thakur,et al.  Optimization of Collective Communication Operations in MPICH , 2005, Int. J. High Perform. Comput. Appl..

[32]  Andrew R. Siegel,et al.  The effect of load imbalances on the performance of Monte Carlo algorithms in LWR analysis , 2012, J. Comput. Phys..

[33]  F. Brown On the Use of Shannon Entropy of the Fission Distribution for Assessing Convergence of Monte Carlo Criticality Calculations , 2006 .

[34]  Paul K. Romano,et al.  Parallel Fission Bank Algorithms in Monte Carlo Criticality Calculations , 2012 .

[35]  J. Leppänen Two practical methods for unionized energy grid construction in continuous-energy Monte Carlo neutron transport calculation , 2009 .

[36]  Tuomas Viitanen,et al.  Explicit Treatment of Thermal Motion in Continuous-Energy Monte Carlo Tracking Routines , 2012 .

[37]  Paul K. Romano,et al.  Towards Scalable Parallelism in Monte Carlo Particle Transport Codes Using Remote Memory Access , 2011 .

[38]  Kord Smith,et al.  Investigation of CMFD Accelerated Monte Carlo Eigenvalue Calculation with Simplified Low Dimensional Multigroup Formulation , 2010 .

[39]  M. M. R. Williams,et al.  Slowing Down and Thermalization of Neutrons , 1966 .

[40]  F. B. Brown Random number generation with arbitrary strides , 1994 .

[41]  Garry H. Rodrigue,et al.  Domain Decomposition Models for Parallel Monte Carlo Transport , 2004, The Journal of Supercomputing.

[42]  Forrest B. Brown,et al.  On-The-Fly Neutron Doppler Broadening in MCNP , 2014, ICS 2014.

[43]  D. C. Carpenter,et al.  The MC21 Monte Carlo Transport Code , 2007 .

[44]  Richard J. Doyas,et al.  Interpolation of Tabular Secondary Neutron and Photon Energy Distributions , 1973 .

[45]  N. M. Larson,et al.  ENDF/B-VII.1 Nuclear Data for Science and Technology: Cross Sections, Covariances, Fission Product Yields and Decay Data , 2011 .

[46]  Charles R. Weisbin,et al.  Exact Doppler Broadening of Tabulated Cross Sections , 1976 .

[47]  Robert E. MacFarlane,et al.  Methods for Processing ENDF/B-VII with NJOY , 2010 .

[48]  S. Shapiro,et al.  An Analysis of Variance Test for Normality (Complete Samples) , 1965 .

[49]  Dermott E. Cullen,et al.  NEW CAPABILITIES IN MERCURY: A MODERN, MONTE CARLO PARTICLE TRANSPORT CODE , 2007 .

[50]  Forrest B. Brown,et al.  Monte Carlo methods for radiation transport analysis on vector computers , 1984 .

[51]  William R. Martin,et al.  CHALLENGES AND PROSPECTS FOR WHOLE-CORE MONTE CARLO ANALYSIS , 2012 .

[52]  Han Gyu Joo,et al.  Multigroup Monte Carlo Reactor Calculation with Coarse Mesh Finite Difference Formulation for Real Variance Reduction , 2010 .

[53]  Taro Ueki,et al.  Particle Population Diagnosis and Euclidean Minimum Spanning Tree in Monte Carlo Calculation of Power Distribution , 2011 .

[54]  William R. Martin,et al.  THE MONTE CARLO PERFORMANCE BENCHMARK TEST - AIMS, SPECIFICATIONS AND FIRST RESULTS , 2011 .

[55]  Anthony Skjellum,et al.  Using MPI - portable parallel programming with the message-parsing interface , 1994 .

[56]  E. S. Troubetzkoy,et al.  Monte Carlo radiation penetration calculations on a parallel computer , 1973 .

[57]  L. Levitt,et al.  The Probability Table Method for Treating Unresolved Neutron Resonances in Monte Carlo Calculations , 1972 .

[58]  Kord Smith,et al.  FULL-CORE, 2-D, LWR CORE CALCULATIONS WITH CASMO-4E , 2002 .

[59]  J. Leppänen Development of a New Monte Carlo reactor physics code , 2007 .

[60]  E. M. Gelbard Biases in Monte Carlo eigenvalue calculations , 1992 .

[61]  R. C. Geary The Frequency Distribution of the Quotient of Two Normal Variates , 2022 .

[62]  J. Eduard Hoogenboom,et al.  IS MONTE CARLO EMBARRASSINGLY PARALLEL , 2012 .

[63]  R J Procassini,et al.  Load Balancing of Parallel Monte Carlo Transport Calculations , 2004 .

[64]  Benoit Forget,et al.  The OpenMC Monte Carlo particle transport code , 2012 .

[65]  Ely M. Gelbard,et al.  Computation of standard deviations in Eigenvalue calculations , 1990 .

[66]  Farzad Rahnema,et al.  On the Diffusion Coefficients for Reactor Physics Applications , 2009 .

[67]  William R. Martin,et al.  A proposal for a benchmark to monitor the performance of detailed Monte Carlo calculation of power densities in a full size reactor core , 2009 .

[68]  Lubos Mitas,et al.  A global address space approach to automated data management for parallel Quantum Monte Carlo applications , 2012, 2012 19th International Conference on High Performance Computing.

[69]  Tuomas Viitanen,et al.  Explicit temperature treatment in Monte Carlo neutron tracking routines - First results , 2012 .

[70]  Andrew R. Siegel,et al.  Data decomposition of Monte Carlo particle transport simulations via tally servers , 2013, J. Comput. Phys..

[71]  Paul K. Romano,et al.  Development of a Graphical User Interface for In-Core Fuel Management Using MCODE , 2009 .

[72]  Anthony Foderaro,et al.  Elements of Neutron Interaction Theory , 1971 .

[73]  Bryan E. Toth,et al.  A NOVEL SOURCE CONVERGENCE ACCELERATION SCHEME FOR MONTE CARLO CRITICALITY CALCULATIONS, PART I: THEORY , 2007 .

[74]  R. J. Brissenden,et al.  Biases in the estimation of Keff and its error by Monte Carlo methods , 1986 .

[75]  G. I. Maldonado,et al.  Verification of the shift Monte Carlo code with the C5G7 reactor benchmark , 2012 .

[76]  J. Lieberoth MONTE CARLO TECHNIQUE TO SOLVE. THE STATIC EIGENVALUE PROBLEM OF THE BOLTZMANN TRANSPORT EQUATION. , 1968 .

[77]  F. B. Brown,et al.  Estimation of fission source bias in Monte Carlo eigenvalue calculations , 1993 .

[78]  Andrew Siegel,et al.  Progress and status of the OpenMC Monte Carlo code , 2013 .

[79]  J. Cetnar General solution of Bateman equations for nuclear transmutations , 2006 .

[80]  R. Macfarlane,et al.  The NJOY Nuclear Data Processing System , 2008 .

[81]  Frederica Darema,et al.  The SPMD Model : Past, Present and Future , 2001, PVM/MPI.

[82]  D. J. Kelly,et al.  MC21 analysis of the nuclear energy agency Monte Carlo performance benchmark problem , 2012 .

[83]  Forrest B. Brown,et al.  Temperature effects of resonance scattering for epithermal neutrons in MCNP , 2011 .

[84]  N. Metropolis,et al.  The Monte Carlo method. , 1949 .

[85]  Forrest B. Brown WIELANDT ACCELERATION FOR MCNP5 MONTE CARLO EIGENVALUE CALCULATIONS , 2007 .

[86]  L. Ridgway Scott,et al.  Scientific Parallel Computing , 2005 .

[87]  Bucholz,et al.  SCALE: a modular code system for performing standardized computer analyses for licensing evaluation , 1981 .

[88]  J. Watts,et al.  Interprocessor collective communication library (InterCom) , 1994, Proceedings of IEEE Scalable High Performance Computing Conference.

[89]  Forrest B. Brown,et al.  Fission Matrix Capability for MCNP Monte Carlo , 2014, ICS 2014.

[90]  Han Gyu Joo,et al.  MONTE CARLO REACTOR CALCULATION WITH SUBSTANTIALLY REDUCED NUMBER OF CYCLES , 2012 .

[91]  Forrest B. Brown,et al.  COMPARISON OF DISCRETE AND CONTINUOUS THERMAL NEUTRON SCATTERING TREATMENTS IN MCNP5 , 2012 .

[92]  Collin McCurdy,et al.  Early evaluation of IBM BlueGene/P , 2008, HiPC 2008.

[93]  F. B. Brown,et al.  Reproducibility and Monte Carlo eigenvalue calculations , 1992 .

[94]  M. Herman,et al.  ENDF-6 Formats Manual Data Formats and Procedures for the Evaluated Nuclear Data File ENDF/B-VI and ENDF/B-VII Written by the Members of the Cross Sections Evaluation Working Group Last Revision Edited by , 2009 .

[95]  Philip F. Rose,et al.  ENDF-6 Formats Manual , 1997 .

[96]  Taro Ueki On-the-Fly Diagnostics of Particle Population in Iterated-Source Monte Carlo Methods , 2008 .

[97]  Alan J. Wallcraft SPMD OpenMP versus MPI for ocean models , 2000, Concurr. Pract. Exp..