Massively parallel algorithms for method of characteristics neutral particle transport on shared memory computer architectures

Over the past 20 years, parallel computing has enabled computers to grow ever larger and more powerful while scientific applications have advanced in sophistication and resolution. This trend is being challenged, however, as the power consumption for conventional parallel computing architectures has risen to unsustainable levels and memory limitations have come to dominate compute performance. Multi-core processors and heterogeneous computing platforms, such as Graphics Processing Units (GPUs), are an increasingly popular paradigm for resolving these issues. This thesis explores the applicability of shared memory parallel platforms for solving deterministic neutron transport problems. A 2D method of characteristics code - OpenMOC - has been developed with solvers for shared memory multi-core platforms as well as GPUs. The multi-threading and memory locality methodologies for the multi-core CPU and GPU solvers are presented. Parallel scaling results using OpenMP demonstrate better than ideal weak scaling and nearly perfect strong scaling on both Intel Xeon and IBM Blue Gene/Q architectures. Performance results for the 2D C5G7 benchmark demonstrate up to 50x speedup for MOC on a GPU. The lessons learned from this thesis will provide the basis for further exploration of MOC on many-core platforms and GPUs as well as design decisions for hardware vendors exploring technologies for the next generation of machines for scientific computing.

[1]  Georg Hager,et al.  Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes , 2009, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[2]  John E. Stone,et al.  GPU clusters for high-performance computing , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[3]  Samuel H. Fuller,et al.  Computing Performance: Game Over or Next Level? , 2011, Computer.

[4]  Benoit Forget,et al.  The OpenMC Monte Carlo particle transport code , 2012 .

[5]  Tianyu Liu,et al.  EVALUATION OF SPEEDUP OF MONTE CARLO CALCULATIONS OF TWO SIMPLE REACTOR PHYSICS PROBLEMS CODED FOR THE GPU/CUDA ENVIRONMENT , 2011 .

[6]  Kai Lu,et al.  GPU accelerated Monte Carlo simulation of deep penetration neutron transport , 2012, 2012 2nd IEEE International Conference on Parallel, Distributed and Grid Computing.

[7]  Karl S. Hemmert Green HPC: From Nice to Necessity , 2010, Comput. Sci. Eng..

[8]  G.E. Moore,et al.  Cramming More Components Onto Integrated Circuits , 1998, Proceedings of the IEEE.

[9]  John Shalf,et al.  Exascale Computing Technology Challenges , 2010, VECPAR.

[10]  C. Kelley,et al.  Hybrid Deterministic/Monte Carlo Neutronics Using GPU Accelerators , 2012, 2012 11th International Symposium on Distributed Computing and Applications to Business, Engineering & Science.

[11]  Forrest B. Brown,et al.  Monte Carlo methods for radiation transport analysis on vector computers , 1984 .