Towards Scalable Parallelism in Monte Carlo Particle Transport Codes Using Remote Memory Access
暂无分享,去创建一个
One forthcoming challenge in the area of high-performance computing is having the ability to run large-scale problems while coping with less memory per compute node. In this work, we investigate a novel data decomposition method that would allow Monte Carlo transport calculations to be performed on systems with limited memory per compute node. In this method, each compute node remotely retrieves a small set of geometry and cross-section data as needed and remotely accumulates local tallies when crossing the boundary of the local spatial domain. Initial results demonstrate that while the method does allow large problems to be run in a memory-limited environment, achieving scalability may be difficult due to inefficiencies in the current implementation of RMA operations.
[1] Richard L. Graham,et al. Open MPI: A Flexible High Performance MPI , 2005, PPAM.