Performance modeling of a hierarchcial N-body algorithm for arbitrary particle distribution (Unrefereed Workshop Manuscript)
暂无分享,去创建一个
[1] Joshua E. Barnes,et al. A modified tree code: don't laugh; it runs , 1990 .
[2] Jeffrey S. Vetter,et al. Aspen: A domain specific language for performance modeling , 2012, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis.
[3] Rio Yokota,et al. Petascale turbulence simulation using a highly parallel fast multipole method on GPUs , 2011, Comput. Phys. Commun..
[4] Leslie Greengard,et al. A fast algorithm for particle simulations , 1987 .
[5] Lorena A. Barba,et al. A tuned and scalable fast multipole method as a preeminent algorithm for exascale systems , 2011, Int. J. High Perform. Comput. Appl..
[6] Kenjiro Taura,et al. Design and implementation of a customizable work stealing scheduler , 2013, ROSS '13.
[7] Rio Yokota,et al. An FMM Based on Dual Tree Traversal for Many-Core Architectures , 2012, ArXiv.
[8] Richard W. Vuduc,et al. A CPU: GPU Hybrid Implementation and Model-Driven Scheduling of the Fast Multipole Method , 2014, GPGPU@ASPLOS.