A massively parallel adaptive fast-multipole method on heterogeneous architectures
暂无分享,去创建一个
Lexing Ying | George Biros | Tuan Anh Nguyen | Richard Vuduc | Denis Zorin | Aparna Chandramowlishwaran | Harper Langston | Ilya Lashuk | Rahul S. Sampath | Aashay Shringarpure
[1] Lexing Ying,et al. A New Parallel Kernel-Independent Fast Multipole Method , 2003, ACM/IEEE SC 2003 Conference (SC'03).
[2] Klaus Schulten,et al. Adapting a message-driven parallel application to GPU-accelerated clusters , 2008, 2008 SC - International Conference for High Performance Computing, Networking, Storage and Analysis.
[3] D. Zorin,et al. A kernel-independent adaptive fast multipole algorithm in two and three dimensions , 2004 .
[4] Matthew G. Knepley,et al. Biomolecular electrostatics using a fast multipole BEM on up to 512 gpus and a billion unknowns , 2010, Comput. Phys. Commun..
[5] Shang-Hua Teng,et al. Provably Good Partitioning and Load Balancing Algorithms for Parallel Adaptive N-Body Simulation , 1998, SIAM J. Sci. Comput..
[6] Joseph JáJá,et al. An Introduction to Parallel Algorithms , 1992 .
[7] Andrew W. Moore,et al. 'N-Body' Problems in Statistical Learning , 2000, NIPS.
[8] F. Sevilgen,et al. A Unifying Data Structure for Hierarchical Methods , 1999, ACM/IEEE SC 1999 Conference (SC'99).
[9] Ramani Duraiswami,et al. Fast multipole methods on graphics processors , 2008, J. Comput. Phys..
[10] William Gropp,et al. A Parallel Version of the Fast Multipole Method-Invited Talk , 1987, PPSC.
[11] Ananth Grama,et al. Scalable parallel formulations of the Barnes-Hut method for n-body simulations , 1994, Proceedings of Supercomputing '94.
[12] Hari Sundar,et al. Dendro: parallel algorithms for multigrid and AMR methods on 2:1 balanced octrees , 2008, HiPC 2008.
[13] Makoto Taiji,et al. 42 TFlops hierarchical N-body simulations on GPUs with applications in both astrophysics and turbulence , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[14] L. Greengard,et al. Regular Article: A Fast Adaptive Multipole Algorithm in Three Dimensions , 1999 .
[15] Jakub Kurzak,et al. Massively parallel implementation of a fast multipole method for distributed memory machines , 2005, J. Parallel Distributed Comput..
[16] Srinivas Aluru,et al. Fast, parallel, GPU-based construction of space filling curves and octrees , 2008, I3D '08.
[17] George Karypis,et al. Introduction to Parallel Computing , 1994 .
[18] M. S. Warren,et al. A parallel hashed Oct-Tree N-body algorithm , 1993, Supercomputing '93.
[19] Hari Sundar,et al. Bottom-Up Construction and 2: 1 Balance Refinement of Linear Octrees in Parallel , 2008, SIAM J. Sci. Comput..
[20] Srinivas Aluru,et al. Efficient parallel algorithms and software for compressed octrees with applications to hierarchical methods , 2005, Parallel Comput..
[21] Rajiv K. Kalia,et al. Scalable and portable implementation of the fast multipole method on parallel computers , 2003 .
[22] Leslie Greengard,et al. A fast algorithm for particle simulations , 1987 .
[23] Richard W. Vuduc,et al. Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures , 2010, 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis.