Preliminary Implementation of PETSc Using GPUs
暂无分享,去创建一个
[1] Ignacio E. Grossmann,et al. Computers and Chemical Engineering , 2014 .
[2] Wilfred Pinfold,et al. Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis , 2009, HiPC 2009.
[3] Sébastien Ourselin,et al. High-Speed Nonlinear Finite Element Analysis for Surgical Simulation Using Graphics Processing Units , 2008, IEEE Transactions on Medical Imaging.
[4] Sébastien Ourselin,et al. Real-Time Nonlinear Finite Element Analysis for Surgical Simulation Using Graphics Processing Units , 2007, MICCAI.
[5] Murat Efe Guney,et al. On the limits of GPU acceleration , 2010 .
[6] Karol Miller,et al. Real-Time Nonlinear Finite Element Computations on GPU - Application to Neurosurgical Simulation. , 2010, Computer methods in applied mechanics and engineering.
[7] Zhuo Feng,et al. Multigrid on GPU: Tackling Power Grid Analysis on parallel SIMT platforms , 2008, 2008 IEEE/ACM International Conference on Computer-Aided Design.
[8] Eitan Grinspun,et al. Sparse matrix solvers on the GPU: conjugate gradients and multigrid , 2003, SIGGRAPH Courses.
[9] Pheng-Ann Heng,et al. A hybrid condensed finite element model with GPU acceleration for interactive 3D soft tissue cutting , 2004, Comput. Animat. Virtual Worlds.
[10] Tamara G. Kolda,et al. An overview of the Trilinos project , 2005, TOMS.
[11] Matthew G. Knepley,et al. Biomolecular electrostatics using a fast multipole BEM on up to 512 gpus and a billion unknowns , 2010, Comput. Phys. Commun..
[12] Timothy C. Warburton,et al. Nodal discontinuous Galerkin methods on graphics processors , 2009, J. Comput. Phys..
[13] Satoshi Matsuoka,et al. Fast Conjugate Gradients with Multiple GPUs , 2009, ICCS.
[14] Rajesh Bordawekar,et al. Optimizing Sparse Matrix-Vector Multiplication on GPUs , 2009 .
[15] Michael Garland,et al. Efficient Sparse Matrix-Vector Multiplication on CUDA , 2008 .
[16] Alan B. Williams,et al. A Light-weight API for Portable Multicore Programming , 2010, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing.
[17] William Gropp,et al. Efficient Management of Parallelism in Object-Oriented Numerical Software Libraries , 1997, SciTools.
[18] Roland Keunings,et al. Parallel finite element algorithms applied to computational rheology , 1995 .
[19] Rong Liu,et al. A finite element model study on wear resistance of pseudoelastic TiNi alloy , 2000 .
[20] Rajesh Bordawekar,et al. Optimizing Sparse Matrix-Vector Multiplication on GPUs using Compile-time and Run-time Strategies , 2008 .
[21] Michael Garland,et al. Implementing sparse matrix-vector multiplication on throughput-oriented processors , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.
[22] Guillaume Caumon,et al. Concurrent Number Cruncher: An Efficient Sparse Linear Solver on the GPU , 2007, HPCC.
[23] Robert B. Haber,et al. A space-time discontinuous Galerkin method for linearized elastodynamics with element-wise momentum balance , 2006 .
[24] D. Komatitsch,et al. The spectral element method: An efficient tool to simulate the seismic response of 2D and 3D geological structures , 1998, Bulletin of the Seismological Society of America.