Fast development of dense linear algebra codes on graphics processors
暂无分享,去创建一个
[1] Helmar Burkhart,et al. Algorithmic performance studies on graphics processing units , 2008, J. Parallel Distributed Comput..
[2] Rafael Mayo,et al. Solving Dense Linear Systems on Graphics Processors , 2008, Euro-Par.
[3] Golub Gene H. Et.Al. Matrix Computations, 3rd Edition , 2007 .
[4] Robert A. van de Geijn,et al. Using PLAPACK - parallel linear algebra package , 1997 .
[5] Rafael Mayo,et al. Evaluation and tuning of the Level 3 CUBLAS for graphics processors , 2008, 2008 IEEE International Symposium on Parallel and Distributed Processing.
[6] William Gropp,et al. PETSc 2.0 users manual , 2000 .
[7] Robert A. van de Geijn,et al. The science of deriving dense linear algebra algorithms , 2005, TOMS.
[8] Robert H. Halstead,et al. Matrix Computations , 2011, Encyclopedia of Parallel Computing.
[9] Robert A. van de Geijn,et al. Representing linear algebra algorithms in code: the FLAME application program interfaces , 2005, TOMS.
[10] Jack Dongarra,et al. MPI: The Complete Reference , 1996 .