Designing SLATE: Software for Linear Algebra Targeting Exascale

[1]  Jack J. Dongarra,et al.  Accelerating Numerical Dense Linear Algebra Calculations with GPUs , 2014, Numerical Computations with GPUs.

[2]  Jack Dongarra,et al.  Roadmap for the Development of a Linear Algebra Library for Exascale Computing: SLATE: Software for Linear Algebra Targeting Exascale , 2017 .

[3]  Fred G. Gustavson,et al.  A recursive formulation of Cholesky factorization of a matrix in packed storage , 2001, TOMS.

[4]  Thomas Hérault,et al.  Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA , 2011, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum.

[5]  Jack Dongarra,et al.  C++ API for BLAS and LAPACK , 2017 .

[6]  Lars Karlsson,et al.  Parallel and Cache-Efficient In-Place Matrix Storage Format Conversion , 2012, TOMS.

[7]  Jack J. Dongarra,et al.  Task-Based Cholesky Decomposition on Knights Corner Using OpenMP , 2016, ISC Workshops.

[8]  John A. Gunnels,et al.  A Recursive Formulation of the Inversion of Symmetric Positive Definite Matrices in Packed Storage Data Format , 2002, PARA.

[9]  Thomas Hérault,et al.  PaRSEC: Exploiting Heterogeneity to Enhance Scalability , 2013, Computing in Science & Engineering.

[10]  Jack Dongarra,et al.  Design and Implementation of the PULSAR Programming System for Large Scale Computing , 2017, Supercomput. Front. Innov..

[11]  Asim YarKhan,et al.  Dynamic Task Execution on Shared and Distributed Memory Architectures , 2012 .

[12]  Jack J. Dongarra,et al.  Porting the PLASMA Numerical Library to the OpenMP Standard , 2017, International Journal of Parallel Programming.

[13]  Isak Jonsson,et al.  Recursive Blocked Data Formats and BLAS's for Dense Linear Algebra Algorithms , 1998, PARA.