论文信息 - Model-Based Memory Hierarchy Optimizations for Sparse Matrices

Model-Based Memory Hierarchy Optimizations for Sparse Matrices

Sparse matrix-vector multiplication is an important computational kernel used in numerical algorithms. It tends to run much more slowly than its dense counterpart, and its performance depends heavily on both the nonzero structure of the sparse matrix and on the machine architecture. In this paper we address the problem of optimizing sparse matrix-vector multiplication for the memory hierarchies that exist on modern machines and how machine-speciic or matrix-speciic prooling information can be used to decide which optimizations should be applied and what parameters should be used. We also consider a variation of the problem in which a matrix is multiplied by a set of vectors. Performance is measured on a 167 MHz Ultra-sparc I, 200 MHz Pentium Pro, and 450 MHz DEC Alpha 21164. Experiments show these optimization techniques to have signiicant payoo, although the eeectiveness of each depends on the matrix structure and machine.

Eun-Jin Im | E. Im

[1] James Demmel,et al. Optimizing matrix multiply using PHiPAC: a portable, high-performance, ANSI C coding methodology , 1997, ICS '97.

[2] Joel H. Saltz,et al. Applying the CHAOS/PARTI library to irregular problems in computational chemistry and computational aerodynamics , 1993, Proceedings of Scalable Parallel Libraries Conference.

[3] Steven Mark Carr,et al. Memory-hierarchy management , 1993 .

[4] D LamMonica,et al. The cache performance and optimizations of blocked algorithms , 1991 .

[5] K. Pingali,et al. Compiling Parallel Code for Sparse Matrix Applications , 1997, ACM/IEEE SC 1997 Conference (SC'97).

[6] Kathryn S. McKinley,et al. Tile size selection using cache organization and data layout , 1995, PLDI '95.

[7] Michael E. Wolf,et al. Improving locality and parallelism in nested loops , 1992 .

[8] John N. Shadid,et al. Aztec user`s guide. Version 1 , 1995 .

[9] Gene H. Golub,et al. The block Lanczos method for computing eigenvalues , 2007, Milestones in Matrix Computation.

[10] Aart J. C. Bik,et al. Compiler support for sparse matrix computations , 1996 .

[11] R. C. Whaley,et al. Automatically Tuned Linear Algebra Software (ATLAS) , 2011, Encyclopedia of Parallel Computing.

[12] Jack J. Dongarra,et al. An extended set of FORTRAN basic linear algebra subprograms , 1988, TOMS.