Matrix Inversion on a PIM (Processor-in-Memory)

Many computational applications in science and engineering require inverting large matrices. To reduce computation time, parallel architectures are used together with appropriate data partitioning and placement schemes. In this paper, matrices are inverted on a PIM (processor-in-memory) architecture by recursively applying a blockwise analytical matrix inversion scheme, and the performance of this approach is evaluated and analyzed.
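As an illustration of what recursive blockwise inversion can look like, the following minimal sketch uses the standard 2x2 block inverse based on the Schur complement. It is an assumption that this formula matches the scheme in the paper, since the abstract does not give the exact formulation. The names `block_inverse`, `matmul`, `matsub`, and `negate` are hypothetical helpers, and the code runs sequentially: it does not model PIM nodes, data partitioning, or placement. It also assumes that every leading block and every Schur complement met during the recursion is invertible.

```python
def matmul(X, Y):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]


def matsub(X, Y):
    """Element-wise difference X - Y."""
    return [[x - y for x, y in zip(rx, ry)] for rx, ry in zip(X, Y)]


def negate(X):
    """Element-wise negation."""
    return [[-x for x in row] for row in X]


def block_inverse(M):
    """Invert a square matrix by recursive 2x2 block partitioning.

    With M = [[A, B], [C, D]] and S = D - C A^-1 B (the Schur complement):
        M^-1 = [[A^-1 + A^-1 B S^-1 C A^-1,  -A^-1 B S^-1],
                [-S^-1 C A^-1,                S^-1        ]]
    Assumes A and S are invertible at every recursion level (no pivoting).
    """
    n = len(M)
    if n == 1:
        return [[1.0 / M[0][0]]]

    k = n // 2  # split point; the blocks need not be the same size
    A = [row[:k] for row in M[:k]]
    B = [row[k:] for row in M[:k]]
    C = [row[:k] for row in M[k:]]
    D = [row[k:] for row in M[k:]]

    A_inv = block_inverse(A)                 # recursive call on the leading block
    A_inv_B = matmul(A_inv, B)
    C_A_inv = matmul(C, A_inv)
    S = matsub(D, matmul(C, A_inv_B))        # Schur complement of A
    S_inv = block_inverse(S)                 # recursive call on the Schur complement

    top_right = negate(matmul(A_inv_B, S_inv))
    bottom_left = negate(matmul(S_inv, C_A_inv))
    # A^-1 + A^-1 B S^-1 C A^-1, written using the top_right block
    top_left = matsub(A_inv, matmul(top_right, C_A_inv))

    top = [l + r for l, r in zip(top_left, top_right)]
    bottom = [l + r for l, r in zip(bottom_left, S_inv)]
    return top + bottom
```

In this formulation each level of the recursion reduces one inversion to two half-size inversions plus several matrix multiplications, so the cost is dominated by block products of the kind a parallel architecture can distribute.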
