A statement based parallelizing framework for processor-in-memory architectures

It is widely known that current memory architecture is one of the bottlenecks for high-performance computers due to the increasing gap between the processor speed and memory latency. For this reason, several architectures, called intelligent memory (IRAM) or processor-in-memory (PIM), have been studied in recent years aiming to integrate the processor and memory together. A merit of PIM architecture is that the PIM chips can be used to replace the main memory chips in a workstation and act as coprocessors when main processor spawns them. This approach has been adopted by Active Page, DIVA, and FlexRAM, among others. This class of architectures provides a hierarchical hybrid multiprocessor environment: host (main) processors and memory processors. Host processor is more powerful with a deep cache hierarchies and higher latency to access memory. By contrast, memory processors are usually less powerful but with a lower

[1]  Prithviraj Banerjee,et al.  Advanced compilation techniques in the PARADIGM compiler for distributed-memory multicomputers , 1995, ICS '95.

[2]  William H. Press,et al.  Numerical Recipes in Fortran 77 , 1992 .

[3]  William H. Press,et al.  Numerical Recipes: FORTRAN , 1988 .

[4]  Tsung-Chuan Huang,et al.  A new analyzing approach for intelligent memory systems , 2001, Computers and Their Applications.

[5]  Tsung-Chuan Huang,et al.  Improving workload balance and code optimization in processor-in-memory systems , 2001, Proceedings. Eighth International Conference on Parallel and Distributed Systems. ICPADS 2001.

[6]  Seung-Moon Yoo,et al.  FlexRAM: toward an advanced intelligent memory system , 1999, Proceedings 1999 IEEE International Conference on Computer Design: VLSI in Computers and Processors (Cat. No.99CB37040).

[7]  Hiroshi Nakashima,et al.  Optimized code generation for heterogeneous computing environment using parallelizing compiler TINPAR , 1998, Proceedings. 1998 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.98EX192).

[8]  David J. Kuck,et al.  A Survey of Parallel Machine Organization and Programming , 1977, CSUR.