论文信息 - A smart cache for improved vector performance

A smart cache for improved vector performance

Abstract As the speed of microprocessors increases at a breath-taking rate, the gap between processor and memory system performance is getting worse. To alleviate this problem, all modern processors contain caches, but even using caches, processors cannot achieve their peak performance. We propose a mechanism, smart caching , which extends the power of conventional memory subsystems by including a prefetch unit. This prefetch unit is responsible for efficiently using the available memory bandwidth by fetching memory data before they are actually needed. Prefetching allows high-level application knowledge to increase memory performance, which is currently constraining the performance of most systems. While prefetching does not reduce the latency of memory accesses, it hides this latency by overlapping memory access and instruction execution.

Michael K. Gschwind | Thomas J. Pietsch | M. Gschwind

[1] G. Amdhal,et al. Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[2] David A. Patterson,et al. Computer Architecture: A Quantitative Approach , 1969 .

[3] D. Morris,et al. Pathlength reduction features in the PA-RISC architecture , 1992, Digest of Papers COMPCON Spring 1992.

[4] Norman P. Jouppi,et al. Improving direct-mapped cache performance by the addition of a small fully-associative cache and pre , 1990, ISCA 1990.

[5] Wen-mei W. Hwu,et al. Proceedings of the 25th annual international symposium on Microarchitecture , 1992, MICRO.

[6] Gerry Kane,et al. MIPS RISC Architecture , 1987 .

[7] Anne Rogers,et al. Software support for speculative loads , 1992, ASPLOS V.

[8] Janak H. Patel,et al. Stride directed prefetching in scalar processors , 1992, MICRO 1992.

[9] Norman P. Jouppi,et al. Computer technology and architecture: an evolving interaction , 1991, Computer.

[10] Henry M. Levy,et al. An architecture for software-controlled data prefetching , 1991, ISCA '91.

[11] Jean-Loup Baer,et al. Proceedings of the 39th Annual International Symposium on Computer Architecture , 1983, International Symposium on Computer Architecture.