Evaluation of a 'stall' cache: an efficient restricted onchip instruction cache

The paper compares the cost and performance of a new kind of restricted instruction cache architecture-the stall cache-against several other conventional cache architectures. The stall cache minimizes the size of an onchip instruction cache by caching only those instructions whose instruction fetch phase collides with the memory access phase of a preceding load or store instruction. Many existing machines provide a single cycle external cache memory. The results show that, under this assumption, the stall cache always outperforms an equivalent sized on-chip instruction cache, reducing external memory access stalls by approximately 10%. In addition results are presented for a system using an on-chip data cache, and for one with a double width data bus and short instruction prefetch buffer.<<ETX>>

[1]  James R. Goodman,et al.  Instruction Cache Replacement Policies and Organizations , 1985, IEEE Transactions on Computers.

[2]  Janak H. Patel,et al.  Accurate Low-Cost Methods for Performance Evaluation of Cache Memory Systems , 1988, IEEE Trans. Computers.

[3]  Michael J. K. Nielsen DECstation 5000 Model 200 , 1991, COMPCON Spring '91 Digest of Papers.

[4]  Anant Agrawal,et al.  Implementing Sparc in ECL , 1990, IEEE Micro.

[5]  Mark D. Hill,et al.  Aspects of Cache Memory and Instruction , 1987 .

[6]  Veljko M. Milutinovic,et al.  GaAs microprocessors and digital systems: an overview of R&D efforts , 1988, IEEE Micro.

[7]  David A. Patterson,et al.  Reduced instruction set computers , 1985, CACM.

[8]  Edward H. Frank,et al.  Sun's SPARCstation 1: a workstation for the 1990s , 1990, Digest of Papers Compcon Spring '90. Thirty-Fifth IEEE Computer Society International Conference on Intellectual Leverage.

[9]  Mark D. Hill,et al.  A case for direct-mapped caches , 1988, Computer.

[10]  David A. Patterson,et al.  The case for the reduced instruction set computer , 1980, CARN.

[11]  George Radin,et al.  The 801 minicomputer , 1982, ASPLOS I.

[12]  Alan Jay Smith,et al.  Branch Prediction Strategies and Branch Target Buffer Design , 1995, Computer.

[13]  JOHN L. HENNESSY,et al.  VLSI Processor Architecture , 1984, IEEE Transactions on Computers.

[14]  J. Yetter,et al.  CMOS PA-RISC processor for a new family of workstations , 1991, COMPCON Spring '91 Digest of Papers.

[15]  David A. Patterson,et al.  Computer Architecture: A Quantitative Approach , 1969 .

[16]  Douglas V. Hall Microprocessors and Digital Systems , 1980 .

[17]  James R. Goodman,et al.  A study of instruction cache organizations and replacement policies , 1983, ISCA '83.

[18]  Anant Agarwal,et al.  Blocking: exploiting spatial locality for trace compaction , 1990, SIGMETRICS '90.