Cache Organization for Embedded Processors: CAM-vs-SRAM

Caches are becoming an increasingly important part of embedded processor design because of their impact on performance as well as on implementation, specifically area, power, and timing. Different cache organizations make different tradeoffs among these metrics. One of the main architectural choices is whether to use a standard SRAM-based tag design or a CAM-based organization. This choice has far-reaching consequences for all other aspects of the cache design. We compare these two cache styles using results from a recently completed DSP core design. Our conclusion is that, contrary to popular belief, an SRAM-tag-based design provided a better overall design point and was superior in terms of energy. Some of the driving factors, such as the increasing dominance of wire and leakage power, are extrapolated forward to next-generation processes.
