On cache memory hierarchy for Chip-Multiprocessor

Advances in integrated circuit technology have given microprocessor designers a large transistor budget and, with it, more challenging design problems. The one-billion-transistor microprocessor is not far off. Microarchitects are therefore looking for efficient ways to use this budget, and the Chip-Multiprocessor (CMP) is one of the strong candidates. The memory system is a pivotal part of any multiprocessor and can either boost performance or degrade it dramatically. The purpose of this study is to investigate the most appropriate cache memory hierarchy for a CMP. We compare several candidate hierarchies and present conclusions that identify the configurations leading to better CMP performance, as well as the configurations that must be avoided.
