Paper Information - Memory Performance and Scalability of Intel's and AMD's Dual-Core Processors: A Case Study

As chip multiprocessors (CMPs) have become the mainstream in processor architectures, Intel and AMD have introduced their dual-core processors to the PC market. In this paper, performance studies on an Intel Core 2 Duo, an Intel Pentium D and an AMD Athlon 64 X2 processor are reported. According to the design specifications, key differences exist in the critical memory hierarchy architecture among these dual-core processors. In addition to overall execution time and throughput measurements using both multiprogrammed and multi-threaded workloads, this paper provides a detailed analysis of the memory hierarchy performance and of the performance scalability between single and dual cores. Our results indicate that for the best performance and scalability, it is important to have (1) fast cache-to-cache communication, (2) large L2 or shared capacity, (3) low L2-to-core latency, and (4) fair cache resource sharing. The three dual-core processors that we studied show the benefits of some of these factors, but not all of them. Core 2 Duo has the best performance for most of the workloads because of microarchitecture features such as its shared L2 cache. Pentium D shows the worst performance in many aspects because it is a technology remap of the Pentium 4.

Lu Peng | Yen-Kuang Chen | Jih-Kwon Peir | David M. Koppelman | Tribuvan K. Prakash

[1] Ian Pratt, et al. Multiprogramming Performance of the Pentium 4 with Hyper-Threading, 2004.

[2] Dean M. Tullsen, et al. Initial observations of the simultaneous multithreading Pentium 4 processor, 2003, 12th International Conference on Parallel Architectures and Compilation Techniques.

[3] Carl Staelin. lmbench: an extensible micro-benchmark suite, 2005, Softw. Pract. Exp.

[4] David A. Bader, et al. BioPerf: a benchmark suite to evaluate high-performance computer architecture on bioinformatics applications, 2005, Proceedings of the IEEE Workload Characterization Symposium.

[5] Thorsten von Eicken, et al. Technical Commentary: IEEE Computer, 1999.

[6] Avi Mendelson, et al. CMP Implementation in Systems Based on the Intel Core Duo Processor, 2006.

[7] Kunle Olukotun, et al. A Single-Chip Multiprocessor, 1997, Computer.

[8] Balaram Sinharoy, et al. IBM Power5 chip: a dual-core multithreaded processor, 2004, IEEE Micro.

[9] Anoop Gupta, et al. The SPLASH-2 programs: characterization and methodological considerations, 1995, ISCA.

[10] Balaram Sinharoy, et al. POWER4 system microarchitecture, 2002, IBM J. Res. Dev.