论文信息 - MisSPECulation: partial and misleading use of SPEC CPU2000 in computer architecture conferences

MisSPECulation: partial and misleading use of SPEC CPU2000 in computer architecture conferences

A majority of the papers published in leading computer architecture conferences use SPEC CPU2000, or its predecessor SPEC CPU95, which has become the de facto standard for measuring processor and/or memory-hierarchy performance. However, in most cases a subset of the suite's benchmarks are simulated. For example: 27 papers were published in ISCA 2002, 16 used SPEC CINT2000, 4 used the whole suite, and only 3 papers explained their omissions.This paper quantifies the extent of this phenomenon in the ISCA, Micro, and HPCA conferences: 173 papers were surveyed, 115 used benchmarks from SPEC CINT, but only 23 used the whole suite. If this current trend continues, by the year 2005 80% of the papers will use the full CINT2000 suite, a year after CPU2004 shall be announced.We claim that results based upon a subset of a benchmark suite are speculative and conflict with Amdahl's Law. The law implies that we must present the speedup of using the proposed technique on the whole suite. Projecting the law (by statistically supplying values for the missing benchmarks) to several published papers reduces promising results to average ones. Speedups are reduced from 1.42 to 1.16 in one case, from 1.43 to 1.13 in another, and from 1.76 to 1.15 in a third.Finally, we have found that the disregard for CFP2000 is unwarranted in papers that explore the data cache domain, the suite displays a higher data cache miss rate than CINT2000, which is used more frequently.

Daniel Citron

[1] Mark D. Hill,et al. Cache performance for selected SPEC CPU2000 benchmarks , 2001, CARN.

[2] A. J. KleinOsowski,et al. MinneSPEC: A New SPEC Benchmark Workload for Simulation-Based Computer Architecture Research , 2002, IEEE Computer Architecture Letters.

[3] James E. Smith,et al. Characterizing computer performance with a single number , 1988, CACM.

[4] André Seznec,et al. Choosing representative slices of program execution for microarchitecture simulations: a preliminary , 2000 .

[5] Kevin Skadron,et al. Power issues related to branch prediction , 2002, Proceedings Eighth International Symposium on High Performance Computer Architecture.

[6] David A. Patterson,et al. Computer Architecture - A Quantitative Approach, 5th Edition , 1996 .

[7] David A. Patterson,et al. Computer Architecture: A Quantitative Approach , 1969 .

[8] Larry Rudolph,et al. Accelerating multi-media processing by implementing memoing in multiplication and division units , 1998, ASPLOS VIII.

[9] James E. Smith,et al. Modeling superscalar processors via statistical simulation , 2001, Proceedings 2001 International Conference on Parallel Architectures and Compilation Techniques.

[10] Mayan Moudgill,et al. Environment for PowerPC microarchitecture exploration , 1999, IEEE Micro.

[11] Niv Ahituv,et al. SPEC as a Performance Evaluation Measure , 1995, Computer.