Paper information - Theoretical modeling of superscalar processor performance

Theoretical modeling of superscalar processor performance

The current trace-driven simulation approach to determine superscalar processor performance is widely used but has some shortcomings. Modern benchmarks generate extremely long traces, resulting in problems with data storage, as well as very long simulation run times. More fundamentally, simulation generally does not provide significant insight into the factors that determine performance or a characterization of their interactions. This paper proposes a theoretical model of superscalar processor performance that addresses these shortcomings. Performance is viewed as an interaction of program parallelism and machine parallelism. Both program and machine parallelisms are decomposed into multiple component functions. Methods for measuring or computing these functions are described. The functions are combined to provide a model of the interaction between program and machine parallelisms and an accurate estimate of the performance. The computed performance, based on this model, is compared to simulated performance for six benchmarks from the SPEC 92 suite on several configurations of the IBM RS/6000 instruction set architecture.

John Paul Shen | Derek B. Noonburg
