Paper information - The accuracy of trace-driven simulations of multiprocessors

In trace-driven simulation, traces generated for one set of system characteristics are used to simulate a system with different characteristics. However, the execution path of a multiprocessor workload may depend on the order of events occurring on different processing elements. The event order, in turn, depends on system characteristics such as memory-system latencies and buffer sizes. Trace-driven simulations of multiprocessor workloads are inaccurate unless these dependencies are eliminated from the traces. We have measured the effects of these inaccuracies by comparing trace-driven simulations to direct simulations of the same workloads. The simulators predicted identical performance only for workloads whose traces were timing-independent. Workloads that used first-come first-served scheduling and/or non-deterministic algorithms produced timing-dependent traces, and simulation of these traces produced inaccurate performance predictions. Two types of performance metrics were particularly affected: those related to synchronization latency and those derived from relatively small numbers of events. To accurately predict such performance metrics, timing-independent traces or direct simulation should be used.
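The timing dependence described above can be illustrated with a toy model. The sketch below is not the paper's simulator or methodology; it is a minimal illustration under stated assumptions: a shared first-come first-served task queue, a made-up cost model (compute cycles plus memory references times a uniform latency), and hypothetical names `fcfs_schedule`, `replay_trace`, and `TASKS`. It records which processor ran which task at one memory latency, then compares replaying that recorded assignment at a different latency against simulating the new latency directly.

```python
import heapq

def fcfs_schedule(tasks, num_procs, mem_latency):
    """Direct simulation of a first-come first-served task queue (toy model).

    tasks: list of (compute_cycles, memory_refs) in queue order.
    Returns (per-processor list of task ids, overall finish time).
    """
    # Min-heap of (time at which the processor becomes free, processor id).
    free_at = [(0, p) for p in range(num_procs)]
    heapq.heapify(free_at)
    assignment = [[] for _ in range(num_procs)]
    finish = 0
    for task_id, (compute, refs) in enumerate(tasks):
        # The next task goes to whichever processor frees up first.
        now, proc = heapq.heappop(free_at)
        assignment[proc].append(task_id)
        done = now + compute + refs * mem_latency
        finish = max(finish, done)
        heapq.heappush(free_at, (done, proc))
    return assignment, finish

def replay_trace(tasks, assignment, mem_latency):
    """Trace-driven replay: keep the recorded assignment, change only timing."""
    def cost(task_id):
        compute, refs = tasks[task_id]
        return compute + refs * mem_latency
    # Each processor runs its recorded tasks back to back.
    return max(sum(cost(t) for t in proc_tasks) for proc_tasks in assignment)

# Hypothetical workload: (compute_cycles, memory_refs) per task.
TASKS = [(10, 0), (2, 4), (6, 0), (1, 0)]

trace, t_fast = fcfs_schedule(TASKS, num_procs=2, mem_latency=1)
direct, t_slow = fcfs_schedule(TASKS, num_procs=2, mem_latency=3)
replayed = replay_trace(TASKS, trace, mem_latency=3)

print("assignment recorded at latency 1:", trace)
print("assignment from direct simulation at latency 3:", direct)
print("finish time, direct simulation at latency 3:", t_slow)
print("finish time, latency-1 trace replayed at latency 3:", replayed)
```

In this toy workload the higher latency delays the memory-bound task, so a different processor reaches the queue first and the task-to-processor assignment changes. The replayed trace keeps the old assignment and therefore predicts a different finish time than direct simulation. Replaying at the original latency reproduces the original result, which matches the observation that only timing-independent traces gave identical predictions.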

John L. Hennessy | Stephen R. Goldschmidt
