Paper information - The effects of thread placement on the KSR1

The effects of thread placement on the KSR1

This paper describes a measurement study of the effects of thread placement on memory access times on the Kendall Square KSR1 multiprocessor. The KSR1 provides a conventional shared memory programming model on a distributed memory architecture based on a ring of rings of 64-bit superscalar microprocessors. Memory consists of local cache memories attached to each processor and is managed in a cache-only memory architecture (COMA) fashion. Experiments run on the KSR1 across a variety of thread configurations show that shared memory access is accelerated by strategically placing threads that share data. The experiments "stress test" the automatic prefetching feature of the hardware. The paper proposes strategies that keep KSR1 memory access times nearly constant even as the number of participating threads increases.

Evgenia Smirni | Manish Madhukar | Amy W. Apon | Thomas D. Wagner | Lawrence W. Dowdy
