Abstract Latency measures the delay caused by communication between processors and memory modules over the network in a parallel system. Using intensive measurements and simulation, we show that network latency forms a major obstacle to improving parallel computing performance and scalability. We present an experimental metric, using network latency to measure and evaluate the scalability of parallel programs and architectures. This latency metric is an extension to the isoefficiency function [Grama et al., IEEE Parallel Distrib. Technology 1, 3 (1993), 12-21] and iso-speed metric [Sun and Rover, IEEE Trans. Parallel Distrib. Systems 5, 6 (1994), 599-613]. We give a measurement method for using this latency metric, and report the experimental results of evaluating the scalabilities of several scientific computing algorithms on the KSR-1 shared-memory architecture. Our analysis and experiments show that the latency metric is a practical method to effectively predict and evaluate scalability based on measured latencies inherent in the program and the architecture.
[1]
E T. Leighton,et al.
Introduction to parallel algorithms and architectures
,
1991
.
[2]
Xian-He Sun,et al.
Scalability of Parallel Algorithm-Machine Combinations
,
1994,
IEEE Trans. Parallel Distributed Syst..
[3]
Anant Agarwal,et al.
Directory-based cache coherence in large-scale multiprocessors
,
1990,
Computer.
[4]
Anoop Gupta,et al.
SPLASH: Stanford parallel applications for shared-memory
,
1992,
CARN.
[5]
Keqiang He,et al.
Execution behavior analysis and performance improvement in shared-memory architectures
,
1993,
Proceedings of 1993 5th IEEE Symposium on Parallel and Distributed Processing.
[6]
Anoop Gupta,et al.
Cache Invalidation Patterns in Shared-Memory Multiprocessors
,
1992,
IEEE Trans. Computers.