The degree to which high-speed vector processors approach their peak performance levels is closely tied to the amount of interference they encounter while accessing vectors in memory. In this paper we present an evaluation of a storage scheme that reduces the average memory access time in a vector-oriented architecture. A skewing scheme is used to map vector components into parallel memory modules such that, for most vector access patterns, the number of memory conflicts is reduced over that observed in interleaved parallel memory systems. Address and data buffers are used locally in each module so that transient nonuniformities which occur in some access patterns do not degrade performance. Previous investigations into skewing techniques have attempted to provide conflict-free access for a limited subset of access patterns. The goal of this investigation is different. The skewing scheme evaluated here does not eliminate all memory conflicts but it does improve the average performance of vector access over interleaved systems for a wide range of strides. It is shown that little extra hardware is required to implement the skewing scheme. Also, far fewer restrictions are placed on the number of memory modules in the system than are present in other proposed schemes.
[1]
Henry D. Shapiro,et al.
Theoretical Limitations on the Efficient Use of Parallel Memories
,
1978,
IEEE Transactions on Computers.
[2]
David Tennyson Harper.
Performance analysis of data skewing in parallel memories
,
1985
.
[3]
Duncan H. Lawrie,et al.
Access and Alignment of Data in an Array Processor
,
1975,
IEEE Transactions on Computers.
[4]
Jan van Leeuwen,et al.
The Structure of Periodic Storage Schemes for Parallel Memories
,
1985,
IEEE Transactions on Computers.
[5]
D. T. Harper,et al.
Performance evaluation of vector accesses in parallel memories using a skewed storage scheme
,
1986,
ISCA 1986.
[6]
I. Niven,et al.
An introduction to the theory of numbers
,
1961
.
[7]
Paul Budnik,et al.
The Organization and Use of Parallel Memories
,
1971,
IEEE Transactions on Computers.
[8]
Duncan H. Lawrie,et al.
The Prime Memory System for Array Access
,
1982,
IEEE Transactions on Computers.