The paper proposes a memory architecture called Wafer-scale Interconnected Memory Array (WIMA), which is intended to replace ultra-high-density monolithic DRAM ICs. This architecture employs the high-performance interconnects provided by multichip module technology, the cache-embedding concept, and prime degree interleaving to expose the DRAM's internal parallelism not exploitable by monolithic DRAMs. Using WIMA modules as the basic building blocks, a high-bandwidth, low latency, and low cost vector memory system is developed that supports parallelism among multiple vector access streams. To mask the long start-up latencies, vector memory accesses are architectured to be split-phased. To alleviate the performance impact of bank conflicts, prime degree memory interleaving is adopted. The major contribution of this work is the development of a novel indexing mechanism for prime degree interleaving, which takes at most two integer divisions for each logical vector memory access.<<ETX>>
[1]
David H. Bailey,et al.
Vector Computer Memory Bank Contention
,
1987,
IEEE Transactions on Computers.
[2]
David T. Harper,et al.
Conflict-Free Vector Access Using a Dynamic Storage Scheme
,
1991,
IEEE Trans. Computers.
[3]
Kimming So,et al.
Cache performance of vector processors
,
1988,
ISCA '88.
[4]
Hideto Niijima,et al.
QRAM-Quick access memory system
,
1990,
Proceedings., 1990 IEEE International Conference on Computer Design: VLSI in Computers and Processors.
[5]
David J. Kuck,et al.
The Burroughs Scientific Processor (BSP)
,
1982,
IEEE Transactions on Computers.