Modified SIMD architecture suitable for single-chip implementation
暂无分享,去创建一个
We describe a modified SIMD architecture suitable for single-chip integration of a large number of processing elements, such as 1,000 or more. Important differences from traditional SIMD designs are: a) The size of the memory per processing elements is kept small. b) The processors are organized into groups, each with a small buffer memory. Reduction operation over the groups is done in hardware. The first change allows us to integrate a very large number of processing elements into a single chip. The second change allows us to achieve a close-to-peak performance for many scientific applications like particle-based simulations and dense-matrix operations.
[1] Gianni Conte,et al. Massively Parallel Processor , 1994 .
[2] W. Daniel Hillis,et al. The connection machine , 1985 .
[3] R. Michael Hord,et al. The Illiac IV, the first supercomputer , 1982 .