论文信息 - Modified SIMD architecture suitable for single-chip implementation

Modified SIMD architecture suitable for single-chip implementation

We describe a modified SIMD architecture suitable for single-chip integration of a large number of processing elements, such as 1,000 or more. Important differences from traditional SIMD designs are: a) The size of the memory per processing elements is kept small. b) The processors are organized into groups, each with a small buffer memory. Reduction operation over the groups is done in hardware. The first change allows us to integrate a very large number of processing elements into a single chip. The second change allows us to achieve a close-to-peak performance for many scientific applications like particle-based simulations and dense-matrix operations.

Junichiro Makino

[1] Gianni Conte,et al. Massively Parallel Processor , 1994 .

[2] W. Daniel Hillis,et al. The connection machine , 1985 .

[3] R. Michael Hord,et al. The Illiac IV, the first supercomputer , 1982 .