In array radar signal processing applications, the processing demands range from tens of GFLOPS to several TFLOPS. To address this, as well as the, size and power dissipation issues, a special purpose “array signal processing” architecture is proposed. We argue that a combined MIMD-SIMD system can give flexibility, scalability, and programmability as well as high computing density. The MIMD system level, where SIMD modules are interconnected by a fiber-optic real-time network, provides the high level flexibility while the SIMD module level provides the compute density. In this paper we evaluate different design alternatives and show how the VEGA architecture was derived. By examining the applications and the algorithms used, the SIMD mesh processor is found be sufficient. However, the smaller the meshes are the better is the flexibility and efficiency. Then, based on prototype VLSI implementations and on instruction statistics, we find that a relatively large pipelined processing element maximises the performance per area. It is thereby concluded that the small SIMD mesh processor array with powerful processing elements is the best choice. These observations are further exploited in the design of the single-chip SIMD processor array to be included in the MIMD-style overall system. The system scales from 6.4 GFLOPS to several TFLOPS peak performance.
[1]
Bertil Svensson,et al.
Fiber-ribbon pipeline ring network for high-performance distributed computing systems
,
1997,
Proceedings of the 1997 International Symposium on Parallel Architectures, Algorithms and Networks (I-SPAN'97).
[2]
Bertil Svensson,et al.
A multiple SIMD approach to radar signal processing
,
1996,
Proceedings of Digital Processing Applications (TENCON '96).
[3]
C. Rader,et al.
VLSI systolic arrays for adaptive nulling
,
1996
.
[4]
J. G. McWhirter,et al.
Algorithmic engineering in adaptive signal processing
,
1992
.
[5]
Lars Bengtsson.
REMAP-γ: A Scalable SIMD VLSI Architecture with Hierarchical Control
,
1997
.
[6]
Bertil Svensson,et al.
A multiple SIMD mesh architecture for multi-channel radar processing
,
1996
.
[7]
Michael J. Flynn,et al.
Suggestions for implementing a fast IEEE multiply-add-fused instruction
,
1991
.
[8]
Michael J. Flynn,et al.
An Analysis of Division Algorithms and Implementations
,
1995
.
[9]
Bertil Svensson,et al.
Time-deterministic WDM star network for massively parallel computing in radar systems
,
1996,
Proceedings of Massively Parallel Processing Using Optical Interconnections.