Wavefront Array Processors-Concept to Implementation

Most signal and image processing algorithms can be decomposed into computational wavefronts that can be processed on pipelined arrays. T he supervisory overhead incurred in general-purpose supercom-T puters often makes them too slow and expensive for real-time signal and image processing. To achieve a through-put rate adequate for these applications, the only feasible alternative appears to be massively concurrent processing by special-purpose hardware-by array processors. Progress in VLSI technology has lowered implementation costs for large array processors to an acceptable level, and CAD techniques have facilitated speedy prototyping and implementation of application-oriented (or algorithm-oriented) array processors. Digital signal and image processing encompasses a variety of mathematical and algorithmic techniques. Most signal and image processing algorithms are dominated by transform techniques, con-volution and correlation filtering, and certain key linear algebraic methods. These algorithms possess properties such as regularity, recursiveness, and locality, and these properties can be exploited in array processor design. With VLSI it becomes feasible to construct an array processor that closely resembles the flow graph of a particular algorithm. This type of array maximizes the main strength of VLSI-intensive computing power-and yet circumvents its main weakness-restricted communication.

[1]  Charles L. Seitz,et al.  The cosmic cube , 1985, CACM.

[2]  Harold S. Stone,et al.  Parallel Processing with the Perfect Shuffle , 1971, IEEE Transactions on Computers.

[3]  H. T. Kung,et al.  Synchronizing LargeVLSI ProcessorArrays , 1985 .

[4]  H. T. Kung,et al.  Synchronizing Large VLSI Processor Arrays , 1985, IEEE Trans. Computers.

[5]  D. V. Bhaskar Rao,et al.  Wavefront Array Processor: Language, Architecture, and Applications , 1982, IEEE Transactions on Computers.

[6]  Rami Melhem Irregular Wavefronts in Data-Driven Data-Dependent Computations. , 1986 .

[7]  Jacques Lenfant,et al.  Parallel Permutations of Data: A Benes Network Control Algorithm for Frequently Used Permutations , 1978, IEEE Transactions on Computers.

[8]  Jack B. Dennis,et al.  Data Flow Supercomputers , 1980, Computer.

[9]  Lawrence Snyder,et al.  Introduction to the configurable, highly parallel computer , 1982, Computer.

[10]  J. G. McWhirter,et al.  A practical comparison of the systolic and wavefront array processing architectures , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  Tse-yun Feng,et al.  A Survey of Interconnection Networks , 1981, Computer.

[12]  Sun-Yuan Kung,et al.  Performance Analysis and Optimization of VLSI Dataflow Arrays , 1987, J. Parallel Distributed Comput..

[13]  Kai Hwang,et al.  Computer architecture and parallel processing , 1984, McGraw-Hill Series in computer organization and architecture.

[14]  P. Wilson OCCAM architecture eases system design. I , 1983 .