A systolic VSLI chip for implementing orthogonal transforms

The design of a systolic VLSI chip is described for the implementing of signal processing algorithms that may be decomposed into a product of simple real rotations. These include orthogonal transformations. Applications of this chip include projections, discrete Fourier and cosine transforms, and geometrical transformations. Large transforms may be computed by 'tilting' together many chips for increased throughput. A CMOS VLSI chip containing 138000 transistors in a 5*3 array of rotators has been designed, fabricated, and tested. The chip has a 32-MHz clock and performs real rotations at a rate of 30 MHz. The systolic nature of the chip makes use of a fully synchronous bit-serial interconnect and a very regular structure at the rotator and bit levels. A distributed arithmetic scheme is used to implement the matrix-vector multiplication of the rotation. >

[1]  Louis L. Scharf,et al.  Cellular architectures for implementing projection operators , 1987, IEEE Trans. Acoust. Speech Signal Process..

[2]  H. T. Kung Why systolic architectures? , 1982, Computer.

[3]  Richard F. Lyon,et al.  Two's Complement Pipeline Multipliers , 1976, IEEE Trans. Commun..

[4]  S. Smith,et al.  Efficient bit-serial complex multiplication and sum-of-products computation using distributed arithmetic , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  H. T. Kung,et al.  Wafer-scale integration and two-level pipelined implementations of systolic arrays , 1984, J. Parallel Distributed Comput..

[6]  Bede Liu,et al.  A new hardware realization of digital filters , 1974 .