Microelectronic architectures and devices for signal and symbol processing

Modern microelectronics technology holds the promise of high level, computer-aided design of very complex systems on a single silicon chip. The freedom to create non-standard architectures within this context has stimulated widespread interest in the development of computing structures that offer increased processing speed relative to the von Neumann architecture. This paper describes two architectures that are especially well suited for large scale integration because of their concurrent structure and their use of primarily local data flows. The first architecture is designed to implement the QR matrix decomposition and it can be used to reliably solve the least squares and eigenvalue problems of linear algebra. The second architecture is based on an elementary building block approach to the realization of FIR and IIR lattice digital filters. An especially important issue that is sometimes overlooked in the system level design of an architecture is the impact of the particular scheme for implementing the fundamental operations such as multiplication, rotation, etc., on the performance of the highly parallel computing structure. It is argued that if one can imbed pipelined operations within the concurrent computing structure, then often the resulting system will not only provide substantial processing gain but it can be implemented in such a way that efficient use of chip real estate is achieved.

[1]  K. J. Prost,et al.  CMOS/SOS serial-parallel multiplier , 1975 .

[2]  G.L. Baldwin,et al.  A modular, high-speed serial pipeline multiplier for digital signal processing , 1978, IEEE Journal of Solid-State Circuits.

[3]  Sun-Yuan Kung VLSI Array Processor for Signal Processing. , 1982 .

[4]  Noel R. Strader VLSI bit-sequential architectures for digital signal processing , 1983, ICASSP.

[5]  J. S. Walther,et al.  A unified algorithm for elementary functions , 1899, AFIPS '71 (Spring).

[6]  Sun-Yuan Kung,et al.  Highly parallel architectures for solving linear equations , 1981, ICASSP.

[7]  Franklin T. Luk,et al.  A Systolic Architecture for the Singular Value Decomposition , 1982 .

[8]  A. Gray,et al.  A normalized digital filter structure , 1975 .

[9]  I-Ngo Chen,et al.  An 0(n) Parallel Multiplier with Bit-Sequential Input and Output , 1979, IEEE Transactions on Computers.

[10]  A. Gray,et al.  Digital lattice and ladder filter synthesis , 1973 .

[11]  Franklin T. Luk,et al.  Systolic Array Computation Of The Singular Value Decomposition , 1982, Other Conferences.

[12]  L. Johnsson A Computational Array for the QR-Method , 1982 .

[13]  Thompson Fourier Transforms in VLSI , 1983, IEEE Transactions on Computers.

[14]  Noel R. Strader,et al.  Concurrent Very Large Scale Integration (VLSI) Structure For Digital Signal Processing , 1982, Other Conferences.

[15]  Bruce A. Wooley,et al.  A Two's Complement Parallel Array Multiplication Algorithm , 1973, IEEE Transactions on Computers.

[16]  David J. Kuck,et al.  On Stable Parallel Linear System Solvers , 1978, JACM.

[17]  D. W. Tufts,et al.  Parallel Implementation of Likelihood-Based Estimation and Detection, , 1983 .

[18]  J. J. Symanski,et al.  Systolic Array Processor Implementation , 1982, Optics & Photonics.

[19]  Christopher S. Wallace,et al.  A Suggestion for a Fast Multiplier , 1964, IEEE Trans. Electron. Comput..

[20]  V. Thomas Rhyne,et al.  A Canonical Bit-Sequential Multiplier , 1982, IEEE Transactions on Computers.

[21]  M. Morf,et al.  A VLSI speech analysis chip set based on square root normalized ladder forms , 1981, ICASSP.

[22]  Franklin T. Luk,et al.  A Systolic Architecture for Almost Linear-Time Solution of the Symmetric Eigenvalue Problem , 1982 .

[23]  Sun-Yuan Kung,et al.  WAVEFRONT ARRAY PROCESSOR: ARCHITECTURE, LANGUAGE AND APPLICATIONS. , 1982 .

[24]  Stylianos D. Pezaris A 40-ns 17-Bit by 17-Bit Array Multiplier , 1971, IEEE Transactions on Computers.

[25]  L. Ljung,et al.  New inversion formulas for matrices classified in terms of their distance from Toeplitz matrices , 1979 .

[26]  Azriel Rosenfeld Parallel String Parsing Using Lattice Graphs. , 1981 .

[27]  Robert Schreiber,et al.  Systolic Arrays For Eigenvalue Computation , 1982, Other Conferences.

[28]  H. T. Kung Why systolic architectures? , 1982, Computer.

[29]  Gerald Jay Sussman,et al.  Storage Management in a LISP-based Microprocessor , 1979 .

[30]  D. V. Bhaskar Rao,et al.  Wavefront Array Processor: Language, Architecture, and Applications , 1982, IEEE Transactions on Computers.

[31]  H. T. Kung Let's Design Algorithms for VLSI Systems , 1979 .

[32]  Sun-Yuan Kung,et al.  A highly concurrent algorithm and pipeleined architecture for solving Toeplitz systems , 1983 .

[33]  Jean-Marc Delosme,et al.  Highly concurrent computing structures for matrix arithmetic and signal processing , 1982, Computer.

[34]  Richard F. Lyon,et al.  Two's Complement Pipeline Multipliers , 1976, IEEE Trans. Commun..

[35]  D. Heller A Survey of Parallel Algorithms in Numerical Linear Algebra. , 1978 .

[36]  H. T. Kung,et al.  Matrix Triangularization By Systolic Arrays , 1982, Optics & Photonics.