A reduced hardware general purpose systolic array design

Although there may be many general purpose systolic array design and implementation which could be used to configure some special systolic structure, the degree of their hardware complexity and flexibility is a main question. In this paper we have designed a new systolic array system to realize a range of various algorithms without increasing the hardware structure significantly. The proposed concepts that have been used are: 1) using a novel architecture to allow any number of desired time delays along the data path; 2) dividing instructions and data into two formats-tagged and untagged formats; and 3) suitable processing element (PE) architecture design.

[1]  S. Kung,et al.  VLSI Array processors , 1985, IEEE ASSP Magazine.

[2]  Christopher S. Wallace,et al.  A Suggestion for a Fast Multiplier , 1964, IEEE Trans. Electron. Comput..

[3]  James Sutton,et al.  iWarp: a 100-MOPS, LIW microprocessor for multicomputers , 1991, IEEE Micro.

[4]  H. T. Kung,et al.  The Warp Computer: Architecture, Implementation, and Performance , 1987, IEEE Transactions on Computers.

[5]  Kai Hwang,et al.  Computer arithmetic: Principles, architecture, and design , 1979 .

[6]  John V. McCanny,et al.  Implementation of signal processing functions using 1-bit systolic arrays , 1982 .

[7]  Robert Schreiber The Saxpy-1M: Architecture and Algorithms , 1986, Photonics West - Lasers and Applications in Science and Engineering.