Behavioral synthesis of high performance, low cost, and low power application specific processors for linear computations

Throughput has been widely traditionally recognized as the most popular performance metric for implementation of application specific computations. However, increasingly applications such as embedded controllers impose constraints on both throughput and latency as important metrics of speed. Although throughput alone can be arbitrarily improved for several classes of systems using previously published techniques, none of those approaches are effective when latency constraints are considered. DSP, communications, and control systems are often either linear, or have subsystems that are linear. Recently an optimal technique for simultaneous optimization of throughput and latency of linear computations was introduced by M.B. Srivastava and M. Potkonjak (1994). However, in many cases this technique introduces significant area overhead. We apply certain key aspects of that technique (on-arrival-processing and maximally fast implementation of linear computations) with exploration of state-space based transformations to develop four synthesis techniques which generate high throughput, low latency, low area, and low power application specific processors for the special case of single input linear computations. The new transformation techniques can also be used to increase the implementation efficiency while achieving the same latency and throughput as the original design-we obtained large improvements in area and power on many benchmarks when using the proposed transformations in this alternate role.<<ETX>>

[1]  M. Potkonjak,et al.  Maximally fast and arbitrarily fast implementation of linear computations (circuit layout CAD) , 1992, 1992 IEEE/ACM International Conference on Computer-Aided Design.

[2]  Miodrag Potkonjak,et al.  Transforming linear systems for joint latency and throughput optimization , 1994, Proceedings of European Design and Test Conference EDAC-ETC-EUROASIC.

[3]  Abhijit Chatterjee,et al.  An Architectural Transformation Program for Optimization of Digital Systems by Multi-Level Decomposition , 1993, 30th ACM/IEEE Design Automation Conference.

[4]  Miodrag Potkonjak,et al.  HYPER-LP: a system for power minimization using architectural transformations , 1992, ICCAD 1992.

[5]  Keshab K. Parhi,et al.  Pipeline interleaving and parallelism in recursive digital filters. I. Pipelining using scattered look-ahead and decomposition , 1989, IEEE Trans. Acoust. Speech Signal Process..

[6]  Keshab K. Parhi,et al.  Algorithm transformation techniques for concurrent processors , 1989, Proc. IEEE.

[7]  Gerhard Fettweis,et al.  Algorithm transformations for unlimited parallelism , 1990, IEEE International Symposium on Circuits and Systems.

[8]  Miodrag Potkonjak,et al.  Fast prototyping of datapath-intensive architectures , 1991, IEEE Design & Test of Computers.

[9]  Keshab K. Parhi,et al.  Pipeline interleaving and parallelism in recursive digital filters. II. Pipelined incremental block filtering , 1989, IEEE Trans. Acoust. Speech Signal Process..

[10]  John A. Stankovic,et al.  Real-time computing systems: the next generation , 1988 .

[11]  Miodrag Potkonjak,et al.  Maximally fast and arbitrarily fast implementation of linear computations , 1992, ICCAD '92.