Performance Optimization Using Variable-Latency Design Style

In many designs, the worst-case delay of a critical path may be activated infrequently. Traditional optimization approaches assume the worst-case conditions, which could lead to an inefficient resource usage. It is possible to improve the throughput of such designs by introducing variable latency. One existing realization of the variable-latency design style is based on telescopic units. The design of the hold logic in telescopic units influences the circuit's throughput. In this paper, we show that the traditionally designed hold logic may be inaccurate. We use the short path activation conditions to obtain more accurate hold logic and improve the efficiency of telescopic units. To reduce the overhead for large circuits, we propose an efficient heuristic methodology of constructing non-exact hold logic. We also discuss how to choose the telescopic unit's timing constraint. On average, our approach achieves the performance gain of 21.67% compared to 13.99%, reported in the previous work.

[1]  Steven M. Nowick Design of a low-latency asynchronous adder using speculative completion , 1996 .

[2]  Jordi Cortadella,et al.  Speculation in Elastic Systems , 2009, 2009 46th ACM/IEEE Design Automation Conference.

[3]  Shih-Chieh Chang,et al.  Efficient Calculation of Timed Cumulative Probability Density Function , 2007, 2007 International Symposium on VLSI Design, Automation and Test (VLSI-DAT).

[4]  Michael J. Flynn,et al.  Design Issues in Division and Other Floating-Point Operations , 1997, IEEE Trans. Computers.

[5]  Luca Benini,et al.  Automatic Synthesis of Large Telescopic Units Based on Near-Minimum Timed Supersetting , 1999, IEEE Trans. Computers.

[6]  Jordi Cortadella,et al.  Synthesis of synchronous elastic architectures , 2006, 2006 43rd ACM/IEEE Design Automation Conference.

[7]  Josep Carmona,et al.  Elastic Circuits , 2009, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[8]  Hiroshi Nakamura,et al.  Performance optimization of synchronous control units for datapaths with variable delay arithmetic units , 2003, ASP-DAC '03.

[9]  Kaushik Roy,et al.  CRISTA: A New Paradigm for Low-Power, Variation-Tolerant, and Adaptive Circuit Synthesis Using Critical Path Isolation , 2007, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[10]  David Blaauw,et al.  Opportunities and challenges for better than worst-case design , 2005, ASP-DAC.

[11]  Giovanni De Micheli,et al.  Synthesis and Optimization of Digital Circuits , 1994 .

[12]  Trevor Mudge,et al.  Razor: a low-power pipeline based on circuit-level timing speculation , 2003, Proceedings. 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003. MICRO-36..

[13]  Mauro Olivieri,et al.  Design of synchronous and asynchronous variable-latency pipelined multipliers , 2001, IEEE Trans. Very Large Scale Integr. Syst..

[14]  Michael J. Flynn,et al.  A Variable Latency Pipelined Floating-Point Adder , 1996, Euro-Par, Vol. II.

[15]  Luca Benini,et al.  Efficient controller design for telescopic units , 1997, 1997 Proceedings Second Annual IEEE International Conference on Innovative Systems in Silicon.

[16]  R. G. Buschman,et al.  Calculus with analytic geometry , 1962 .

[17]  Luca Benini,et al.  Telescopic units: a new paradigm for performance optimization of VLSI designs , 1998, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..