In this paper, we propose a RT level power reduction scheme which can be used for the applications that require ultra-low power consumption. A novel wasting-toggle-rate (WTR) based clock power reduction technique is introduced. It compares the WTR of a stimulus with the pre-computed threshold WTR of the circuit, and clock gating is applied to the circuit only if there is power-saving benefit. It considers not only clock enable signal, but also data signal and wasting toggle rate of the clock in a fine-grained manner. The proposed technique is implemented at RT level in a complete CAD solution. We have tested the proposed technique on real industrial multimedia-mobile-processor design. For the accuracy of the power optimization results, all the power estimation results are measured at gate level after synthesis by using industrial 65 and 90 nanometer technology libraries. The experimental results show that using 65 nanometer technology, the technique reduces 35.46% power comparing with non-clock gating design and 18.80% power comparing with clock-gating design by Power Compiler; and that using 90 nanometer technology, the technique reduces average 39.34% total power comparing with non-clock gating design and 27.76% total power comparing with clock-gating design by Power Compiler. For the design overhead of the proposed technique, it increases 1.17% of the critical path delay and reduces 0.81% of the area for the whole circuit comparing with the original circuit in 65 nanometer technology, and increases 2.75% of the critical path delay and reduces 0.75% area for whole circuit in 90 nanometer technology.
[1]
Sujit Dey,et al.
High-Level Power Analysis and Optimization
,
1997
.
[2]
T. Watanabe,et al.
An Adaptive Width Data Cache for low power design
,
2009,
2009 International SoC Design Conference (ISOCC).
[3]
Marios C. Papaefthymiou,et al.
Precomputation-based sequential logic optimization for low power
,
1994,
IEEE Trans. Very Large Scale Integr. Syst..
[4]
Luca Benini,et al.
A scalable algorithm for RTL insertion of gated clocks based on ODCs computation
,
2005,
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.
[5]
Luca Benini,et al.
Automatic synthesis of low-power gated-clock finite-state machines
,
1996,
IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..
[6]
Farid N. Najm,et al.
A survey of power estimation techniques in VLSI circuits
,
1994,
IEEE Trans. Very Large Scale Integr. Syst..
[7]
Kurt Keutzer,et al.
Estimation of average switching activity in combinational and sequential circuits
,
1992,
[1992] Proceedings 29th ACM/IEEE Design Automation Conference.
[8]
Gila Kamhi,et al.
A new paradigm for synthesis and propagation of clock gating conditions
,
2008,
2008 45th ACM/IEEE Design Automation Conference.