Efficient variability analysis of arithmetic units using linear regression techniques

The performance of modern digital signal processing (DSP) systems is inherently affected by the variability tolerance of their main arithmetic units. As CMOS technology approaches nanometer scales, numerous threats for the reliability of DSP designs emerge. A large portion of these phenomena are related to threshold voltage $$V_{th}$$Vth variations, resulting in timing failures due to an overall increase of the arithmetic unit delay. In this work, we employ linear regression techniques to accelerate transistor variability estimation using static timing analysis (STA) tools. By identifying the variation-critical part of an arithmetic circuit, we reduce the transistor inventory that needs to be tracked by the STA solver. We substantiate the efficiency of the proposed framework for realistic designs. For the main logic blocks of the modulo $$2^n-1$$2n-1 add–multiply (AM) operation, we capture the variability-induced degradation of the Modified Booth (MB) encoding stage with negligible accuracy losses. We also exploit our methodology as a useful aid for variability-aware design techniques and we investigate the variability-tolerance of novel MB recoding schemes against conventional designs.

[1]  Antonio Rubio,et al.  Circuit propagation delay estimation through multivariate regression-based modeling under spatio-temporal variability , 2010, 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010).

[2]  Murat R. Becer,et al.  Transistor level gate modeling for accurate and fast timing, noise, and power analysis , 2008, 2008 45th ACM/IEEE Design Automation Conference.

[3]  C. Efstathiou,et al.  Modified Booth modulo 2/sup n/-1 multipliers , 2004, IEEE Transactions on Computers.

[4]  Yu Cao,et al.  The Impact of NBTI Effect on Combinational Circuit: Modeling, Simulation, and Analysis , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[5]  Marcel Pelgrom,et al.  Matching properties of MOS transistors , 1988 .

[6]  Reto Zimmermann,et al.  Efficient VLSI implementation of modulo (2/sup n//spl plusmn/1) addition and multiplication , 1999, Proceedings 14th IEEE Symposium on Computer Arithmetic (Cat. No.99CB36336).

[7]  Vassilis Paliouras,et al.  Delay-variation-tolerant FIR filter architectures based on the Residue Number System , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[8]  Sarma B. K. Vrudhula,et al.  Statistical waveform and current source based standard cell models for accurate timing analysis , 2008, 2008 45th ACM/IEEE Design Automation Conference.

[9]  Francky Catthoor,et al.  Implications of BTI-Induced Time-Dependent Statistics on Yield Estimation of Digital Circuits , 2014, IEEE Transactions on Electron Devices.

[10]  Dimitrios Soudris,et al.  Understanding timing impact of BTI/RTN with massively threaded atomistic transient simulations , 2014, 2014 IEEE International Conference on IC Design & Technology.

[11]  Jinfeng Liu,et al.  A fast approach for static timing analysis covering all PVT corners , 2011, 2011 48th ACM/EDAC/IEEE Design Automation Conference (DAC).

[12]  Gian Carlo Cardarilli,et al.  Low-power adaptive filter based on RNS components , 2007, 2007 IEEE International Symposium on Circuits and Systems.

[13]  T. Stouraitis,et al.  A new approach to elliptic curve cryptography: an RNS architecture , 2006, MELECON 2006 - 2006 IEEE Mediterranean Electrotechnical Conference.

[14]  James D. Meindl,et al.  Impact of die-to-die and within-die parameter fluctuations on the maximum clock frequency distribution for gigascale integration , 2002, IEEE J. Solid State Circuits.

[15]  Dhiraj K. Pradhan,et al.  Statistical Blockade Method for Fast Robustness Estimation and Compensation of Nano-CMOS Arithmetic Circuits , 2011, 2011 International Symposium on Electronic System Design.

[16]  Vassilis Paliouras,et al.  Residue arithmetic bases for reducing delay variation , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[17]  Francky Catthoor,et al.  Efficient Reliability Analysis of Processor Datapath using Atomistic BTI Variability Models , 2015, ACM Great Lakes Symposium on VLSI.

[18]  Youngjun Kim,et al.  Design of a combined processor containing a 32-bit RISC microprocessor and a 16-bit fixed-point DSP on a chip , 1999, ICVC '99. 6th International Conference on VLSI and CAD (Cat. No.99EX361).

[19]  Dimitrios Soudris,et al.  Linear regression techniques for efficient analysis of transistor variability , 2014, 2014 21st IEEE International Conference on Electronics, Circuits and Systems (ICECS).

[20]  T. Grasser,et al.  The statistical analysis of individual defects constituting NBTI and its implications for modeling DC- and AC-stress , 2010, 2010 IEEE International Reliability Physics Symposium.

[21]  Javier D. Bruguera,et al.  Implementation of the FFT butterfly with redundant arithmetic , 1996 .

[22]  V. Paliouras,et al.  Residue arithmetic for designing multiply-add units in the presence of non-gaussian variation , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[23]  Kiamal Z. Pekmestzi,et al.  Fused modulo 2n − 1 add-multiply unit , 2014, 2014 21st IEEE International Conference on Electronics, Circuits and Systems (ICECS).

[24]  Kiamal Z. Pekmestzi,et al.  An Optimized Modified Booth Recoder for Efficient Design of the Add-Multiply Operator , 2014, IEEE Transactions on Circuits and Systems I: Regular Papers.

[25]  Reto Zimmermann,et al.  Optimized synthesis of sum-of-products , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[26]  Ricardo Chaves,et al.  RDSP: a RISC DSP based on residue number system , 2003, Euromicro Symposium on Digital System Design, 2003. Proceedings..

[27]  A. Lloris,et al.  RNS-based implementation of 8 /spl times/ 8 point 2D-DCT over field-programmable devices [image compression] , 2003 .

[28]  Chein-Wei Jen,et al.  High-speed and low-power split-radix FFT , 2003, IEEE Trans. Signal Process..

[29]  C. Efstathiou,et al.  Modified Booth 1's complement and modulo 2/sup n/-1 multipliers , 2000, ICECS 2000. 7th IEEE International Conference on Electronics, Circuits and Systems (Cat. No.00EX445).

[30]  Qin Tang,et al.  Statistical Transistor-Level Timing Analysis Using a Direct Random Differential Equation Solver , 2014, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[31]  Francky Catthoor,et al.  Atomistic Pseudo-Transient BTI Simulation With Inherent Workload Memory , 2014, IEEE Transactions on Device and Materials Reliability.

[32]  Chip-Hong Chang,et al.  Area-Power Efficient Modulo 2n-1 and Modulo 2n+1 Multipliers for {2n-1, 2n, 2n+1} Based RNS , 2012, IEEE Trans. Circuits Syst. I Regul. Pap..