Design Methodology to Explore Hybrid Approximate Adders for Energy-Efficient Image and Video Processing Accelerators

This paper proposes a new design methodology to explore the state-of-the-art approximate adders for accelerator architectures conceived in the realm of multiplier-less multiple constant multiplication optimization problem. The proposed methodology is composed of: 1) a search heuristic to seek faster and feasible approximate configurations for the architectures under evaluation; 2) low-power techniques regarding hybrid approximate adders design for accelerators based on trees of shift-and-add operations; 3) high-performance evaluation by exploring parallel prefix adders and low power analysis through the use of the adder optimized by a commercial synthesis tool in the precise part of the approximate adders; and 4) energy efficiency analysis by considering both the approximate techniques and voltage over scaling estimation. Furthermore, improvements are proposed for the state-of-the-art approximate adders under evaluation in this paper. Two case studies are considered to assess the proposed methodology: 1) Gaussian image filter and 2) Sobel operator. The precise and approximate image filters were described in very high-speed integrated circuits hardware description language regarding the proposed methodology. Results are shown after synthesis to a 45-nm standard cell-based technology, where energy reductions ranging from 7.7% up to 73.2% were experienced for multiple levels of quality considering the applications under analysis.

[1]  Kaushik Roy,et al.  Approximate computing and the quest for computing efficiency , 2015, 2015 52nd ACM/EDAC/IEEE Design Automation Conference (DAC).

[2]  Sergio Bampi,et al.  Exploiting approximate adder circuits for power-efficient Gaussian and Gradient filters for Canny edge detector algorithm , 2016, 2016 IEEE 7th Latin American Symposium on Circuits & Systems (LASCAS).

[3]  Zhi-Hui Kong,et al.  Design of Low-Power High-Speed Truncation-Error-Tolerant Adder and Its Application in Digital Signal Processing , 2010, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[4]  Caro Lucas,et al.  Bio-Inspired Imprecise Computational Blocks for Efficient VLSI Implementation of Soft-Computing Applications , 2010, IEEE Transactions on Circuits and Systems I: Regular Papers.

[5]  Alberto Garcia-Ortiz,et al.  Coherent Design of Hybrid Approximate Adders: Unified Design Framework and Metrics , 2018, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[6]  Eduardo A. C. da Costa,et al.  Exploring the use of parallel prefix adder topologies into approximate adder circuits , 2017, 2017 24th IEEE International Conference on Electronics, Circuits and Systems (ICECS).

[7]  Kaushik Roy,et al.  Low-Power Digital Signal Processing Using Approximate Adders , 2013, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[8]  Heba Khdr,et al.  New trends in dark silicon , 2015, 2015 52nd ACM/EDAC/IEEE Design Automation Conference (DAC).

[9]  Jongsun Park,et al.  Energy Efficient Canny Edge Detector for Advanced Mobile Vision Applications , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Tack-Don Han,et al.  Fast area-efficient VLSI adders , 1987, 1987 IEEE 8th Symposium on Computer Arithmetic (ARITH).

[11]  Semeen Rehman,et al.  Architectural-space exploration of approximate multipliers , 2016, 2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[12]  Jack Sklansky,et al.  Conditional-Sum Addition Logic , 1960, IRE Trans. Electron. Comput..

[13]  Muhammad Shafique,et al.  The EDA challenges in the dark silicon era , 2014, 2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC).

[14]  Andrew B. Kahng,et al.  Accuracy-configurable adder for approximate arithmetic designs , 2012, DAC Design Automation Conference 2012.

[15]  Robert H. Dennard Past Progress and Future Challenges in LSI Technology: From DRAM and Scaling to Ultra-Low-Power CMOS , 2015, IEEE Solid-State Circuits Magazine.

[16]  Rakesh Kumar,et al.  On reconfiguration-oriented approximate adder design and its application , 2013, 2013 IEEE/ACM International Conference on Computer-Aided Design (ICCAD).

[17]  Davide De Caro,et al.  Variable Latency Speculative Parallel Prefix Adders for Unsigned and Signed Operands , 2016, IEEE Transactions on Circuits and Systems I: Regular Papers.

[18]  Muhammad Shafique,et al.  A low latency generic accuracy configurable adder , 2015, 2015 52nd ACM/EDAC/IEEE Design Automation Conference (DAC).

[19]  H. T. Kung,et al.  A Regular Layout for Parallel Adders , 1982, IEEE Transactions on Computers.

[20]  Jason Cong,et al.  Accelerator-rich architectures: Opportunities and progresses , 2014, 2014 51st ACM/EDAC/IEEE Design Automation Conference (DAC).

[21]  Harold S. Stone,et al.  A Parallel Algorithm for the Efficient Solution of a General Class of Recurrence Equations , 1973, IEEE Transactions on Computers.

[22]  Levent Aksoy,et al.  Optimization of Area and Delay at Gate-Level in Multiple Constant Multiplications , 2010, 2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools.

[23]  Qiang Xu,et al.  Approximate Computing: A Survey , 2016, IEEE Design & Test.

[24]  Christoforos E. Kozyrakis,et al.  Understanding sources of inefficiency in general-purpose chips , 2010, ISCA.

[25]  Markus Püschel,et al.  Multiplierless multiple constant multiplication , 2007, TALG.

[26]  Paolo Ienne,et al.  Variable Latency Speculative Addition: A New Paradigm for Arithmetic Circuit Design , 2008, 2008 Design, Automation and Test in Europe.

[27]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Cheng-Chew Lim,et al.  Parallel prefix adder design , 2001, Proceedings 15th IEEE Symposium on Computer Arithmetic. ARITH-15 2001.

[29]  Jie Han,et al.  Approximate computing: An emerging paradigm for energy-efficient design , 2013, 2013 18th IEEE European Test Symposium (ETS).

[30]  Seokhyeong Kang,et al.  Novel approximate synthesis flow for energy-efficient FIR filter , 2016, 2016 IEEE 34th International Conference on Computer Design (ICCD).

[31]  Sergio Bampi,et al.  Exploring power-performance-quality tradeoff of approximate adders for energy efficient sobel filtering , 2018, 2018 IEEE 9th Latin American Symposium on Circuits & Systems (LASCAS).

[32]  Gang Wang,et al.  Enhanced low-power high-speed adder for error-tolerant application , 2009, 2010 International SoC Design Conference.

[33]  Ravi Iyer Accelerator-rich architectures: Implications, opportunities and challenges , 2012, 17th Asia and South Pacific Design Automation Conference.