Power and area efficient CORDIC-Based DCT using direct realization of decomposed matrix

Abstract Discrete Cosine Transform (DCT) is one of the most fundamental yet complicated transforms in image and signal processing. In spite of the diverse applications of DCT, its implementation is still considered as a great difficulty due to high computational complexity and large power consumption. In this paper, a novel CORDIC-based DCT architecture is developed which remarkably reduces power consumption, area overhead, and hardware complexity. The key idea behind the proposed architecture is that the fundamental data path components of DCT are individually modified with incremental advance on butterfly block, CORDIC array, and controller unit by means of matrix decomposition, resource sharing and merging techniques, removal of pre-processing step, extending CORDIC idle time, and deploying the Low-power Lookahead (LPLA) CORDIC structure. Also, in the proposed controller unit, switching activity is reduced by changing the generation order of the butterfly outputs. In addition, the proposed structure has regular, scalable, and modular architecture which makes it a suitable candidate for hardware implementation.

[1]  Somayeh Timarchi,et al.  Area–Time–Power Efficient FFT Architectures Based on Binary-Signed-Digit CORDIC , 2019, IEEE Transactions on Circuits and Systems I: Regular Papers.

[2]  Alan C. Bovik,et al.  . Efficient DCT-domain blind measurement and reduction of blocking artifacts , 2002, IEEE Trans. Circuits Syst. Video Technol..

[4]  Jianfeng Zhang,et al.  A Novel Low-Power and High-PSNR Architecture Based on ARC for DCT/IDCT , 2016, NCCET.

[5]  Wen-Hsiung Chen,et al.  A Fast Computational Algorithm for the Discrete Cosine Transform , 1977, IEEE Trans. Commun..

[6]  Somayeh Timarchi,et al.  Novel algorithm and architectures for high-speed low-power ConText-based steganography , 2017, 2017 19th International Symposium on Computer Architecture and Digital Systems (CADS).

[7]  Jack E. Volder The CORDIC Trigonometric Computing Technique , 1959, IRE Trans. Electron. Comput..

[9]  Liyi Xiao,et al.  CORDIC Based Fast Radix-2 DCT Algorithm , 2013, IEEE Signal Processing Letters.

[10]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[11]  Somayeh Timarchi,et al.  Low-Power and Fast Full Adder by Exploring New XOR and XNOR Gates , 2018, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[12]  Paul Chow,et al.  FPGA implementation of low-power and high-PSNR DCT/IDCT architecture based on adaptive recoding CORDIC , 2015, 2015 International Conference on Field Programmable Technology (FPT).

[13]  Liyi Xiao,et al.  CORDIC based fast algorithm for power-of-two point DCT and its efficient VLSI implementation , 2014, Microelectron. J..

[14]  Trong-Thuc Hoang,et al.  High-speed 8/16/32-point DCT Architecture Using Fixed-rotation Adaptive CORDIC , 2018, 2018 IEEE International Symposium on Circuits and Systems (ISCAS).

[15]  Touradj Ebrahimi,et al.  The JPEG 2000 still image compression standard , 2001, IEEE Signal Process. Mag..

[16]  S.A. White,et al.  Applications of distributed arithmetic to digital signal processing: a tutorial review , 1989, IEEE ASSP Magazine.

[17]  Hai Huang,et al.  A novel CORDIC based unified architecture for DCT and IDCT , 2012, 2012 International Conference on Optoelectronics and Microelectronics.

[18]  B. Lee A new algorithm to compute the discrete cosine Transform , 1984 .

[19]  Shanq-Jang Ruan,et al.  Low-power and high-quality Cordic-based Loeffler DCT for signal processing , 2007, IET Circuits Devices Syst..

[20]  G.S. Moschytz,et al.  Practical fast 1-D DCT algorithms with 11 multiplications , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[21]  Fabrizio Lombardi,et al.  Algorithm and Design of a Fully Parallel Approximate Coordinate Rotation Digital Computer (CORDIC) , 2017, IEEE Transactions on Multi-Scale Computing Systems.

[22]  Somayeh Timarchi,et al.  Low-power DCT-based compressor for wireless capsule endoscopy , 2017, Signal Process. Image Commun..

[23]  Dong Sam Ha,et al.  On the low-power design of DCT and IDCT for low bit-rate video codecs , 2001, Proceedings 14th Annual IEEE International ASIC/SOC Conference (IEEE Cat. No.01TH8558).

[24]  Trong-Thuc Hoang,et al.  High-performance DCT architecture based on angle recoding CORDIC and Scale-Free Factor , 2016, 2016 IEEE Sixth International Conference on Communications and Electronics (ICCE).

[25]  Zhongde Wang Fast algorithms for the discrete W transform and for the discrete Fourier transform , 1984 .