A digital hardware fast algorithm and FPGA-based prototype for a novel 16-point approximate DCT for image compression applications

The discrete cosine transform (DCT) is the key step in many image and video coding standards. The eight-point DCT is an important special case, possessing several low-complexity approximations widely investigated. However, the 16-point DCT transform has energy compaction advantages. In this sense, this paper presents a new 16-point DCT approximation with null multiplicative complexity. The proposed transform matrix is orthogonal and contains only zeros and ones. The proposed transform outperforms the well-known Walsh?Hadamard transform and the current state-of-the-art 16-point approximation. A fast algorithm for the proposed transform is also introduced. This fast algorithm is experimentally validated using hardware implementations that are physically realized and verified on a 40?nm CMOS Xilinx Virtex-6 XC6VLX240T FPGA chip for a maximum clock rate of 342?MHz. Rapid prototypes on FPGA for a 8-bit input word size show significant improvement in compressed image quality by up to 1?2?dB at the cost of only eight adders compared to the state-of-art 16-point DCT approximation algorithm in the literature (Bouguezel et al 2010 Proc. 53rd IEEE Int. Midwest Symp. on Circuits and Systems).

[1]  A NikaraJari,et al.  Discrete cosine and sine transforms , 2006 .

[2]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[3]  R. Cintra,et al.  Image Compression via a Fast DCT Approximation , 2010, IEEE Latin America Transactions.

[4]  Allan O. Steinhardt,et al.  Fast algorithms for digital signal processing , 1986, Proceedings of the IEEE.

[5]  Muhammad Shafique,et al.  Hardware/Software Architectures for Low-Power Embedded Multimedia Systems , 2011 .

[6]  Wilhelm Burger,et al.  Digital Image Processing - An Algorithmic Introduction using Java , 2008, Texts in Computer Science.

[7]  Tarek I. Haweel,et al.  A new square wave transform based on the DCT , 2001, Signal Process..

[8]  Jorg Henkel,et al.  Background and Related Work , 2016 .

[9]  Source Poznań,et al.  INTERNATIONAL ORGANISATION FOR STANDARDISATION ORGANISATION INTERNATIONALE DE NORMALISATION ISO/IEC JTC1/SC29/WG11 CODING OF MOVING PICTURES AND AUDIO , 2008 .

[10]  Masaaki Ikehara,et al.  Integer DCT Based on Direct-Lifting of DCT-IDCT for Lossless-to-Lossy Image Coding , 2010, IEEE Transactions on Image Processing.

[11]  Steven Kay,et al.  Fundamentals Of Statistical Signal Processing , 2001 .

[12]  G.S. Moschytz,et al.  Practical fast 1-D DCT algorithms with 11 multiplications , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[13]  S. Kay Fundamentals of statistical signal processing: estimation theory , 1993 .

[14]  Fábio M. Bayer,et al.  A DCT Approximation for Image Compression , 2011, IEEE Signal Processing Letters.

[15]  Jianqin Zhou,et al.  On discrete cosine transform , 2011, ArXiv.

[16]  G.G. Langdon,et al.  Data compression , 1988, IEEE Potentials.

[17]  Hsieh S. Hou A fast recursive algorithm for computing the discrete cosine transform , 1987, IEEE Trans. Acoust. Speech Signal Process..

[18]  M. Omair Ahmad,et al.  A low-complexity parametric transform for image compression , 2011, 2011 IEEE International Symposium of Circuits and Systems (ISCAS).

[19]  M.N.S. Swamy,et al.  A fast 8×8 transform for image compression , 2009 .

[20]  N. Cho,et al.  Fast algorithm and implementation of 2-D discrete cosine transform , 1991 .

[21]  M. Swamy,et al.  A novel transform for image compression , 2010, 2010 53rd IEEE International Midwest Symposium on Circuits and Systems.

[22]  Henrique S. Malvar,et al.  Low-complexity transform and quantization in H.264/AVC , 2003, IEEE Trans. Circuits Syst. Video Technol..

[23]  Naoki Suehiro,et al.  Fast algorithms for the DFT and other sinusoidal transforms , 1986, IEEE Trans. Acoust. Speech Signal Process..

[24]  Sergio Saponara Real-time and low-power processing of 3D direct/inverse discrete cosine transform for low-complexity video codec , 2010, Journal of Real-Time Image Processing.

[25]  Konstantinos Konstantinides,et al.  Image and video compression standards , 1995 .

[26]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[27]  A. Bovik,et al.  A universal image quality index , 2002, IEEE Signal Processing Letters.

[28]  Antonio Ortega,et al.  Scalable variable complexity approximate forward DCT , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[30]  David Salomon The Computer Graphics Manual , 2011, Texts in Computer Science.

[31]  Thomas Wiegand,et al.  Draft ITU-T recommendation and final draft international standard of joint video specification , 2003 .

[32]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[33]  Wen-Kuang Kuo,et al.  Traffic prediction and QoS transmission of real-time live VBR videos in WLANs , 2011, TOMCCAP.

[34]  Dejan S. Milojicic Background and Related Work , 1994 .

[35]  Y. Arai,et al.  A Fast DCT-SQ Scheme for Images , 1988 .

[36]  M.N.S. Swamy,et al.  Low-complexity 8×8 transform for image compression , 2008 .

[37]  Graham A. Jullien,et al.  Multiplication-free 8×8 2D DCT architecture using algebraic integer encoding , 2004 .

[38]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[39]  Ming-Ting Sun,et al.  Approximation of calculations for forward discrete cosine transform , 1998, IEEE Trans. Circuits Syst. Video Technol..

[40]  Yonghong Zeng,et al.  Transforms and Fast Algorithms for Signal Analysis and Representations , 2003 .

[41]  Robert Piessens,et al.  Quadpack: A Subroutine Package for Automatic Integration , 2011 .

[42]  B. Fino Relations between Haar and Walsh/Hadamard transforms , 1972 .

[43]  Wen-Hsiung Chen,et al.  A Fast Computational Algorithm for the Discrete Cosine Transform , 1977, IEEE Trans. Commun..

[44]  K. R. Rao,et al.  The H.264 Video Coding Standard , 2014, IEEE Potentials.

[45]  Trac D. Tran,et al.  Fast multiplierless approximations of the DCT with the lifting scheme , 2001, IEEE Trans. Signal Process..

[46]  Vladimir Britanak,et al.  CHAPTER 1 – Discrete Cosine and Sine Transforms , 2006 .

[47]  A. W. M. van den Enden,et al.  Discrete Time Signal Processing , 1989 .

[48]  Ajay Luthra,et al.  Introduction to the special issue on the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[49]  Arjuna Madanayake,et al.  Algebraic integer based 8×8 2-D DCT architecture for digital video processing , 2011, 2011 IEEE International Symposium of Circuits and Systems (ISCAS).

[50]  Nuno Roma,et al.  Efficient Hybrid DCT-Domain Algorithm for Video Spatial Downscaling , 2007, EURASIP J. Adv. Signal Process..