论文信息 - A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

A Machine Learning Approach to Optimal Inverse Discrete Cosine Transform (IDCT) Design

The design of the optimal inverse discrete cosine transform (IDCT) to compensate the quantization error is proposed for effective lossy image compression in this work. The forward and inverse DCTs are designed in pair in current image/video coding standards without taking the quantization effect into account. Yet, the distribution of quantized DCT coefficients deviate from that of original DCT coefficients. This is particularly obvious when the quality factor of JPEG compressed images is small. To address this problem, we first use a set of training images to learn the compound effect of forward DCT, quantization and dequantization in cascade. Then, a new IDCT kernel is learned to reverse the effect of such a pipeline. Experiments are conducted to demonstrate that the advantage of the new method, which has a gain of 0.11-0.30dB over the standard JPEG over a wide range of quality factors.

[1] Eirikur Agustsson,et al. NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[2] Gerhard Stoll,et al. ISO-MPEG-1 Audio: A Generic Standard for Coding of High-: Quality Digital Audio , 1994 .

[3] Ahmed Ben Atitallah,et al. Optimization and Implementation on Fpga of the DCT/IDCT Algorithm , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[4] Kiho Choi,et al. Zero coefficient-aware fast butterfly-based inverse discrete cosine transform algorithm , 2016, IET Image Process..

[5] Ajay Luthra,et al. Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[6] Ferran Marqués,et al. The MPEG-4 Book , 2003, J. Electronic Imaging.

[7] Michael W. Marcellin,et al. JPEG2000 - image compression fundamentals, standards and practice , 2013, The Kluwer international series in engineering and computer science.

[8] Gregory K. Wallace,et al. The JPEG still picture compression standard , 1991, CACM.

[9] Ming-Ting Sun,et al. Modeling DCT coefficients for fast video encoding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[10] Gary J. Sullivan,et al. Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[11] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[12] Arun N. Netravali,et al. Digital Video: An introduction to MPEG-2 , 1996 .

[13] Jianqin Zhou,et al. On discrete cosine transform , 2011, ArXiv.

[14] Safia Abdelmounaime,et al. New Brodatz-Based Image Databases for Grayscale Color and Multiband Texture Analysis , 2013 .