Deep Optimized Multiple Description Image Coding via Scalar Quantization Learning

In this paper, we introduce a deep multiple description coding (MDC) framework optimized by minimizing multiple description (MD) compressive loss. First, MD multi-scale-dilated encoder network generates multiple description tensors, which are discretized by scalar quantizers, while these quantized tensors are decompressed by MD cascaded-ResBlock decoder networks. To greatly reduce the total amount of artificial neural network parameters, an auto-encoder network composed of these two types of network is designed as a symmetrical parameter sharing structure. Second, this autoencoder network and a pair of scalar quantizers are simultaneously learned in an end-to-end self-supervised way. Third, considering the variation in the image spatial distribution, each scalar quantizer is accompanied by an importance-indicator map to generate MD tensors, rather than using direct quantization. Fourth, we introduce the multiple description structural similarity distance loss, which implicitly regularizes the diversified multiple description generations, to explicitly supervise multiple description diversified decoding in addition to MD reconstruction loss. Finally, we demonstrate that our MDC framework performs better than several state-of-the-art MDC approaches regarding image coding efficiency when tested on several commonly available datasets.

[1]  Lucas Theis,et al.  Lossy Image Compression with Compressive Autoencoders , 2017, ICLR.

[2]  David Zhang,et al.  Learning Convolutional Networks for Content-Weighted Image Compression , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[3]  Sorina Dumitrescu,et al.  Bit-Error Resilient Index Assignment for Multiple Description Scalar Quantizers , 2013, IEEE Transactions on Information Theory.

[4]  Vivek K. Goyal,et al.  Multiple description vector quantization with a coarse lattice , 2002, IEEE Trans. Inf. Theory.

[5]  Yao Zhao,et al.  Simultaneous color-depth super-resolution with conditional generative adversarial networks , 2019, Pattern Recognit..

[6]  Li Li,et al.  Layered multiple description video coding using dual-tree discrete wavelet transform and H.264/AVC , 2015, Multimedia Tools and Applications.

[7]  Yao Zhao,et al.  Multiple Description Convolutional Neural Networks for Image Compression , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Jia Zhang,et al.  Multiple Description Coding Based on Convolutional Auto-Encoder , 2019, IEEE Access.

[9]  Yao Zhao,et al.  Learning a Virtual Codec Based on Deep Convolutional Neural Network to Compress Image , 2017, J. Vis. Commun. Image Represent..

[10]  Vladlen Koltun,et al.  Multi-Scale Context Aggregation by Dilated Convolutions , 2015, ICLR.

[11]  David Minnen,et al.  Variable Rate Image Compression with Recurrent Neural Networks , 2015, ICLR.

[12]  Luc Van Gool,et al.  Conditional Probability Models for Deep Image Compression , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[13]  Yuanyuan Xu,et al.  End-to-End Rate-Distortion Optimized Description Generation for H.264 Multiple Description Video Coding , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Lubomir D. Bourdev,et al.  Real-Time Adaptive Image Compression , 2017, ICML.

[15]  S. Sandeep Pradhan,et al.  An Achievable Rate-Distortion Region for Multiple Descriptions Source Coding Based on Coset Codes , 2016, IEEE Transactions on Information Theory.

[16]  Yochai Blau,et al.  The Perception-Distortion Tradeoff , 2017, CVPR.

[17]  Jian Wang,et al.  Error-resilient region-of-interest video coding , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Vivek K. Goyal,et al.  Generalized multiple description coding with correlating transforms , 2001, IEEE Trans. Inf. Theory.

[19]  Vivek K. Goyal,et al.  Scalar Quantization With Random Thresholds , 2011, IEEE Signal Processing Letters.

[20]  Hiroshi Ishikawa,et al.  Let there be color! , 2016, ACM Trans. Graph..

[21]  Edward J. Delp,et al.  Spatial subsampling-based multiple description video coding with adaptive temporal-spatial error concealment , 2015, 2015 Picture Coding Symposium (PCS).

[22]  Francesco Palmieri,et al.  Multiple description image coder using correlating transforms , 2004, 2004 12th European Signal Processing Conference.

[23]  Ce Zhu,et al.  Multiple Description Video Coding Based on Hierarchical B Pictures , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Takahiro Yakoh,et al.  Ratio configurable multiple description correlating transforms coding , 2011, 2011 IEEE International Conference on Industrial Technology.

[25]  Michael T. Orchard,et al.  Multiple description coding using pairwise correlating transforms , 2001, IEEE Trans. Image Process..

[26]  Sorina Dumitrescu,et al.  Index mapping for bit-error resilient multiple description lattice vector quantizer , 2017, 2017 IEEE International Symposium on Information Theory (ISIT).

[27]  Wenhan Yang,et al.  Dmcnn: Dual-Domain Multi-Scale Convolutional Neural Network for Compression Artifacts Removal , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[28]  Stefano Tubaro,et al.  Multiple description video coding for scalable and robust transmission over IP , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Vinay A. Vaishampayan,et al.  Design of multiple description scalar quantizers , 1993, IEEE Trans. Inf. Theory.

[31]  Yao Zhao,et al.  Multiple Description Coding With Randomly and Uniformly Offset Quantizers , 2014, IEEE Transactions on Image Processing.

[32]  Sorina Dumitrescu,et al.  Flexible Symmetric Multiple Description Lattice Vector Quantizer With $L\geq 3$ Descriptions , 2014, IEEE Transactions on Communications.

[33]  N. J. A. Sloane,et al.  Multiple-description vector quantization with lattice codebooks: Design and analysis , 2001, IEEE Trans. Inf. Theory.

[34]  Sorina Dumitrescu,et al.  Flexible Multiple Description Lattice Vector Quantizer with L ≥ 3 Descriptions , 2014, 2014 Data Compression Conference.

[35]  Robert L. Stevenson,et al.  Efficient error recovery for multiple description video coding , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[36]  Garrison W. Cottrell,et al.  Understanding Convolution for Semantic Segmentation , 2017, 2018 IEEE Winter Conference on Applications of Computer Vision (WACV).

[37]  Hamid Jafarkhani,et al.  Multiple description trellis coded quantization , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[38]  Zhou Wang,et al.  Multiscale structural similarity for image quality assessment , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[39]  Sorina Dumitrescu,et al.  On the Design of Symmetric Entropy-Constrained Multiple Description Scalar Quantizer With Linear Joint Decoders , 2017, IEEE Transactions on Communications.

[40]  Shervin Shirmohammadi,et al.  Joint Intra and Multiple Description Coding for Packet Loss Resilient Video Transmission , 2018, IEEE Transactions on Multimedia.

[41]  Luca Benini,et al.  Soft-to-Hard Vector Quantization for End-to-End Learned Compression of Images and Neural Networks , 2017, ArXiv.

[42]  Syed Muhammad Anwar,et al.  Visual saliency based redundancy allocation in HEVC compatible multiple description video coding , 2017, Multimedia Tools and Applications.