论文信息 - Transform domain sparsification of depth maps using iterative quadratic programming

Transform domain sparsification of depth maps using iterative quadratic programming

Compression of depth maps is important for “texture plus depth” format of multiview images, which enables synthesis of novel intermediate views via depth-image-based rendering (DIBR) at decoder. Previous depth map coding schemes exploit unique depth data characteristics to compactly and faithfully reproduce the original signal. In contrast, since depth map is only a means to the end of view synthesis and not itself viewed, in this paper we explicitly manipulate depth values, without causing severe synthesized view distortion, in order to maximize representation sparsity in the transform domain for compression gain — we call this process transform domain spar-sification (TDS). Specifically, for each pixel in the depth map, we first define a quadratic penalty function, with minimum at ground truth depth value, based on synthesized view's distortion sensitivity to the pixel's depth value during DIBR. We then define an objective for a depth signal in a block as a weighted sum of: i) signal's sparsity in the transform domain, and ii) per-pixel synthesized view distortion penalties for the chosen signal. Given that sparsity (70-norm) is non-convex and difficult to optimize, we replace the Zo-norm in the objective with a computationally inexpensive weighted 12-norm; the optimization is then an unconstrained quadratic program, solvable via a set of linear equations. For the weighted /2-norm to promote sparsity, we solve the optimization iteratively, where at each iteration weights are readjusted to mimic sparsity-promoting ZT-norm, 0 < r < 1. Using JPEG as an example transform codec, we show that our TDS approach gained up to 1.7dB in rate-distortion performance for the interpolated view over compression of unaltered depth maps.

Antonio Ortega | Gene Cheung | Akira Kubota | Junichi Ishida

[1] Antonio Ortega,et al. Sparse representation of depth maps for efficient transform coding , 2010, 28th Picture Coding Symposium.

[2] Nasir D. Memon,et al. Near-lossless image compression techniques , 1998, J. Electronic Imaging.

[3] Aljoscha Smolic,et al. Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.

[4] Detlev Marpe,et al. Global and local rate-distortion optimization for Lapped Biorthogonal Transform coding , 2010, 2010 IEEE International Conference on Image Processing.

[5] Leonard McMillan,et al. Post-rendering 3D warping , 1997, SI3D.

[6] I. Daubechies,et al. Iteratively reweighted least squares minimization for sparse recovery , 2008, 0807.0575.

[7] Antonio Ortega,et al. Depth map distortion analysis for view rendering and depth coding , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[8] Stephen P. Boyd,et al. Enhancing Sparsity by Reweighted ℓ1 Minimization , 2007, 0711.1612.

[9] Avideh Zakhor. Iterative procedures for reduction of blocking effects in transform image coding , 1992, IEEE Trans. Circuits Syst. Video Technol..

[10] Masayuki Tanimoto,et al. Multiview Imaging and 3DTV , 2007, IEEE Signal Processing Magazine.

[11] Minh N. Do,et al. Wavelet-Based Joint Estimation and Encoding of Depth-Image-Based Representations for Free-Viewpoint Rendering , 2008, IEEE Transactions on Image Processing.

[12] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.