Platelet-based coding of depth maps for the transmission of multiview images

Emerging 3-D displays show several views of the scene simultaneously. A direct transmission of a selection of these views is impractical, because various types of displays support a different number of views and the decoder has to interpolate the intermediate views. The transmission of multiview image information can be simplified by only transmitting the texture data for the central view and a corresponding depth map. Additional to the coding of the texture data, this technique requires the efficient coding of depth maps. Since the depth map represents the scene geometry and thereby covers the 3-D perception of the scene, sharp edges corresponding to object boundaries, should be preserved. We propose an algorithm that models depth maps using piecewise-linear functions (platelets). To adapt to varying scene detail, we employ a quadtree decomposition that divides the image into blocks of variable size, each block being approximated by one platelet. In order to preserve sharp object boundaries, the support area of each platelet is adapted to the object boundary. The subdivision of the quadtree and the selection of the platelet type are optimized such that a global rate-distortion trade-off is realized. Experimental results show that the described method can improve the resulting picture quality after compression of depth maps by 1-3 dB when compared to a JPEG-2000 encoder.

[1]  Richard Szeliski,et al.  High-accuracy stereo depth maps using structured light , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[2]  Minh N. Do,et al.  Rate-distortion optimized tree-structured compression algorithms for piecewise polynomial images , 2005, IEEE Transactions on Image Processing.

[3]  Ingemar J. Cox,et al.  A maximum-flow formulation of the N-camera stereo correspondence problem , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[4]  Peter H. N. de With,et al.  Novel coding technique for depth images using quadtree decomposition and plane approximation , 2005, Visual Communications and Image Processing.

[5]  D. Donoho Wedgelets: nearly minimax estimation of edges , 1999 .

[6]  Nikolaos Grammalidis,et al.  Disparity field and depth map coding for multiview image sequence compression , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[7]  Harpreet S. Sawhney,et al.  A depth map representation for real-time transmission and view-based rendering of a dynamic 3D scene , 2002, Proceedings. First International Symposium on 3D Data Processing Visualization and Transmission.

[8]  Faouzi Kossentini,et al.  JasPer: a software-based JPEG-2000 codec implementation , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[9]  Hai Tao,et al.  Compression and transmission of depth maps for image-based rendering , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[10]  Robert D. Nowak,et al.  Platelets: a multiscale approach for recovering edges and surfaces in photon-limited medical imaging , 2003, IEEE Transactions on Medical Imaging.

[11]  Philip A. Chou,et al.  Optimal pruning with applications to tree-structured source coding and modeling , 1989, IEEE Trans. Inf. Theory.

[12]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.