Toward a 3D video format for auto-stereoscopic displays

Momentum in the production of 3D content for cinema applications has increased recently, although for the most part this content has been limited to stereo. A variety of display technologies that support 3DTV are also on the market, each offering a different viewing experience and having different input requirements. More specifically, stereoscopic displays support stereo content and require glasses, while auto-stereoscopic displays avoid the need for glasses by rendering view-dependent stereo pairs for a multitude of viewing angles. To realize high-quality auto-stereoscopic displays, multiple views of the video must either be provided as input to the display or be created locally at the display. The former approach is difficult because the production environment is typically limited to stereo and transmission bandwidth for a large number of views is unlikely to be available. This paper discusses an emerging 3D data format that enables the latter approach. It introduces a new framework for efficiently representing a 3D scene and enabling the reconstruction of an arbitrarily large number of views prior to rendering. Several design challenges are also highlighted through experimental results.
