Coding Algorithms for 3DTV—A Survey

Research efforts on 3DTV technology have been strengthened worldwide recently, covering the whole media processing chain from capture to display. Different 3DTV systems rely on different 3D scene representations that integrate various types of data. Efficient coding of these data is crucial for the success of 3DTV. Compression of pixel-type data including stereo video, multiview video, and associated depth or disparity maps extends available principles of classical video coding. Powerful algorithms and open international standards for multiview video coding and coding of video plus depth data are available and under development, which will provide the basis for introduction of various 3DTV systems and services in the near future. Compression of 3D mesh models has also reached a high level of maturity. For static geometry, a variety of powerful algorithms are available to efficiently compress vertices and connectivity. Compression of dynamic 3D geometry is currently a more active field of research. Temporal prediction is an important mechanism to remove redundancy from animated 3D mesh sequences. Error resilience is important for transmission of data over error prone channels, and multiple description coding (MDC) is a suitable way to protect data. MDC of still images and 2D video has already been widely studied, whereas multiview video and 3D meshes have been addressed only recently. Intellectual property protection of 3D data by watermarking is a pioneering research area as well. The 3D watermarking methods in the literature are classified into three groups, considering the dimensions of the main components of scene representations and the resulting components after applying the algorithm. In general, 3DTV coding technology is maturating. Systems and services may enter the market in the near future. However, the research area is relatively young compared to coding of other types of media. Therefore, there is still a lot of room for improvement and new development of algorithms.

[1]  Thomas Sikora,et al.  Extending Single-View Scalable Video Coding to Multi-View Based on H.264/AVC , 2006, 2006 International Conference on Image Processing.

[2]  Marcus A. Magnor,et al.  Data compression for light-field rendering , 2000, IEEE Trans. Circuits Syst. Video Technol..

[3]  Cevahir Çigla,et al.  Free-View Watermarking for Free-View Television , 2006, 2006 International Conference on Image Processing.

[4]  M. Barkowsky,et al.  Improving the Prediction Efficiency for MultiView Video Coding Using Histogram Matching , 2006 .

[5]  Aljoscha Smolic,et al.  Interactive 3-D Video Representation and Coding Technologies , 2005, Proceedings of the IEEE.

[6]  Craig Gotsman,et al.  Spectral compression of mesh geometry , 2000, EuroCG.

[7]  Jarek Rossignac,et al.  Edgebreaker: Connectivity Compression for Triangle Meshes , 1999, IEEE Trans. Vis. Comput. Graph..

[8]  Agnieszka C. Miguel,et al.  SPIHT for generalized multiple description coding , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[9]  Jaakko Astola,et al.  Schemes for Multiple Description Coding of Stereoscopic Video , 2006, MRCS.

[10]  B. Julesz Foundations of Cyclopean Perception , 1971 .

[11]  André Kaup,et al.  4D Scalable Multi-View Video Coding Using Disparity Compensated View Filtering and Motion Compensated Temporal Filtering , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[12]  Yao Wang,et al.  Error-resilient video coding using multiple description motion compensation , 2002, IEEE Trans. Circuits Syst. Video Technol..

[13]  J. E. Lengyel Compression Abstract of Time-Dependent Geometry , 1999 .

[14]  Michael T. Orchard,et al.  Multiple description coding using pairwise correlating transforms , 2001, IEEE Trans. Image Process..

[15]  Jörn Ostermann,et al.  Connectivity-Guided Predictive Compression of Dynamic 3D Meshes , 2006, 2006 International Conference on Image Processing.

[16]  Jean-Luc Dugelay,et al.  Texture-based watermarking of 3D video objects , 2003, IEEE Trans. Circuits Syst. Video Technol..

[17]  M. Reha Civanlar,et al.  Interactive transport of multi-view videos for 3DTV applications , 2006 .

[18]  C.-C. Jay Kuo,et al.  Technologies for 3D mesh compression: A survey , 2005, J. Vis. Commun. Image Represent..

[19]  André Kaup,et al.  Analysis of Multi-Reference Block Matching for MultiView Video Coding , 2006 .

[20]  Sang Uk Lee,et al.  Multiple description coding of motion fields for robust video transmission , 2001, IEEE Trans. Circuits Syst. Video Technol..

[21]  Marc Pollefeys,et al.  An evolutionary and optimised approach on 3D-TV , 2002 .

[22]  Aljoscha Smolic,et al.  An Overview of a New European Consortium: Integrated Three-Dimensional Television - Capture, Transmission and Display (3DTV) , 2004, EWIMT.

[23]  Borko Furht,et al.  Challenges and Opportunities in Video Coding for 3D TV , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[24]  Jörn Ostermann,et al.  AN ASSESSMENT OF 3DTV TECHNOLOGIES , 2006 .

[25]  Jaakko Astola,et al.  Two-stage multiple description image coders: Analysis and comparative study , 2006, Signal Process. Image Commun..

[26]  Jaakko Astola,et al.  Low-Complexity Multiple Description Coding of Video Based on 3D Block Transforms , 2007, EURASIP J. Embed. Syst..

[27]  Craig Gotsman,et al.  Triangle Mesh Compression , 1998, Graphics Interface.

[28]  Wen Gao,et al.  Distributed multi-view video coding , 2006, Electronic Imaging.

[29]  Toshiaki Fujii,et al.  Multi-View Video Coding using View Interpolation and Reference Picture Selection , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[30]  Borko Furht,et al.  Permutation-Based Low-Complexity Alternate Coding in Multi-View H.264/AVC , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[31]  Anthony Vetro,et al.  View Synthesis for Multiview Video Compression , 2006 .

[32]  Richard Szeliski,et al.  Layered depth images , 1998, SIGGRAPH.

[33]  G.B. Akar,et al.  Multiple Description Coding of 3D Geometrywith Forward Error Correction Codes , 2007, 2007 3DTV Conference.

[34]  W. Matusik,et al.  Coding Approaches for End-To-End 3D TV Systems , 2004 .

[35]  Arun N. Netravali,et al.  Digital Video: An introduction to MPEG-2 , 1996 .

[36]  Masayuki Tanimoto Free Viewpoint Television (FTV) , 2007 .

[37]  Jörn Ostermann,et al.  Layered Predictive Coding of Time-Consistent Dynamic 3D Meshes using a Non-Linear Predictor , 2007, 2007 IEEE International Conference on Image Processing.

[38]  Hans-Peter Seidel,et al.  Free-viewpoint video of human actors , 2003, ACM Trans. Graph..

[39]  Andrei Khodakovsky,et al.  Wavelet compression of parametrically coherent mesh sequences , 2004, SCA '04.

[40]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[41]  Michael T. Orchard,et al.  Overlapped block motion compensation: an estimation-theoretic approach , 1994, IEEE Trans. Image Process..

[42]  A. Murat Tekalp,et al.  Interactive Multi-view Video Delivery with View-Point Tracking and Fast Stream Switching , 2006, MRCS.

[43]  Wolfgang Straßer,et al.  Real time compression of triangle mesh connectivity , 1998, SIGGRAPH.

[44]  Marc Levoy,et al.  Light field rendering , 1996, SIGGRAPH.

[45]  N. Stefanoski,et al.  Scalable Linear Predictive Coding of Time-Consistent 3D Mesh Sequences , 2007, 2007 3DTV Conference.

[46]  Martin Isenburg,et al.  Face fixer: compressing polygon meshes with properties , 2000, SIGGRAPH.

[47]  Peter Eisert,et al.  Predictive compression of dynamic 3D meshes , 2005, IEEE International Conference on Image Processing 2005.

[48]  Michael T. Orchard,et al.  Multiple-description video coding using motion-compensated temporal prediction , 2002, IEEE Trans. Circuits Syst. Video Technol..

[49]  C.-C. Jay Kuo,et al.  Geometry-guided progressive lossless 3D mesh coding with octree (OT) decomposition , 2005, SIGGRAPH 2005.

[50]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[51]  C.-C. Jay Kuo,et al.  Geometry-guided progressive lossless 3D mesh coding with octree (OT) decomposition , 2005, ACM Trans. Graph..

[52]  W. Gao,et al.  Inter-View Direct Mode for Multiview Video Coding , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[53]  Aljoscha Smolic,et al.  Coding efficiency and complexity analysis of MVC prediction structures , 2007, 2007 15th European Signal Processing Conference.

[54]  N. J. A. Sloane,et al.  Multiple-description vector quantization with lattice codebooks: Design and analysis , 2001, IEEE Trans. Inf. Theory.

[55]  Marcus A. Magnor,et al.  Multi-view coding for image-based rendering using 3-D scene geometry , 2003, IEEE Trans. Circuits Syst. Video Technol..

[56]  Steven J. Gortler,et al.  Geometry images , 2002, SIGGRAPH.

[57]  Vinay A. Vaishampayan,et al.  Asymptotic Analysis of Multiple Description Quantizers , 1998, IEEE Trans. Inf. Theory.

[58]  Marc Alexa,et al.  Representing Animations by Principal Components , 2000, Comput. Graph. Forum.

[59]  Wolfgang Straßer,et al.  Efficient Compression of 3D Dynamic Mesh Sequences , 2007, J. WSCG.

[60]  Andrei Khodakovsky,et al.  Progressive geometry compression , 2000, SIGGRAPH.

[61]  John W. Woods,et al.  Domain-based multiple description coding of images and video , 2002, IS&T/SPIE Electronic Imaging.

[62]  Vinay A. Vaishampayan,et al.  Design of multiple description scalar quantizers , 1993, IEEE Trans. Inf. Theory.

[63]  Jed Lengyel,et al.  Compression of time-dependent geometry , 1999, SI3D.

[64]  Aljoscha Smolic,et al.  3DAV exploration of video-based rendering technology in MPEG , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[65]  Pierre Alliez,et al.  Progressive compression for lossless transmission of triangle meshes , 2001, SIGGRAPH.

[66]  Ryutarou Ohbuchi,et al.  Watermarking 3D Polygonal Meshes in the Mesh Spectral Domain , 2001, Graphics Interface.

[67]  L. Ozarow,et al.  On a source-coding problem with two channels and three receivers , 1980, The Bell System Technical Journal.

[68]  Wenxian Yang,et al.  Scalable multiview video coding using wavelet , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[69]  Rieko Fukushima,et al.  Flatbed-type autostereoscopic display system and its image format for encoding , 2006, Electronic Imaging.

[70]  Gozde Bozdagi Akar,et al.  Multiple Description Scalar Quantization Based 3D Mesh Coding , 2006, 2006 International Conference on Image Processing.

[71]  Wen Gao,et al.  Distributed Multiview Video Coding , 2005 .

[72]  U. Gudukbay,et al.  Connectivity-Guided Adaptive Lifting Transform for Image Like Compression of Meshes , 2007, 2007 3DTV Conference.

[73]  Gabriel Taubin,et al.  Geometric compression through topological surgery , 1998, TOGS.

[74]  Vivek K. Goyal,et al.  Generalized multiple description coding with correlating transforms , 2001, IEEE Trans. Inf. Theory.

[75]  Thomas Wiegand,et al.  3D Video and Free Viewpoint Video - Technologies, Applications and MPEG Standards , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[76]  Jaakko Astola,et al.  Wavelet-based multiple description coding of 3-D geometry , 2007, Electronic Imaging.

[77]  Yao Wang,et al.  Multiple Description Coding for Video Delivery , 2005, Proceedings of the IEEE.

[78]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[79]  Nasir D. Memon,et al.  Multiple-description geometry compression for networked interactive 3D graphics , 2004, Third International Conference on Image and Graphics (ICIG'04).

[80]  Craig Gotsman,et al.  Compression of soft-body animation sequences , 2004, Comput. Graph..

[81]  T. Wiegand,et al.  Efficient Compression of Multi-View Depth Data Based on MVC , 2007, 2007 3DTV Conference.

[82]  Rachida Amjoun,et al.  Higher order prediction for geometry compression , 2003, 2003 Shape Modeling International..

[83]  Adam Finkelstein,et al.  Robust mesh watermarking , 1999, SIGGRAPH.

[84]  Oliver Benedens,et al.  Geometry-Based Watermarking of 3D Models , 1999, IEEE Computer Graphics and Applications.

[85]  Toshiaki Fujii,et al.  Real-time view interpolation system for super multiview 3D display , 2001, IS&T/SPIE Electronic Imaging.

[86]  Stefano Tubaro,et al.  A space domain approach for multiple description video coding , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[87]  Kwang Pyo Choi,et al.  An Efficient Multiple Description Coding Using Whitening Transform , 2003 .

[88]  Kannan Ramchandran,et al.  Multiple-description wavelet based image coding , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[89]  Gabriel Taubin,et al.  Progressive forest split compression , 1998, SIGGRAPH.

[90]  Pierre Alliez,et al.  Recent advances in compression of 3D meshes , 2005, 2005 13th European Signal Processing Conference.

[91]  Aljoscha Smolic,et al.  Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.

[92]  Yao Wang,et al.  Lapped orthogonal transforms designed for error-resilient image coding , 2002, IEEE Trans. Circuits Syst. Video Technol..

[93]  A. Murat Tekalp,et al.  Scalable Multi-View Video Coding for Interactive 3DTV , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[94]  David Zhang,et al.  Robust mesh watermarking based on multiresolution processing , 2001, Comput. Graph..

[95]  Jörg Ott,et al.  Error resilience support in H.263+ , 1998, IEEE Trans. Circuits Syst. Video Technol..

[96]  Kwan-Jung Oh Multi-view Video Coding based on the Lattice-like Pyramid GOP Structure , 2006 .

[97]  Titus B. Zaharia,et al.  A skinning approach for dynamic 3D mesh compression , 2006, Comput. Animat. Virtual Worlds.

[98]  Anastasios Tefas,et al.  Blind robust watermarking schemes for copyright protection of 3D mesh objects , 2005, IEEE Transactions on Visualization and Computer Graphics.

[99]  Pierre Alliez,et al.  Valence‐Driven Connectivity Encoding for 3D Meshes , 2001, Comput. Graph. Forum.

[100]  Avishai Henik,et al.  Compression of stereo images using subsampling and transform coding , 1991 .

[101]  Wojciech Matusik,et al.  3D TV: a scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes , 2004, ACM Trans. Graph..

[102]  Hiroshi Masuda,et al.  Watermarking three-dimensional polygonal models through geometric and topological modifications , 1998, IEEE J. Sel. Areas Commun..

[103]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[104]  N. Atzpadin,et al.  Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability , 2007, Signal Process. Image Commun..

[105]  S. B. Kang,et al.  Survey of image-based representations and compression techniques , 2003, IEEE Trans. Circuits Syst. Video Technol..

[106]  Vivek K. Goyal,et al.  Multiple description coding: compression meets the network , 2001, IEEE Signal Process. Mag..

[107]  Heiko Schwarz,et al.  Analysis of Hierarchical B Pictures and MCTF , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[108]  A. Murat Tekalp,et al.  Temporal and spatial scaling for stereoscopic video compression , 2006, 2006 14th European Signal Processing Conference.

[109]  Jarek Rossignac,et al.  Dynapack: space-time compression of the 3D animations of triangle meshes with fixed connectivity , 2003, SCA '03.

[110]  Peter Eisert,et al.  Rate-distortion-optimized predictive compression of dynamic 3D mesh sequences , 2006, Signal Process. Image Commun..

[111]  James D. K. Kim,et al.  Interpolator data compression for MPEG-4 animation , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[112]  B. Girod,et al.  Motion and Disparity Compensated Coding for Video Camera Arrays , 2006 .

[113]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[114]  Wenxian Yang,et al.  4-D Wavelet-Based Multiview Video Coding , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[115]  Aljoscha Smolic,et al.  Scene Representation Technologies for 3DTV—A Survey , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[116]  Xiaojun Wu,et al.  Real-time dynamic 3-D object shape reconstruction and high-fidelity texture mapping for 3-D video , 2004, IEEE Transactions on Circuits and Systems for Video Technology.

[117]  Wojciech Matusik,et al.  3D TV: a scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes , 2004, ACM Trans. Graph..

[118]  Adrian G. Bors,et al.  Watermarking mesh-based representations of 3-D objects using local moments , 2006, IEEE Transactions on Image Processing.

[119]  Ralf Sarlette,et al.  Simple and efficient compression of animation sequences , 2005, SCA '05.

[120]  Anthony Vetro,et al.  Extensions of H.264/AVC for Multiview Video Compression , 2006, 2006 International Conference on Image Processing.

[121]  Hugues Hoppe,et al.  Shape Compression using Spherical Geometry Images , 2005, Advances in Multiresolution for Geometric Modelling.

[122]  Jiang Li,et al.  AN EFFECTIVE EPIPOLAR GEOMETRY ASSISTED MOTION-ESTIMATION TECHNIQUE FOR MULTIVIEW IMAGE CODING , 2006 .

[123]  Gozde Bozdagi Akar,et al.  A Multi-View Video Codec Based on H.264 , 2006, 2006 International Conference on Image Processing.

[124]  A. Aksay,et al.  Towards compound stereo-video quality metric: a specific encoder-based framework , 2006, 2006 IEEE Southwest Symposium on Image Analysis and Interpretation.

[125]  Vivek K. Goyal,et al.  Multiple description vector quantization with a coarse lattice , 2002, IEEE Trans. Inf. Theory.

[126]  Hugues Hoppe,et al.  Spherical parametrization and remeshing , 2003, ACM Trans. Graph..

[127]  Toshiaki Fujii,et al.  Free viewpoint TV system based on ray-space representation , 2002, SPIE ITCom.

[128]  Markus H. Gross,et al.  3D video fragments: dynamic point samples for real-time free-viewpoint video , 2004, Comput. Graph..

[129]  Pedro V. Sander,et al.  Geometry videos: a new representation for 3D animations , 2003, SCA '03.

[130]  Michael G. Strintzis,et al.  Drift-free multiple description coding of video , 2001, 2001 IEEE Fourth Workshop on Multimedia Signal Processing (Cat. No.01TH8564).

[131]  Horace Ho-Shing Ip,et al.  A robust watermarking scheme for 3D triangular mesh models , 2003, Pattern Recognit..

[132]  Markus H. Gross,et al.  3D Video Recorder: a System for Recording and Playing Free‐Viewpoint Video † , 2003, Comput. Graph. Forum.

[133]  Jaroslaw Domaszewicz,et al.  Design of entropy-constrained multiple-description scalar quantizers , 1994, IEEE Trans. Inf. Theory.

[134]  Gabriella Olmo,et al.  A novel multiple description coding scheme compatible with the JPEG2000 decoder , 2004, IEEE Signal Processing Letters.

[135]  Aljoscha Smolic,et al.  Efficient Compression of Multi-View Video Exploiting Inter-View Dependencies Based on H.264/MPEG4-AVC , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[136]  Shawmin Lei,et al.  Stereo-view video coding using H.264 tools , 2005, IS&T/SPIE Electronic Imaging.

[137]  Civanlar M. Reha,et al.  Interactive transport of multi-view videos for 3DTV applications , 2006 .

[138]  S. Kanai,et al.  Digital Watermarking for 3D Polygons using Multiresolution Wavelet Decomposition , 1998 .

[139]  L. Vasa,et al.  CODDYAC: Connectivity Driven Dynamic Mesh Compression , 2007, 2007 3DTV Conference.

[140]  Borko Furht,et al.  Design and evaluation of a 3D video system based on H.264 view coding , 2006, NOSSDAV '06.