论文信息 - Cross-View Multi-Lateral Filter for Compressed Multi-View Depth Video

Cross-View Multi-Lateral Filter for Compressed Multi-View Depth Video

Multi-view depth is crucial for describing positioning information in 3D space for virtual reality, free viewpoint video, and other interaction- and remote-oriented applications. However, in cases of lossy compression for bandwidth limited remote applications, the quality of multi-view depth video suffers from quantization errors, leading to the generation of obvious artifacts in consequent virtual view rendering during interactions. Considerable efforts must be made to properly address these artifacts. In this paper, we propose a cross-view multi-lateral filtering scheme to improve the quality of compressed depth maps/videos within the framework of asymmetric multi-view video with depth compression. Through this scheme, a distorted depth map is enhanced via non-local candidates selected from current and neighboring viewpoints of different time-slots. Specifically, these candidates are clustered into a macro super pixel denoting the physical and semantic cross-relationships of the cross-view, spatial and temporal priors. The experimental results show that gains from static depth maps and dynamic depth videos can be obtained from PSNR and SSIM metrics, respectively. In subjective evaluations, even object contours are recovered from a compressed depth video. We also verify our method via several practical applications. For these verifications, artifacts on object contours are properly managed for the development of interactive video and discontinuous object surfaces are restored for 3D modeling. Our results suggest that the proposed filter outperforms state-of-the-art filters and is suitable for use in multi-view color plus depth-based interaction- and remote-oriented applications.

Xin He | Zhen Liu | Qiong Liu | You Yang

[1] Leonard McMillan,et al. Plenoptic Modeling: An Image-Based Rendering System , 2023 .

[2] Richard G. Baraniuk,et al. Kronecker Compressive Sensing , 2012, IEEE Transactions on Image Processing.

[3] Yue Gao,et al. Cross-View Down/Up-Sampling Method for Multiview Depth Video Coding , 2012, IEEE Signal Processing Letters.

[4] Jian Sun,et al. Guided Image Filtering , 2010, ECCV.

[5] Gunnar Farnebäck,et al. Two-Frame Motion Estimation Based on Polynomial Expansion , 2003, SCIA.

[6] Christoph Fehn,et al. Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[7] Kwanghoon Sohn,et al. Reliability-Based Multiview Depth Enhancement Considering Interview Coherence , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[8] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[9] Kwanghoon Sohn,et al. Multiview ToF sensor fusion technique for high-quality depth map , 2013, Electronic Imaging.

[10] Ying Chen,et al. Standardized Extensions of High Efficiency Video Coding (HEVC) , 2013, IEEE Journal of Selected Topics in Signal Processing.

[11] Minh N. Do,et al. Depth Video Enhancement Based on Weighted Mode Filtering , 2012, IEEE Transactions on Image Processing.

[12] Ying Chen,et al. Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[13] V. N. Q. Bao,et al. Projected Barzilai-Borwein Methods Applied to Distributed Compressive Spectrum Sensing , 2010, 2010 IEEE Symposium on New Frontiers in Dynamic Spectrum (DySPAN).

[14] B. Zeng,et al. Candidate value-based boundary filtering for compressed depth images , 2015 .

[15] Zhi Jin,et al. Quality enhancement of quality-asymmetric multiview plus depth video by using virtual view , 2015, 2015 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[16] Mei Yu,et al. Cluster-based cross-view filtering for compressed multi-view depth maps , 2016, 2016 Visual Communications and Image Processing (VCIP).

[17] Dani Lischinski,et al. Joint bilateral upsampling , 2007, SIGGRAPH 2007.

[18] Ming-Yu Liu,et al. Joint Geodesic Upsampling of Depth Images , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[19] Yao Zhao,et al. Two-stage filtering of compressed depth images with Markov Random Field , 2017, Signal Process. Image Commun..

[20] Long Bao Le,et al. Wireless Communications and Mobile Computing 1 Joint Data Compression and Mac Protocol Design for Smartgrids with Renewable Energy , 2022 .

[21] Gary J. Sullivan,et al. Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[22] Long Bao Le,et al. Compressed sensing based data processing and MAC protocol design for smartgrids , 2015, 2015 IEEE Wireless Communications and Networking Conference (WCNC).

[23] Thorsten Joachims,et al. Semantic Labeling of 3D Point Clouds for Indoor Scenes , 2011, NIPS.

[24] Yo-Sung Ho,et al. Depth Coding Using a Boundary Reconstruction Filter for 3-D Video Systems , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[25] Yao Wang,et al. Color-Guided Depth Recovery From RGB-D Data Using an Adaptive Autoregressive Model , 2014, IEEE Transactions on Image Processing.

[26] Sebastian Thrun,et al. A Noise‐aware Filter for Real‐time Depth Upsampling , 2008 .

[27] Wolfram Burgard,et al. 3-D Mapping With an RGB-D Camera , 2014, IEEE Transactions on Robotics.

[28] Kwanghoon Sohn,et al. Depth boundary reconstruction based on similarity of adjacent pixels for efficient 3-D video coding , 2013, IEEE Transactions on Consumer Electronics.

[29] Ahmet M. Kondoz,et al. A Depth Map Post-Processing Framework for 3D-TV Systems based on Compression Artifact Analysis , 2011 .

[30] Toshiaki Fujii,et al. FTV for 3-D Spatial Communication , 2012, Proceedings of the IEEE.

[31] Lai-Man Po,et al. Adaptive depth truncation filter for MVC based compressed depth image , 2014, Signal Process. Image Commun..

[32] Jitendra Malik,et al. Large Displacement Optical Flow: Descriptor Matching in Variational Motion Estimation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33] Djemel Ziou,et al. Image Quality Metrics: PSNR vs. SSIM , 2010, 2010 20th International Conference on Pattern Recognition.

[34] Li Yu,et al. Beyond the interference problem: hierarchical patterns for multiple-projector structured light system. , 2014, Applied optics.

[35] Erhan Ekmekcioglu,et al. Content Adaptive Enhancement of Multi-View Depth Maps for Free Viewpoint Video , 2011, IEEE Journal of Selected Topics in Signal Processing.

[36] Hyung Yun Kong,et al. A novel and efficient mixed-signal compressed sensing for wide-band cognitive radio , 2010, International Forum on Strategic Technology 2010.

[37] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[38] Roberto Manduchi,et al. Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[39] Michael J. Black,et al. Secrets of optical flow estimation and their principles , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[40] Namho Hur,et al. Asymmetric Coding of Stereoscopic Video for Transmission Over T-DMB , 2007, 2007 3DTV Conference.

[41] Berthold K. P. Horn,et al. Determining Optical Flow , 1981, Other Conferences.

[42] Qiang Wu,et al. Robust Color Guided Depth Map Restoration , 2017, IEEE Transactions on Image Processing.

[43] Miska M. Hannuksela,et al. Subjective study on compressed asymmetric stereoscopic video , 2010, 2010 IEEE International Conference on Image Processing.

[44] Pascal Fua,et al. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45] Dong Tian,et al. Joint trilateral filtering for depth map compression , 2010, Visual Communications and Image Processing.

[46] Yo-Sung Ho,et al. Depth Reconstruction Filter and Down/Up Sampling for Depth Coding in 3-D Video , 2009, IEEE Signal Processing Letters.

[47] Roger Fletcher,et al. Projected Barzilai-Borwein methods for large-scale box-constrained quadratic programming , 2005, Numerische Mathematik.

[48] Jonathan Steuer,et al. Defining virtual reality: dimensions determining telepresence , 1992 .

[49] Jui-Chiu Chiang,et al. Frame-compatible asymmetric stereo video coding considering human perception , 2014, 2014 IEEE Global Conference on Signal and Information Processing (GlobalSIP).