Hole-filling for single-view plus-depth based rendering with temporal texture synthesis

View synthesis technique for 3D video and free viewpoint video (FVV) using existing view(s) can avoid the large volume of video data transmission. Existing techniques may concern poor rendering quality by missing pixel values (i.e. creating holes) due to the occluded region, rounding error and disparity discontinuity. To address those problems with the existing techniques uses correlations in spatial texture only or both spatial texture and temporal background. The former techniques (e.g. inpainting) suffer quality degradation due to lack of spatial correlation on the foreground-background boundary areas. The latter techniques (e.g. background update with Gaussian Mixture-based Modelling (GMM)) can improve quality in some occluded areas, however, due to the dependency on warping of background image and spatial correlation they still suffer quality degradation. In this paper, we propose a new hole-filling technique using the number of GMM model rather than the background image to identify background/foreground pixels. The missing pixels of background and foreground are recovered from the background pixel and the weighted average of warped and foreground model pixels respectively. The experimental results show that the proposed approach provides 0.9~1.7dB PSNR improvement compare to the state-of-the-art method.

[1]  Marcelo Walter,et al.  Selective hole-filling for depth-image based rendering , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  Bu-Sung Lee,et al.  Explore and Model Better I-Frames for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  P. KaewTrakulPong,et al.  An Improved Adaptive Background Mixture Model for Real-time Tracking with Shadow Detection , 2002 .

[4]  Wei Xiang,et al.  Rate-Distortion Optimized Mode Switching for Error-Resilient Multi-View Video Plus Depth Based 3-D Video Coding , 2014, IEEE Transactions on Multimedia.

[5]  Peter H. N. de With,et al.  System architecture for free-viewpoint video and 3D-TV , 2008, IEEE Transactions on Consumer Electronics.

[6]  Manoranjan Paul,et al.  Efficient multi-view video coding using 3D motion estimation and virtual frame , 2016, Neurocomputing.

[7]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[8]  Manoranjan Paul,et al.  Free view-point video synthesis using Gaussian Mixture Modelling , 2015, 2015 International Conference on Image and Vision Computing New Zealand (IVCNZ).

[9]  Mei Yu,et al.  View-spatial-temporal post-refinement for view synthesis in 3D video systems , 2013, Signal Process. Image Commun..

[10]  Bu-Sung Lee,et al.  Pattern-based video coding with dynamic background modeling , 2013, EURASIP J. Adv. Signal Process..

[11]  Yao Zhao,et al.  Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information , 2014, IEEE Transactions on Broadcasting.

[12]  Bu-Sung Lee,et al.  Video coding with dynamic background , 2013, EURASIP J. Adv. Signal Process..

[13]  Oscar C. Au,et al.  View Synthesis Prediction in the 3-D Video Coding Extensions of AVC and HEVC , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Manoranjan Paul,et al.  Improved Gaussian mixtures for robust object detection by adaptive multi-background generation , 2008, 2008 19th International Conference on Pattern Recognition.

[15]  Marco Grangetto,et al.  Depth image based rendering with inverse mapping , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[16]  Dar-Shyang Lee,et al.  Effective Gaussian mixture learning for video background subtraction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Ken Chen,et al.  Asymmetric Coding of Multi-View Video Plus Depth Based 3-D Video for View Rendering , 2012, IEEE Transactions on Multimedia.

[18]  Bogdan Ionescu,et al.  Multiview Plus Depth Video Coding With Temporal Prediction View Synthesis , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Bu-Sung Lee,et al.  A Long-Term Reference Frame for Hierarchical B-Picture-Based Video Coding , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Yo-Sung Ho,et al.  Virtual view synthesis method and self‐evaluation metrics for free viewpoint television and 3D video , 2010, Int. J. Imaging Syst. Technol..

[21]  Zhaoyang Lu,et al.  Model-Based Joint Bit Allocation Between Texture Videos and Depth Maps for 3-D Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.