A Novel Virtual View Quality Enhancement Technique through a Learning of Synthesised Video

With the development of displaying techniques, free viewpoint video (FVV) system shows its potential to provide immersive perceptual feeling by changing viewpoints. To provide this luxury, a large number of high quality views have to be synthesised from limited number of viewpoints. However, in this process, a portion of the background is occluded by the foreground object in the generated synthesised videos. Recent techniques, i.e. view synthesized prediction using Gaussian model (VSPGM) and adaptive weighting between warped and learned foregrounds indicate that learning techniques may fill occluded areas almost correctly. However, these techniques use temporal correlation by assuming that original texture of the target viewpoint are already available to fill up occluded areas which is not a practical solution. Moreover, if a pixel position experiences foreground once during learning, the existing techniques considered it as foreground throughout the process. However, the actual fact is that after experiencing a foreground a pixel position can be background again. To address the aforementioned issues, in the proposed view synthesise technique, we apply Gaussian mixture modelling (GMM) on the output images of inverse mapping (IM) technique for further improving the quality of the synthesised videos. In this technique, the foreground and background pixel intensities are refined from adaptive weights of the output of inverse mapping and the pixel intensities from the corresponding model(s) of the GMM. This technique provides a better pixel correspondence, which improves 0.10~0.46dB PSNR compared to the IM technique.

[1]  Manoranjan Paul,et al.  View Synthesised Prediction with Temporal Texture Synthesis for Multi-View Video , 2016, 2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[2]  Marco Grangetto,et al.  Depth image based rendering with inverse mapping , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[3]  Yu Huang,et al.  A layered method of visibility resolving in depth image-based rendering , 2008, 2008 19th International Conference on Pattern Recognition.

[4]  Yuesheng Zhu,et al.  A Hole Filling Approach Based on Background Reconstruction for View Synthesis in 3D Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[5]  Manoranjan Paul,et al.  Improved Gaussian mixtures for robust object detection by adaptive multi-background generation , 2008, 2008 19th International Conference on Pattern Recognition.

[6]  Manoranjan Paul,et al.  Efficient multi-view video coding using 3D motion estimation and virtual frame , 2016, Neurocomputing.

[7]  Thomas Wiegand,et al.  3-D Video Representation Using Depth Maps , 2011, Proceedings of the IEEE.

[8]  Hideo Saito,et al.  A Novel Inpainting-Based Layered Depth Video for 3DTV , 2011, IEEE Transactions on Broadcasting.

[9]  Yao Zhao,et al.  Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information , 2014, IEEE Transactions on Broadcasting.

[10]  Manoranjan Paul,et al.  Hole-filling for single-view plus-depth based rendering with temporal texture synthesis , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[11]  Ghassan Al-Regib,et al.  Hierarchical Hole-Filling For Depth-Based View Synthesis in FTV and 3D Video , 2012, IEEE Journal of Selected Topics in Signal Processing.

[12]  Manoranjan Paul,et al.  Free view-point video synthesis using Gaussian Mixture Modelling , 2015, 2015 International Conference on Image and Vision Computing New Zealand (IVCNZ).

[13]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[14]  Yao Zhao,et al.  View Synthesis Based on Background Update with Gaussian Mixture Model , 2012, PCM.

[15]  Ying Chen,et al.  Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Wei Xiang,et al.  Rate-Distortion Optimized Mode Switching for Error-Resilient Multi-View Video Plus Depth Based 3-D Video Coding , 2014, IEEE Transactions on Multimedia.

[17]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[18]  Manoranjan Paul,et al.  Adaptive weighting between warped and learned foregrounds for view synthesize , 2017, 2017 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[19]  Bu-Sung Lee,et al.  Explore and Model Better I-Frames for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Changick Kim,et al.  A Novel Depth-Based Virtual View Synthesis Method for Free Viewpoint Video , 2013, IEEE Transactions on Broadcasting.