Virtual View Synthesis for Free Viewpoint Video and Multiview Video Compression using Gaussian Mixture Modelling

High quality virtual views need to be synthesized from adjacent available views for free viewpoint video and multiview video coding (MVC) to provide users with a more realistic 3D viewing experience of a scene. View synthesis techniques suffer from poor rendering quality due to holes created by occlusion and rounding integer error through warping. To remove the holes in the virtual view, the existing techniques use spatial and temporal correlation in intra/inter-view images and depth maps. However, they still suffer quality degradation in the boundary region of foreground and background areas due to the low spatial correlation in texture images and low correspondence in inter-view depth maps. To overcome the above-mentioned limitations, we use a number of models in the Gaussian mixture modeling (GMM) to separate background and foreground pixels in our proposed technique. Here, the missing pixels introduced from the warping process are recovered by the adaptive weighted average of the pixel intensities from the corresponding GMM model(s) and warped image. The weights vary with time to accommodate the changes due to a dynamic background and the motions of the moving objects for view synthesis. We also introduce an adaptive strategy to reset the GMM modeling if the contributions of the pixel intensities drop significantly. Our experimental results indicate that the proposed approach provides 5.40–6.60-dB PSNR improvement compared with the relevant methods. To verify the effectiveness of the proposed view synthesis technique, we use it as an extra reference frame in the motion estimation for MVC. The experimental results confirm that the proposed view synthesis is able to improve PSNR by 3.15–5.13 dB compared with the conventional three reference frames.

[1]  Manoranjan Paul,et al.  Free view-point video synthesis using Gaussian Mixture Modelling , 2015, 2015 International Conference on Image and Vision Computing New Zealand (IVCNZ).

[2]  Sehoon Yea,et al.  View synthesis prediction for multiview video coding , 2009, Signal Process. Image Commun..

[3]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[4]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[5]  Yao Zhao,et al.  Depth Map Driven Hole Filling Algorithm Exploiting Temporal Correlation Information , 2014, IEEE Transactions on Broadcasting.

[6]  Dar-Shyang Lee,et al.  Effective Gaussian mixture learning for video background subtraction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Marco Grangetto,et al.  Depth image based rendering with inverse mapping , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[8]  Yao Zhao,et al.  View Synthesis Based on Background Update with Gaussian Mixture Model , 2012, PCM.

[9]  Zhang Zhao-yang,et al.  Arbitrary view generation based on DIBR , 2007, 2007 International Symposium on Intelligent Signal Processing and Communication Systems.

[10]  Heiko Schwarz,et al.  3D High-Efficiency Video Coding for Multi-View Video and Depth Data , 2013, IEEE Transactions on Image Processing.

[11]  Shuai Li,et al.  Depth Image Based View Synthesis: New Insights and Perspectives on Hole Generation and Filling , 2016, IEEE Transactions on Broadcasting.

[12]  Marek Domanski,et al.  Depth-based inter-view prediction of motion vectors for improved multiview video coding , 2010, 2010 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[13]  Byung Tae Oh,et al.  View Synthesis Distortion Estimation for AVC- and HEVC-Compatible 3-D Video Coding , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Yun He,et al.  Fast disparity motion estimation in MVC based on range prediction , 2008, 2008 15th IEEE International Conference on Image Processing.

[15]  Bu-Sung Lee,et al.  A Long-Term Reference Frame for Hierarchical B-Picture-Based Video Coding , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Zhaoyang Lu,et al.  Model-Based Joint Bit Allocation Between Texture Videos and Depth Maps for 3-D Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Ken Chen,et al.  Asymmetric Coding of Multi-View Video Plus Depth Based 3-D Video for View Rendering , 2012, IEEE Transactions on Multimedia.

[18]  Manoranjan Paul,et al.  A novel motion classification based intermode selection strategy for HEVC performance improvement , 2016, Neurocomputing.

[19]  Manoranjan Paul,et al.  Improved Gaussian mixtures for robust object detection by adaptive multi-background generation , 2008, 2008 19th International Conference on Pattern Recognition.

[20]  Bogdan Ionescu,et al.  Multiview Plus Depth Video Coding With Temporal Prediction View Synthesis , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Béatrice Pesquet-Popescu,et al.  Depth-aided image inpainting for novel view synthesis , 2010, 2010 IEEE International Workshop on Multimedia Signal Processing.

[22]  Oscar C. Au,et al.  View Synthesis Prediction in the 3-D Video Coding Extensions of AVC and HEVC , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[23]  Shang-Hong Lai,et al.  Improved novel view synthesis from depth image with large baseline , 2008, 2008 19th International Conference on Pattern Recognition.

[24]  Ying Chen,et al.  Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Wei Xiang,et al.  Rate-Distortion Optimized Mode Switching for Error-Resilient Multi-View Video Plus Depth Based 3-D Video Coding , 2014, IEEE Transactions on Multimedia.

[26]  Thomas Maugey,et al.  Encoder-Driven Inpainting Strategy in Multiview Video Compression , 2016, IEEE Transactions on Image Processing.

[27]  Manoranjan Paul,et al.  Efficient multi-view video coding using 3D motion estimation and virtual frame , 2016, Neurocomputing.

[28]  Wen Gao,et al.  Low Complexity Adaptive View Synthesis Optimization in HEVC Based 3D Video Coding , 2014, IEEE Transactions on Multimedia.

[29]  Manoranjan Paul,et al.  Hole-filling for single-view plus-depth based rendering with temporal texture synthesis , 2016, 2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[30]  Yuesheng Zhu,et al.  A Hole Filling Approach Based on Background Reconstruction for View Synthesis in 3D Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[31]  Bu-Sung Lee,et al.  Explore and Model Better I-Frames for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Marcelo Walter,et al.  Selective hole-filling for depth-image based rendering , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[33]  W. Gao,et al.  Inter-View Direct Mode for Multiview Video Coding , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[34]  Kwanghoon Sohn,et al.  Early disparity estimation skipping for multi-view video coding , 2012, EURASIP Journal on Wireless Communications and Networking.

[35]  Mahsa Talebpourazad,et al.  3D-TV Content generation and multi-view video coding , 2010 .

[36]  Hsueh-Ming Hang,et al.  Virtual view synthesis using backward depth warping algorithm , 2013, 2013 Picture Coding Symposium (PCS).