View synthesis using foreground object extraction for disparity control and image inpainting

Abstract Among the rapidly growing three-dimensional technologies, multiview displays have drawn great research interests in three-dimensional television due to their adaption to the motion parallax and wider viewing angles. However, multiview displays still suffer from dazzling discomfort on the border of viewing zones. Leveraging on the separability of scene via foreground segmentation, we propose a novel virtual view synthesis method for depth-image-based rendering to alleviate the discomfort. Foreground objects of interest are extracted to segment the whole image into multiple layers, which are further warped to the virtual viewpoint in order. To alleviate the visual discomfort, global disparity adjustments and local depth control are performed for specific objects in each layer. For the post-processing, we improve an exemplar-based inpainting algorithm to tackle the disoccluded areas. Experimental results demonstrate that our method achieves effective disparity control and generates high-quality virtual view images.

[1]  Filippo Speranza,et al.  Stereoscopic 3D-TV: Visual Comfort , 2011, IEEE Transactions on Broadcasting.

[2]  Ju Shen,et al.  Layer Depth Denoising and Completion for Structured-Light RGB-D Cameras , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Tolga K. Çapin,et al.  Attention-Aware Disparity Control in interactive environments , 2013, The Visual Computer.

[4]  Yongdong Zhang,et al.  Effective Uyghur Language Text Detection in Complex Background Images for Traffic Prompt Identification , 2018, IEEE Transactions on Intelligent Transportation Systems.

[5]  Yong Man Ro,et al.  Visual Comfort Amelioration Technique for Stereoscopic Images: Disparity Remapping to Mitigate Global and Local Discomfort Causes , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Jianjun Lei,et al.  View generation with DIBR for 3D display system , 2014, Multimedia Tools and Applications.

[7]  Toshiaki Fujii,et al.  Free-viewpoint image synthesis using superpixel segmentation , 2017 .

[8]  Mårten Sjöström,et al.  Virtual view synthesis using layered depth image generation and depth-based inpainting for filling disocclusions and translucent disocclusions , 2016, J. Vis. Commun. Image Represent..

[9]  Yingli Tian,et al.  Resolution enhancement in single depth map and aligned image , 2016, 2016 IEEE Winter Conference on Applications of Computer Vision (WACV).

[10]  Neil A. Dodgson,et al.  Autostereoscopic 3D displays , 2005, Computer.

[11]  Eisuke Nakasu,et al.  Progress Overview of Capturing Method for Integral 3-D Imaging Displays , 2017, Proceedings of the IEEE.

[12]  Yo-Sung Ho,et al.  Disparity map enhancement in pixel based stereo matching method using distance transform , 2016, J. Vis. Commun. Image Represent..

[13]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[14]  Yongdong Zhang,et al.  A Highly Parallel Framework for HEVC Coding Unit Partitioning Tree Decision on Many-core Processors , 2014, IEEE Signal Processing Letters.

[15]  Ying Chen,et al.  Overview of the MVC + D 3D video coding standard , 2014, J. Vis. Commun. Image Represent..

[16]  Hui Chen,et al.  An effective graph and depth layer based RGB-D image foreground object extraction method , 2017, Computational Visual Media.

[17]  Martin Kleinsteuber,et al.  A Joint Intensity and Depth Co-sparse Analysis Model for Depth Map Super-resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[18]  Yongdong Zhang,et al.  Efficient Parallel Framework for HEVC Motion Estimation on Many-Core Processors , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Changick Kim,et al.  A Novel Depth-Based Virtual View Synthesis Method for Free Viewpoint Video , 2013, IEEE Transactions on Broadcasting.

[20]  Yongdong Zhang,et al.  Supervised Hash Coding With Deep Neural Network for Environment Perception of Intelligent Vehicles , 2018, IEEE Transactions on Intelligent Transportation Systems.

[21]  Jianjun Lei,et al.  Stereoscopic Visual Attention Guided Disparity Control for Multiview Images , 2014, Journal of Display Technology.

[22]  Wojciech Matusik,et al.  3D TV: a scalable system for real-time acquisition, transmission, and autostereoscopic display of dynamic scenes , 2004, ACM Trans. Graph..

[23]  Patrick Pérez,et al.  Region filling and object removal by exemplar-based image inpainting , 2004, IEEE Transactions on Image Processing.

[24]  David M. Hoffman,et al.  The zone of comfort: Predicting visual discomfort with stereo displays. , 2011, Journal of vision.

[25]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.