Unsupervised stereoscopic image retargeting via view synthesis and stereo cycle consistency losses

Abstract Stereoscopic image retargeting aims to manipulate the stereoscopic images to fit various devices with different resolutions and prescribed aspect ratios. With the development of various types of three-dimensional (3D) displays, stereoscopic image retargeting becomes increasingly popular in the field of computer graphics. In this paper, we propose an unsupervised stereoscopic image retargeting network (USIR-Net) to address the problem of stereoscopic image retargeting without label information. By exploring the inter-view correlation and disparity relationship of stereoscopic images, two unsupervised losses are developed to guide the learning of stereoscopic image retargeting model. First, in view of the inter-view correlation, a view synthesis loss is proposed to guarantee the generation of high-quality stereoscopic images with accurate inter-view relationship. Second, by exploiting the consistency of stereoscopic images before and after the retargeting, a stereo cycle consistency loss, which consists of a content consistency term and a disparity consistency term, is developed to preserve the structure information and prevent binocular disparity inconsistency. Quantitative and qualitative experimental results demonstrate that the proposed method achieves superior performance compared with state-of-the-art methods.

[1]  Shai Avidan,et al.  Stereo Seam Carving a Geometrically Consistent Approach , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Meng Wang,et al.  Improved seam carving combining with 3D saliency for image retargeting , 2015, Neurocomputing.

[3]  In-So Kweon,et al.  CBAM: Convolutional Block Attention Module , 2018, ECCV.

[4]  Weimin Tan,et al.  Semantic Segmentation Guided Pixel Fusion for Image Retargeting , 2020, IEEE Transactions on Multimedia.

[5]  Yao Sun,et al.  Composing Semantic Collage for Image Retargeting , 2018, IEEE Transactions on Image Processing.

[6]  Zhibo Chen,et al.  DeepIR: A Deep Semantics Driven Framework for Image Retargeting , 2019, 2019 IEEE International Conference on Multimedia & Expo Workshops (ICMEW).

[7]  Xuelong Li,et al.  Perceptually Aware Image Retargeting for Mobile Devices , 2018, IEEE Transactions on Image Processing.

[8]  Mei Yu,et al.  Three-dimensional visual comfort assessment via preference learning , 2015, J. Electronic Imaging.

[9]  Weisi Lin,et al.  Stereoscopic image retargeting based on 3D saliency detection , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Bo Yan,et al.  Pixel fusion based stereo image retargeting , 2015, 2015 IEEE International Conference on Multimedia and Expo (ICME).

[11]  Lok Ming Lui,et al.  Image Retargeting via Beltrami Representation , 2018, IEEE Transactions on Image Processing.

[12]  S. Avidan,et al.  Multi-operator media retargeting , 2009, SIGGRAPH 2009.

[13]  Feng Wu,et al.  Depth-Preserving Stereo Image Retargeting Based on Pixel Fusion , 2017, IEEE Transactions on Multimedia.

[14]  Feng Shao,et al.  MSTGAR: Multioperator-Based Stereoscopic Thumbnail Generation With Arbitrary Resolution , 2020, IEEE Transactions on Multimedia.

[15]  Kanoksak Wattanachote,et al.  Occlusion-Guided Vertical Retargeting For Stereoscopic Images Based On Pixel Fusion , 2020, 2020 IEEE International Conference on Image Processing (ICIP).

[16]  Jianjun Lei,et al.  Region-Enhanced Convolutional Neural Network for Object Detection in Remote Sensing Images , 2020, IEEE Transactions on Geoscience and Remote Sensing.

[17]  Tae-Hyun Oh,et al.  Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[18]  Giovanni Maria Farinella,et al.  Saliency-Based Selection of Gradient Vector Flow Paths for Content Aware Image Resizing , 2014, IEEE Transactions on Image Processing.

[19]  Gangyi Jiang,et al.  QoE-Guided Warping for Stereoscopic Image Retargeting , 2017, IEEE Transactions on Image Processing.

[20]  Weisi Lin,et al.  Saliency-based stereoscopic image retargeting , 2016, Inf. Sci..

[21]  Weimin Tan,et al.  Beyond Visual Retargeting: A Feature Retargeting Approach for Visual Recognition and Its Applications , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Jianjun Lei,et al.  Visual Attention Prediction for Stereoscopic Video by Multi-Module Fully Convolutional Network , 2019, IEEE Transactions on Image Processing.

[23]  Heiko Hirschmüller,et al.  Evaluation of Cost Functions for Stereo Matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Yung-Yu Chuang,et al.  Scene warping: Layer-based stereoscopic image resizing , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Qi Tian,et al.  Codebook Guided Feature-Preserving for Recognition-Oriented Image Retargeting , 2017, IEEE Transactions on Image Processing.

[26]  S. Avidan,et al.  Seam carving for content-aware image resizing , 2007, SIGGRAPH 2007.

[27]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[28]  Wu-chi Feng,et al.  Aesthetics-Based Stereoscopic Photo Cropping for Heterogeneous Displays , 2012, IEEE Transactions on Multimedia.

[29]  Oisin Mac Aodha,et al.  Unsupervised Monocular Depth Estimation with Left-Right Consistency , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[30]  Ling Shao,et al.  Semi-Heterogeneous Three-Way Joint Embedding Network for Sketch-Based Image Retrieval , 2019, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  Jeffrey Ho,et al.  CASAIR: Content and Shape-Aware Image Retargeting and Its Applications , 2016, IEEE Transactions on Image Processing.

[32]  Matthew Turk,et al.  On Preserving Structure in Stereo Seam Carving , 2015, 2015 International Conference on 3D Vision.

[33]  Feiniu Yuan,et al.  Optimized Multioperator Image Retargeting Based on Perceptual Similarity Measure , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[34]  Yung-Yu Chuang,et al.  Content-Aware Display Adaptation and Interactive Editing for Stereoscopic Images , 2011, IEEE Transactions on Multimedia.

[35]  Bo Liu,et al.  Deep Feature Guided Image Retargeting , 2019, 2019 IEEE Visual Communications and Image Processing (VCIP).

[36]  Qi Tian,et al.  Image Retargeting for Preserving Robust Local Feature: Application to Mobile Visual Search , 2016, IEEE Transactions on Multimedia.

[37]  Weimin Tan,et al.  Cycle-IR: Deep Cyclic Image Retargeting , 2019, IEEE Transactions on Multimedia.

[38]  Qingming Huang,et al.  Stereoscopic Image Stitching via Disparity-Constrained Warping and Blending , 2020, IEEE Transactions on Multimedia.

[39]  Ruigang Yang,et al.  GA-Net: Guided Aggregation Net for End-To-End Stereo Matching , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Wen Gao,et al.  Perceptual Temporal Incoherence-Guided Stereo Video Retargeting , 2020, IEEE Transactions on Image Processing.

[41]  Antonio Plaza,et al.  Scale-Free Convolutional Neural Network for Remote Sensing Scene Classification , 2019, IEEE Transactions on Geoscience and Remote Sensing.

[42]  Wen Gao,et al.  Depth-Preserving Warping for Stereo Image Retargeting , 2015, IEEE Transactions on Image Processing.

[43]  Mei Yu,et al.  Stereoscopic Visual Attention Guided Seam Carving for Stereoscopic Image Retargeting , 2016, Journal of Display Technology.

[44]  Xuelong Li,et al.  Shape-Preserving Object Depth Control for Stereoscopic Images , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[45]  Chao-Hung Lin,et al.  Patch-Based Image Warping for Content-Aware Retargeting , 2013, IEEE Transactions on Multimedia.

[46]  Jie Xie,et al.  Multiscale Densely-Connected Fusion Networks for Hyperspectral Images Classification , 2021, IEEE Transactions on Circuits and Systems for Video Technology.

[47]  Weisi Lin,et al.  Saliency Detection in the Compressed Domain for Adaptive Image Retargeting , 2012, IEEE Transactions on Image Processing.

[48]  Jianjun Lei,et al.  Depth Sensation Enhancement for Multiple Virtual View Rendering , 2015, IEEE Transactions on Multimedia.

[49]  Hyunjun Eun,et al.  A Quad Edge-Based Grid Encoding Model for Content-Aware Image Retargeting , 2019, IEEE Transactions on Visualization and Computer Graphics.

[50]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[51]  Bo Yan,et al.  Seam Searching-Based Pixel Fusion for Image Retargeting , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[52]  Chao-Hung Lin,et al.  Object-Coherence Warping for Stereoscopic Image Retargeting , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[53]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.