Neural Stereoscopic Image Style Transfer

Neural style transfer is an emerging technique which is able to endow daily-life images with attractive artistic styles. Previous work has succeeded in applying convolutional neural network (CNN) to style transfer for monocular images or videos. However, style transfer for stereoscopic images is still a missing piece. Different from processing a monocular image, the two views of a stylized stereoscopic pair are required to be consistent to provide the observer a comfortable visual experience. In this paper, we propose a dual path network for view-consistent style transfer on stereoscopic images. While each view of the stereoscopic pair is processed in an individual path, a novel feature aggregation strategy is proposed to effectively share information between the two paths. Besides a traditional perceptual loss used for controlling style transfer quality in each view, a multi-layer view loss is proposed to enforce the network to coordinate the learning of both paths to generate view-consistent stylized results. Extensive experiments show that, compared with previous methods, the proposed model can generate stylized stereoscopic images which achieve the best view consistency.

[1]  Serge J. Belongie,et al.  Arbitrary Style Transfer in Real-Time with Adaptive Instance Normalization , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[2]  Kai Zeng,et al.  Quality Prediction of Asymmetrically Distorted Stereoscopic 3D Images , 2015, IEEE Transactions on Image Processing.

[3]  Thomas Brox,et al.  Artistic Style Transfer for Videos , 2016, GCPR.

[4]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[5]  Shai Avidan,et al.  Geometrically consistent stereo seam carving , 2011, 2011 International Conference on Computer Vision.

[6]  Ming-Hsuan Yang,et al.  Diversified Texture Synthesis with Feed-Forward Networks , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[7]  Eric P. Xing,et al.  ZM-Net: Real-time Zero-shot Image Manipulation Network , 2017, ArXiv.

[8]  Andrea Vedaldi,et al.  Texture Networks: Feed-forward Synthesis of Textures and Stylized Images , 2016, ICML.

[9]  Thomas Brox,et al.  A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[11]  Hao Wang,et al.  Real-Time Neural Style Transfer for Videos , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Andrea Vedaldi,et al.  Instance Normalization: The Missing Ingredient for Fast Stylization , 2016, ArXiv.

[13]  Michael J. Black,et al.  A Naturalistic Open Source Movie for Optical Flow Evaluation , 2012, ECCV.

[14]  Craig S. Kaplan,et al.  Consistent stylization and painterly rendering of stereoscopic 3D images , 2012, NPAR '12.

[15]  Marcel P. Lucassen,et al.  Visual comfort of binocular and 3D displays , 2001, IS&T/SPIE Electronic Imaging.

[16]  Yung-Yu Chuang,et al.  Scene warping: Layer-based stereoscopic image resizing , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Margrit Gelautz,et al.  Image-based stereoscopic stylization , 2005, IEEE International Conference on Image Processing 2005.

[18]  Yunjin Lee,et al.  Stereoscopic 3D line drawing , 2013, ACM Trans. Graph..

[19]  Alexander Toet,et al.  Visual comfort of binocular and 3D displays , 2004 .

[20]  Leon A. Gatys,et al.  Image Style Transfer Using Convolutional Neural Networks , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Craig S. Kaplan,et al.  Stereoscopic 3D image stylization , 2013, Comput. Graph..

[22]  Bing-Yu Chen,et al.  Geometrically Consistent Stereoscopic Image Editing Using Patch-Based Synthesis , 2015, IEEE Transactions on Visualization and Computer Graphics.

[23]  Xin Wang,et al.  Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[24]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[25]  Yung-Yu Chuang,et al.  Content-Aware Display Adaptation and Interactive Editing for Stereoscopic Images , 2011, IEEE Transactions on Multimedia.

[26]  Nenghai Yu,et al.  Coherent Online Video Style Transfer , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[27]  Jiaying Liu,et al.  Demystifying Neural Style Transfer , 2017, IJCAI.

[28]  Li Fei-Fei,et al.  Characterizing and Improving Stability in Neural Style Transfer , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).