Joint learning of visual and spatial features for edit propagation from a single image

In this paper, we regard edit propagation as a multi-class classification problem and deep neural network (DNN) is used to solve the problem. We design a shallow and fully convolutional DNN that can be trained end-to-end. To achieve this, our method uses combinations of low-level visual features, which are extracted from the input image, and spatial features, which are computed through transforming user interactions, as input of the DNN, which efficiently performs a joint learning of visual and spatial features. We then train the DNN on many of such combinations in order to build a DNN-based pixel-level classifier. Our DNN is also equipped with patch-by-patch training and whole image estimation, speeding up learning and inference. Finally, we improve classification accuracy of the DNN by employing a fully connected conditional random field. Experimental results show that our method can respond to user interactions well and generate precise results compared with the state-of-art edit propagation approaches. Furthermore, we demonstrate our method on various applications.

[1]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2]  Dani Lischinski,et al.  Colorization using optimization , 2004, ACM Trans. Graph..

[3]  Yongwei Nie,et al.  Efficient Edit Propagation Using Hierarchical Data Structure , 2011, IEEE Transactions on Visualization and Computer Graphics.

[4]  Kaiming He,et al.  Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Tien-Tsin Wong,et al.  Manga colorization , 2006, ACM Trans. Graph..

[6]  Vladlen Koltun,et al.  Efficient Inference in Fully Connected CRFs with Gaussian Edge Potentials , 2011, NIPS.

[7]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Yizhou Yu,et al.  Automatic Photo Adjustment Using Deep Neural Networks , 2014, ACM Trans. Graph..

[9]  Xiaoou Tang,et al.  Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Jason Lawrence,et al.  AppWand: editing measured materials using appearance-driven optimization , 2007, ACM Trans. Graph..

[11]  Zeev Farbman,et al.  Interactive local adjustment of tonal values , 2006, ACM Trans. Graph..

[12]  Raanan Fattal,et al.  Diffusion maps for edge-aware image editing , 2010, SIGGRAPH 2010.

[13]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[14]  Hans-Peter Seidel,et al.  Design and volume optimization of space structures , 2017, ACM Trans. Graph..

[15]  Harry Shum,et al.  Natural Image Colorization , 2007, Rendering Techniques.

[16]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Yan Gui,et al.  Fast and Robust Image Cutout Using Bilateral Grid and Confidence Based Color Model , 2018 .

[18]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Alexei A. Efros,et al.  Real-time user-guided image colorization with learned deep priors , 2017, ACM Trans. Graph..

[20]  Shi-Min Hu,et al.  Instant Propagation of Sparse Edits on Images and Videos , 2010, Comput. Graph. Forum.

[21]  Patrick Pérez,et al.  Geodesic image and video editing , 2010, TOGS.

[22]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[23]  Yasushi Yamaguchi,et al.  Sparse pixel sampling for appearance edit propagation , 2015, The Visual Computer.

[24]  Fabio Pellacini,et al.  AppProp: all-pairs appearance-space edit propagation , 2008, ACM Trans. Graph..

[25]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[26]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[27]  Wencheng Wang,et al.  Real Time Edit Propagation by Efficient Sampling , 2011, Comput. Graph. Forum.

[28]  Mariano Rivera,et al.  Bayesian Scheme for Interactive Colourization, Recolourization and Image/Video Editing , 2010, Comput. Graph. Forum.

[29]  Guillermo Sapiro,et al.  Fast image and video colorization using chrominance blending , 2006, IEEE Transactions on Image Processing.

[30]  Edward H. Adelson,et al.  Eurographics Symposium on Rendering 2008 Scribbleboost: Adding Classification to Edge-aware Interpolation of Local Image and Video Adjustments , 2022 .

[31]  Hiroshi Ishikawa,et al.  Globally and locally consistent image completion , 2017, ACM Trans. Graph..

[32]  Ning Xu,et al.  Deep Interactive Object Selection , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[33]  Dani Lischinski,et al.  A Closed-Form Solution to Natural Image Matting , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[34]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[35]  Yoshihiro Kanamori,et al.  DeepProp: Extracting Deep Features from a Single Image for Edit Propagation , 2016, Comput. Graph. Forum.

[36]  Kun Xu,et al.  Efficient antialiased edit propagation for images and videos , 2012, Comput. Graph..

[37]  Trevor Darrell,et al.  Fully Convolutional Networks for Semantic Segmentation , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[38]  Li Xu,et al.  A sparse control model for image and video editing , 2013, ACM Trans. Graph..

[39]  Qinping Zhao,et al.  Sparse Dictionary Learning for Edit Propagation of High-Resolution Images , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Ana Cristina Murillo,et al.  A generic tool for interactive complex image editing , 2017, The Visual Computer.

[41]  Xiaowu Chen,et al.  Manifold preserving edit propagation , 2012, ACM Trans. Graph..

[42]  Seungyong Lee,et al.  Bilateral texture filtering , 2014, ACM Trans. Graph..

[43]  Jieping Ye,et al.  A framework for interactive image color editing , 2012, The Visual Computer.

[44]  Shi-Min Hu,et al.  Efficient affinity-based edit propagation using K-D tree , 2009, ACM Trans. Graph..