Pixel-level Semantics Guided Image Colorization

While many image colorization algorithms have recently shown the capability of producing plausible color versions from gray-scale photographs, they still suffer from the problems of context confusion and edge color bleeding. To address context confusion, we propose to incorporate the pixel-level object semantics to guide the image colorization. The rationale is that human beings perceive and distinguish colors based on the object's semantic categories. We propose a hierarchical neural network with two branches. One branch learns what the object is while the other branch learns the object's colors. The network jointly optimizes a semantic segmentation loss and a colorization loss. To attack edge color bleeding we generate more continuous color maps with sharp edges by adopting a joint bilateral upsamping layer at inference. Our network is trained on PASCAL VOC2012 and COCO-stuff with semantic segmentation labels and it produces more realistic and finer results compared to the colorization state-of-the-art.

[1]  Tien-Tsin Wong,et al.  Manga colorization , 2006, SIGGRAPH 2006.

[2]  Dani Lischinski,et al.  Colorization by example , 2005, EGSR '05.

[3]  Yi Li,et al.  R-FCN: Object Detection via Region-based Fully Convolutional Networks , 2016, NIPS.

[4]  Stephen Lin,et al.  Semantic colorization with internet images , 2011, ACM Trans. Graph..

[5]  Bin Sheng,et al.  Deep Colorization , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[6]  Lorenzo Torresani,et al.  Multiple hypothesis colorization and its application to image compression , 2017, Comput. Vis. Image Underst..

[7]  Jun-Cheng Chen,et al.  An adaptive edge detection based colorization algorithm and its applications , 2005, ACM Multimedia.

[8]  Dani Lischinski,et al.  Colorization using optimization , 2004, SIGGRAPH 2004.

[9]  Harry Shum,et al.  Natural Image Colorization , 2007, Rendering Techniques.

[10]  Iasonas Kokkinos,et al.  DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Mohammad Norouzi,et al.  PixColor: Pixel Recursive Colorization , 2017, BMVC.

[12]  Jianguo Zhang,et al.  The PASCAL Visual Object Classes Challenge , 2006 .

[13]  Hendrik P. A. Lensch,et al.  Infrared Colorization Using Deep Convolutional Neural Networks , 2016, 2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA).

[14]  Zongxu Pan,et al.  Automatic Color Correction for Multisource Remote Sensing Images with Wasserstein CNN , 2017, Remote. Sens..

[15]  Gregory Shakhnarovich,et al.  Learning Representations for Automatic Colorization , 2016, ECCV.

[16]  Deepu Rajan,et al.  Image colorization using similar images , 2012, ACM Multimedia.

[17]  Yong Yu,et al.  Unsupervised Diverse Colorization via Generative Adversarial Networks , 2017, ECML/PKDD.

[18]  Takeshi Naemura,et al.  Automatic colorization of grayscale images using multiple images on the web , 2009, SIGGRAPH '09.

[19]  Alexei A. Efros,et al.  Colorful Image Colorization , 2016, ECCV.

[20]  Stephen Lin,et al.  Intrinsic colorization , 2008, SIGGRAPH 2008.

[21]  Iasonas Kokkinos,et al.  Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs , 2014, ICLR.

[22]  Alexei A. Efros,et al.  Real-time user-guided image colorization with learned deep priors , 2017, ACM Trans. Graph..

[23]  Pietro Perona,et al.  Microsoft COCO: Common Objects in Context , 2014, ECCV.

[24]  Christoph H. Lampert,et al.  Probabilistic Image Colorization , 2017, BMVC.

[25]  Kevin Frans,et al.  Outline Colorization through Tandem Adversarial Networks , 2017, ArXiv.

[26]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Dani Lischinski,et al.  Joint bilateral upsampling , 2007, SIGGRAPH 2007.

[28]  Vittorio Ferrari,et al.  COCO-Stuff: Thing and Stuff Classes in Context , 2016, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  Hiroshi Ishikawa,et al.  Let there be color! , 2016, ACM Trans. Graph..

[30]  한보형,et al.  Learning Deconvolution Network for Semantic Segmentation , 2015 .

[31]  Bernhard Schölkopf,et al.  Automatic Image Colorization Via Multimodal Predictions , 2008, ECCV.