Weakly Supervised Silhouette-based Semantic Change Detection

This paper presents a novel semantic change detection scheme that requires only weak supervision. A straightforward approach to this task is to train a semantic change detection network end-to-end on a large-scale dataset. However, this requires a dataset built specifically for the new task, and creating one is usually labor-intensive and time-consuming. To avoid this problem, we propose to train such a network from existing datasets by dividing the task into change detection and semantic extraction. A further challenge for change detection is the difference in camera viewpoints, for example between images of the same scene captured from a vehicle-mounted camera at different times. To address this challenge, we propose a new siamese network structure that introduces a correlation layer. In addition, we create a publicly available dataset for semantic change detection to evaluate the proposed method. The experimental results verify both the robustness of the proposed networks to viewpoint differences in the change detection task and their effectiveness for semantic change detection.
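The abstract does not specify how the correlation layer is defined. As an illustration only, the sketch below assumes the common formulation used in optical-flow networks: each position in the feature map of one siamese branch is compared, by a channel-averaged dot product, with positions in the other branch's feature map within a small displacement window. The function name `correlation`, the parameter `max_disp`, and the nested-list feature-map layout are hypothetical choices for this sketch, not the authors' implementation.

```python
from typing import List

# Feature map indexed as [channel][row][col]; a hypothetical layout for this sketch.
FeatureMap = List[List[List[float]]]


def correlation(feat_a: FeatureMap, feat_b: FeatureMap, max_disp: int = 1) -> FeatureMap:
    """Compare feat_a with shifted versions of feat_b.

    Returns (2 * max_disp + 1) ** 2 planes, one per displacement (dy, dx),
    ordered with dy as the outer loop and dx as the inner loop.
    Positions whose displaced partner falls outside the map are left at 0.0.
    """
    channels = len(feat_a)
    height = len(feat_a[0])
    width = len(feat_a[0][0])
    out: FeatureMap = []
    for dy in range(-max_disp, max_disp + 1):
        for dx in range(-max_disp, max_disp + 1):
            plane = [[0.0] * width for _ in range(height)]
            for y in range(height):
                for x in range(width):
                    yy, xx = y + dy, x + dx
                    if 0 <= yy < height and 0 <= xx < width:
                        # Channel-averaged dot product between the two feature vectors.
                        dot = sum(
                            feat_a[c][y][x] * feat_b[c][yy][xx]
                            for c in range(channels)
                        )
                        plane[y][x] = dot / channels
            out.append(plane)
    return out


# Toy single-channel example: feat_b is feat_a shifted one pixel to the right,
# standing in for a small viewpoint difference between the two images.
feat_a = [[[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]]
feat_b = [[[0.0, 1.0, 2.0], [0.0, 4.0, 5.0]]]
corr = correlation(feat_a, feat_b, max_disp=1)
```

In this toy case the plane for displacement (dy=0, dx=1), at index 5, responds more strongly than the zero-displacement plane at index 4, which is the kind of signal that lets later layers tolerate a viewpoint shift instead of flagging it as a change.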
