Building extraction from multi-source remote sensing images via deep deconvolution neural networks

Building extraction from remote sensing images is of great importance in urban planning. Yet it is a longstanding problem for many complicate factors such as various scales and complex backgrounds. This paper proposes a novel supervised building extraction method via deep deconvolution neural networks (DeconvNet). Our method consists of three steps. First, we preprocess the multi-source remote sensing images provided by the IEEE GRSS Data Fusion Contest. A high-quality Vancouver building dataset is created on pansharpened images whose ground-truth are obtained from the OpenStreetMap project. Then, we pretrain a deep deconvolution network on a public large-scale Massachusetts building dataset, which is further fine-tuned by two band combinations (RGB and NRG) of our dataset, respectively. Moreover, the output saliency maps of the fine-tuned models are fused to produce the final building extraction result. Extensive experiments on our Vancouver building dataset demonstrate the effectiveness and efficiency of the proposed method. To the best of our knowledge, it is the first work to use deconvolution networks for building extraction from remote sensing images.

[1]  Geoffrey E. Hinton,et al.  Machine Learning for Aerial Image Labeling , 2013 .

[2]  Roberto Cipolla,et al.  SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[4]  Trevor Darrell,et al.  Caffe: Convolutional Architecture for Fast Feature Embedding , 2014, ACM Multimedia.

[5]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[6]  Mi Wang,et al.  Building detection in high resolution satellite urban image using segmentation, corner detection combined with adaptive windowed Hough Transform , 2013, 2013 IEEE International Geoscience and Remote Sensing Symposium - IGARSS.

[7]  J. Weber,et al.  Automatic Building Extraction in VHR Images Using Advanced Morphological Operators , 2007, 2007 Urban Remote Sensing Joint Event.

[8]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[9]  Nikos Komodakis,et al.  Building detection in very high resolution multispectral data with deep learning features , 2015, 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS).

[10]  Alan R. Gillespie,et al.  Color enhancement of highly correlated images. II. Channel ratio and “chromaticity” transformation techniques , 1987 .

[11]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.