Evaluation the performance of fully convolutional networks for building extraction compared with shallow models

With the development of machine learning, many researchers have used machine learning models for building extraction in high resolution remote sensing images. Especially for the recently proposed deep learning models, it has been widely used for building detection in the urban monitoring. In this study, the performances of images segmentation for building extraction based on Fully Convolutional Networks (FCN) model and shallow models are qualitatively and quantitatively compared. Firstly, the public aerial dataset of Massachusetts building dataset[1] are preprocessed to extract features. Then, we trained shallow models and deep model on Massachusetts building dataset. Moreover, the trained FCN model and shallow models are used to extract the building on the same image. Finally, we compared performances between shallow models and deep model. It is found that FCN gives the highest recall 0.63, precision 0.62, and F-measure rate 0.63. The qualitative and quantitative analysis of the building extraction results fully demonstrates that FCN gives the best performance compared with traditional shallow models.

[1]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[2]  Koen E. A. van de Sande,et al.  Evaluating Color Descriptors for Object and Scene Recognition , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Junwei Han,et al.  A Survey on Object Detection in Optical Remote Sensing Images , 2016, ArXiv.

[4]  Ansgar Brunn,et al.  Hierarchical Bayesian nets for building extraction using dense digital surface models , 1998 .

[5]  Arno Schäpe,et al.  Multiresolution Segmentation : an optimization approach for high quality multi-scale image segmentation , 2000 .

[6]  Matti Pietikäinen,et al.  Performance evaluation of texture measures with classification based on Kullback discrimination of distributions , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[7]  Chander Kumar Singh,et al.  Evaluation of LiDAR and image segmentation based classification techniques for automatic building footprint extraction for a segment of Atlantic County, New Jersey , 2016 .

[8]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[9]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[10]  Andrea Lagorio,et al.  On the Use of SIFT Features for Face Authentication , 2006, 2006 Conference on Computer Vision and Pattern Recognition Workshop (CVPRW'06).

[11]  James M. Rehg,et al.  CENTRIST: A Visual Descriptor for Scene Categorization , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.