A Hierarchical Building Detection Method for Very High Resolution Remotely Sensed Images Combined with DSM Using Graph Cut Optimization

Detecting buildings in remotely sensed data plays an important role for urban analysis and geographical information systems. This study proposes a hierarchical approach for extracting buildings from very high resolution (9 cm GSD (Ground Sampling Distance)), multi-spectral aerial images and matched DSMs (Digital Surface Models). There are three steps in the proposed method: first, shadows are detected with a morphological index, and corrected for NDVI (Normalized Difference Vegetation Index) computation; second, the NDVI is incorporated using a top-hat reconstruction of the DSM to obtain the initial building mask; finally, a graph cut optimization based on modified superpixel segmentation is carried out to consolidate building segments with high probability and thus eliminates segments that have low probability to be buildings. Experiments were performed over the whole Vaihingen dataset, covering 3.4 km2 with around 3000 buildings. The proposed algorithm effectively extracted 94 percent of the buildings with 87 percent correctness. This demonstrates that the proposed method achieved satisfactory results over a large dataset and has the potential for many practical applications. Introduction The identification and localization of buildings in an urban area is very important for planning, building analysis, automatic 3D reconstruction of building models and change detection (Qin and Gruen, 2014). The development of very high resolution (VHR) remote sensing images (Qin et al., 2013) creates a possible avenue to sense individual buildings in an urban scenario, e.g., Ikonos with 1-meter resolution, or Worldview with 0.5-meter resolution. Sensors with even higher resolution are in the planning stages (e.g., Geoeye-2 and Worldview-3 with 0.3-meter resolution). However, this increasing level of detail does not necessarily facilitate building detection with an improved accuracy (Huang and Zhang, 2011). Indeed, more detailed image contents actually increase spectral ambiguities in remotely sensed images, such as symbol patterns on the road, and big vehicles. Therefore, researchers have devoted a lot of effort toward using multisource data and designing better detection strategies to increase the building detection rate. Multispectral images provide shadow information as primitives for building locations. Furthermore, shadow information are especially effective in single image based methods (Huang and Zhang, 2012; Ok, 2013; Ok et al., 2013). Meanwhile, NDVI data extracted from a multispectral image can be used as vegetation indicators to eliminate trees. Vector features such as parallel lines and corner junctions reveal the characteristics of rectangular buildings, which have been investigated and used to develop single-image based methods for building detection (Lin and Nevatia, 1998; Sirmacek and Unsalan, 2011; Sirmacek and Unsalan, 2010; Sirmaçek and Unsalan, 2009). Lidar (Light Detection and Ranging) point clouds provide height information for a ground scene and are used for building detection. By subtracting the DTM (Digital Terrain Model) from the DSM (Digital Surface Model), a nDSM (normalized DSM) can be computed to obtain off-terrain points for building detection (Weidner and Förstner, 1995). In addition, the multi-return characteristics of lidar provide useful information to eliminate the vegetation for point clouds based methods (Ekhtari et al., 2008; Meng et al., 2009), to increase the accuracy of building detection. Both multispectral image and lidar point clouds have their advantages and deficiencies. Complex algorithms based on a single image usually have assumptions concerning building distribution and sometimes are only able to detect certain types of buildings. For example, methods based on feature point extraction from a single image are only able to detect isolated buildings with regular patterns, and methods relying on parallel lines are not able to detect dome roofs. As compared to multi-spectral images, lidar point clouds provide accurate height information, but less accurate boundaries. There are also null values for lidar point clouds due to occlusion and specular reflection from water surfaces on the roofs. Therefore, integration of both sources is a possible direction for improving building detection accuracy as well as robustness. There has been a spate of integrated methods proposed in the literature. Rottensteiner et al. (2007) and Rottensteiner et al. (2005) proposed a supervised classification-based building Rongjun Qin is with the Singapore ETH Center, Future Cities Laboratory, ETH, Zurich. 1 CREATE Way, #06-01 CREATE Tower, Singapore 138602 (rqin@student.ethz.ch). Wei Fang is with the Singapore ETH Center, Future Cities Laboratory, ETH, Zurich. 1 CREATE Way, #06-01 CREATE Tower, Singapore 138602, and the State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing (LIESMARS), Wuhan University, China. #129, Luoyu Road, Wuchang District, LIESMARS, Wuhan University, Wuhan, P. R. China, 430079. Photogrammetric Engineering & Remote Sensing Vol. 80, No. 8, September 2014, pp. 000–000. 0099-1112/14/8009–000 © 2014 American Society for Photogrammetry and Remote Sensing doi: 10.14358/PERS.80.9.000 PHOTOGRAMMETRIC ENGINEERING & REMOTE SENSING September 2014 37 This is the pre-print verison, technical content is appeared as it is, with only a few minor differences due to language editing

[1]  Jianya Gong,et al.  A Coarse Elevation Map-based Registration Method for Super-resolution of Three-line Scanner Images , 2013 .

[2]  P. Dare Shadow Analysis in High-Resolution Satellite Imagery of Urban Areas , 2005 .

[3]  P. Shi,et al.  Shadow information recovery in urban areas from very high resolution satellite imagery , 2007 .

[4]  John Trinder,et al.  Using the Dempster-Shafer method for the fusion of LIDAR data and multi-spectral images for building detection , 2005, Inf. Fusion.

[5]  Victor J. D. Tsai,et al.  A comparative study on shadow compensation of color aerial images in invariant color models , 2006, IEEE Transactions on Geoscience and Remote Sensing.

[6]  D. Grigillo,et al.  Automatic extraction and building change detection from digital surface model and multispectral orthophoto , 2011 .

[7]  Vladimir Kolmogorov,et al.  Graph cut based image segmentation with connectivity priors , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Xiaojun Yang,et al.  Detect Residential Buildings from Lidar and Aerial Photographs through Object-Oriented Land-Use Classification , 2012 .

[9]  Pascal Kosuth,et al.  Influence of training sampling protocol and of feature space optimization methods on supervised classification results , 2007, 2007 IEEE International Geoscience and Remote Sensing Symposium.

[10]  E. Baltsavias,et al.  A TEST OF AUTOMATIC ROAD EXTRACTION APPROACHES , 2006 .

[11]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Deren Li,et al.  Shadow Detection in Remotely Sensed Images Based on Self-Adaptive Feature Selection , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[13]  A. Gruen,et al.  3D change detection at street level using mobile laser scanning point clouds and terrestrial images , 2014 .

[14]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[15]  Xin Huang,et al.  A Multidirectional and Multiscale Morphological Index for Automatic Building Extraction from Multispectral GeoEye-1 Imagery , 2011 .

[16]  M. J. Valadan Zoej,et al.  AUTOMATIC BUILDING DETECTION FROM LIDAR POINT CLOUD DATA , 2008 .

[17]  Ramakant Nevatia,et al.  Building Detection and Description from a Single Intensity Image , 1998, Comput. Vis. Image Underst..

[18]  E. R. Davies,et al.  Machine vision - theory, algorithms, practicalities , 2004 .

[19]  Ian T. Jolliffe,et al.  Principal Component Analysis , 2002, International Encyclopedia of Statistical Science.

[20]  Michael Cramer,et al.  The DGPF-Test on Digital Airborne Camera Evaluation - Over- view and Test Design , 2010 .

[21]  Pierre Soille,et al.  Morphological Image Analysis: Principles and Applications , 2003 .

[22]  Luc Vincent,et al.  Morphological grayscale reconstruction in image analysis: applications and efficient algorithms , 1993, IEEE Trans. Image Process..

[23]  Lei Chen,et al.  Building detection in an urban area using lidar data and QuickBird imagery , 2012 .

[24]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[25]  Donald P. Greenberg,et al.  Color spaces for computer graphics , 1978, SIGGRAPH.

[26]  Çaglar Senaras,et al.  Automated Detection of Arbitrarily Shaped Buildings in Complex Environments From Monocular VHR Optical Satellite Imagery , 2013, IEEE Transactions on Geoscience and Remote Sensing.

[27]  C. Fraser,et al.  Automatic Detection of Residential Buildings Using LIDAR Data and Multispectral Imagery , 2010 .

[28]  Markus Gerke,et al.  The ISPRS benchmark on urban object classification and 3D building reconstruction , 2012 .

[29]  Cem Ünsalan,et al.  A Probabilistic Framework to Detect Buildings in Aerial and Satellite Images , 2011, IEEE Transactions on Geoscience and Remote Sensing.

[30]  John Trinder,et al.  Building detection by fusion of airborne laser scanner data and multi-spectral images : Performance evaluation and sensitivity analysis , 2007 .

[31]  Liangpei Zhang,et al.  Building Change Detection From Multitemporal High-Resolution Remotely Sensed Images Based on a Morphological Building Index , 2014, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[32]  Qian Du,et al.  A Hybrid Approach for Building Extraction From Spaceborne Multi-Angular Optical Imagery , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[33]  Cem Ünsalan,et al.  Urban Area Detection Using Local Feature Points and Spatial Voting , 2010, IEEE Geoscience and Remote Sensing Letters.

[34]  Jaewook Jung,et al.  Results of the ISPRS benchmark on urban object detection and 3D building reconstruction , 2014 .

[35]  Heng Tao Shen,et al.  Principal Component Analysis , 2009, Encyclopedia of Biometrics.

[36]  Wolfgang Förstner,et al.  Towards automatic building extraction from high-resolution digital elevation models , 1995 .

[37]  Ali Ozgun Ok,et al.  Automated detection of buildings from single VHR multispectral images using shadow information and graph cuts , 2013 .

[38]  Liangpei Zhang,et al.  Morphological Building/Shadow Index for Building Extraction From High-Resolution Imagery Over Urban Areas , 2012, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[39]  Le Wang,et al.  MORPHOLOGY-BASED BUILDING DETECTION FROM AIRBORNE LIDAR DATA , 2009 .

[40]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Kurt Kubik,et al.  Automatic Building Detection Using the Dempster-Shafer Algorithm , 2006 .

[42]  Cem Ünsalan,et al.  Urban-Area and Building Detection Using SIFT Keypoints and Graph Theory , 2009, IEEE Transactions on Geoscience and Remote Sensing.