Enhancment of dense urban digital surface models from VHR optical satellite stereo data by pre-segmentation and object detection

The generation of digital surface models (DSM) of urban areas from very high resolution (VHR) stereo satellite imagery requires advanced methods. In the classical approach of DSM generation from stereo satellite imagery, interest points are extracted and correlated between the stereo mates using an area based matching followed by a least-squares sub-pixel refinement step. After a region growing the 3D point list is triangulated to the resulting DSM. In urban areas this approach fails due to the size of the correlation window, which smoothes out the usual steep edges of buildings. Also missing correlations as for partly – in one or both of the images – occluded areas will simply be interpolated in the triangulation step. So an urban DSM generated with the classical approach results in a very smooth DSM with missing steep walls, narrow streets and courtyards. To overcome these problems algorithms from computer vision are introduced and adopted to satellite imagery. These algorithms do not work using local optimisation like the area-based matching but try to optimize a (semi-)global cost function. Analysis shows that dynamic programming approaches based on epipolar images like dynamic line warping or semiglobal matching yield the best results according to accuracy and processing time. These algorithms can also detect occlusions – areas not visible in one or both of the stereo images. Beside these also the time and memory consuming step of handling and triangulating large point lists can be omitted due to the direct operation on epipolar images and direct generation of a so called disparity image fitting exactly on the first of the stereo images. This disparity image – representing already a sort of a dense DSM – contains the distances measured in pixels in the epipolar direction (or a no-data value for a detected occlusion) for each pixel in the image. Despite the global optimization of the cost function many outliers, mismatches and erroneously detected occlusions remain, especially if only one stereo pair is available. To enhance these dense DSM – the disparity image – a pre-segmentation approach is presented in this paper. Since the disparity image is fitting exactly on the first of the two stereo partners (beforehand transformed to epipolar geometry) a direct correlation between image pixels and derived heights (the disparities) exist. This feature of the disparity image is exploited to integrate additional knowledge from the image into the DSM. This is done by segmenting the stereo image, transferring the segmentation information to the DSM and performing a statistical analysis on each of the created DSM segments. Based on this analysis and spectral information a coarse object detection and classification can be performed and in turn the DSM can be enhanced. After the description of the proposed method some results are shown and discussed.

[1]  S. Birchfiled A Pixel Dissimilarity Measure That Is Insensitive to Image Sampling , 1998 .

[2]  H. Mayer,et al.  LEVELS OF DETAIL IN 3 D BUILDING RECONSTRUCTION FROM LIDAR DATA , 2008 .

[3]  Peter Reinartz,et al.  Automatic generation of digital terrain models from Cartosat-1 stereo images , 2009 .

[4]  M. Lehner,et al.  Semi-Automatic Derivation of Digital Elevation Models from Stereoscopic 3-Line Scanner Data , 1992 .

[5]  K. Jacobsen,et al.  GEOMETRIC MODELS FOR THE ORIENTATION OF HIGH RESOLUTION OPTICAL SATELLITE SENSORS , 2005 .

[6]  P. Reinartz,et al.  DEM Generation from Very High Resolution Stereo Satellite Data in Urban Areas Using Dynamic Programming , 2005 .

[7]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[8]  Peter Reinartz,et al.  Towards Automated DEM Generation from High Resolution Stereo Satellite Images , 2008 .

[10]  A. Habib,et al.  Epipolar resampling of space-borne linear array scanner scenes using parallel projection , 2006 .

[11]  C. Brenner,et al.  3D URBAN GIS FROM LASER ALTIMETER AND 2D MAP DATA , 1997 .

[12]  Luc Vincent,et al.  Morphological grayscale reconstruction in image analysis: applications and efficient algorithms , 1993, IEEE Trans. Image Process..

[13]  BLUNDER ELIMINATION TECHNIQUES IN ADAPTIVE AUTOMATIC TERRAIN EXTRACTION , 2008 .

[14]  Peter Reinartz,et al.  Refinement of urban digital elevation models from very high resolution stereo satellite images , 2009 .

[15]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[16]  Wolfgang Schickler,et al.  SURFACE ESTIMATION BASED ON LIDAR , 2001 .

[17]  Cem Ünsalan,et al.  Urban-Area and Building Detection Using SIFT Keypoints and Graph Theory , 2009, IEEE Transactions on Geoscience and Remote Sensing.