ENHANCEMENT OF DEPTH MAP BY FUSION USING ADAPTIVE AND SEMANTIC-GUIDED SPATIOTEMPORAL FILTERING

Abstract. Extracting detailed geometric information about a scene relies on the quality of the depth maps (e.g. Digital Elevation Surfaces, DSM) to enhance the performance of 3D model reconstruction. Elevation information from LiDAR is often expensive and hard to obtain. The most common approach to generate depth maps is through multi-view stereo (MVS) methods (e.g. dense stereo image matching). The quality of single depth maps, however, is often prone to noise, outliers, and missing data points due to the quality of the acquired image pairs. A reference multi-view image pair must be noise-free and clear to ensure high-quality depth maps. To avoid such a problem, current researches are headed toward fusing multiple depth maps to recover the shortcomings of single-depth maps resulted from a single pair of multi-view images. Several approaches tackled this problem by merging and fusing depth maps, using probabilistic and deterministic methods, but few discussed how these fused depth maps can be refined through adaptive spatiotemporal analysis algorithms (e.g. spatiotemporal filters). The motivation is to push towards preserving the high precision and detail level of depth maps while optimizing the performance, robustness, and efficiency of the algorithm.

[1]  Horst Bischof,et al.  Probabilistic Range Image Integration for DSM and True-Orthophoto Generation , 2013, SCIA.

[2]  Georg Kuschk,et al.  FUSION OF MULTI-RESOLUTION DIGITAL SURFACE MODELS , 2013 .

[3]  Rongjun Qin,et al.  RPC STEREO PROCESSOR (RSP) – A SOFTWARE PACKAGE FOR DIGITAL SURFACE MODEL AND ORTHOPHOTO GENERATION FROM SATELLITE STEREO IMAGERY , 2016 .

[4]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[5]  Ruigang Yang,et al.  Spatial-Temporal Fusion for High Accuracy Depth Maps Using Dynamic MRFs , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Georg Kuschk,et al.  LARGE SCALE URBAN RECONSTRUCTION FROM REMOTE SENSING IMAGERY , 2013 .

[7]  Enric Meinhardt,et al.  Automatic 3D Reconstruction from Multi-date Satellite Images , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[8]  Yi Dong,et al.  A comparison of stereo and multiview 3-D reconstruction using cross-sensor satellite imagery , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[9]  Rongjun Qin,et al.  Automated 3D recovery from very high resolution multi-view satellite images , 2019, ArXiv.

[10]  Rongjun Qin,et al.  A Hierarchical Building Detection Method for Very High Resolution Remotely Sensed Images Combined with DSM Using Graph Cut Optimization , 2014 .

[11]  Wuttipong Kumwilaisak,et al.  Optimal depth recovery using image guided TGV with depth confidence for high-quality view synthesis , 2016, J. Vis. Commun. Image Represent..

[12]  Zhaoqi Wang,et al.  Reliable Fusion of Stereo Matching and Depth Sensor for High Quality Dense Depth Maps , 2015, Sensors.

[13]  Daniel Cremers,et al.  Spatially Regularized Fusion of Multiresolution Digital Surface Models , 2017, IEEE Transactions on Geoscience and Remote Sensing.

[14]  Rongjun Qin,et al.  A critical analysis of satellite stereo pairs for digital surface model generation and a matching quality prediction model , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.