Dense height map estimation from oblique aerial image sequences

We are concerned with dense height map reconstruction from aerial oblique image sequences. This configuration occurs when estimating a DSM (digital surface model) of areas where flying over is not allowed or for updating an on-board DSM, for instance, in trajectory planning with obstacle avoidance. We present a complete process starting from a partially calibrated sequence and leading to an estimated height map. The calibration step consists in refining the extrinsic parameters given by on-board ego-motion sensors (GPS and inertial measurement unit, IMU) by means of interest points tracking and bundle adjustment. We then propose a dense matching process based on the minimization of a multi-view pixelwise similarity criterion combined with a discretized L1-norm or total variation (TV) regularization term. Minimization is conducted thanks to an optimal graph-cut approach. Occlusions are accounted for without additional computational cost by a modification of the similarity criterion based on a dictionary of visibility patterns. Finally, two ways of refinement of the height map are proposed. The first one uses a local similarity minimization followed by non-linear Gaussian smoothing. The second relies on a novel approach to increase the height map resolution which combines multi-view 3-D reconstruction and image super-resolution. This method is validated on various synthetic and real aerial sequences, on either side-looking or forward-looking configurations.

[1]  K. Hanna Direct multi-resolution estimation of ego-motion and structure from motion , 1991, Proceedings of the IEEE Workshop on Visual Motion.

[2]  Reinhard Koch,et al.  Multi Viewpoint Stereo from Uncalibrated Video Sequences , 1998, ECCV.

[3]  Guy Le Besnerais,et al.  HEIGHT ESTIMATION USING AERIAL SIDE LOOKING IMAGE SEQUENCES , 2003 .

[4]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[5]  Steven Douglas Cochran,et al.  Adaptive vergence for the stereo matching of oblique imagery , 1995 .

[6]  Vladimir Kolmogorov,et al.  An experimental comparison of min-cut/max- flow algorithms for energy minimization in vision , 2001, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Long Quan,et al.  A Surface Reconstruction Method Using Global Graph Cut Optimization , 2006, International Journal of Computer Vision.

[8]  R. Zabih,et al.  Efficient Graph-Based Energy Minimization Methods in Computer Vision , 1999 .

[9]  Moon Gi Kang,et al.  Super-resolution image reconstruction: a technical overview , 2003, IEEE Signal Process. Mag..

[10]  Long Quan,et al.  A quasi-dense approach to surface reconstruction from uncalibrated images , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Gilles Foulon,et al.  Determination of a dense depth map from an image sequence: application to aerial imagery , 1998, Remote Sensing.

[12]  Nicholas Ayache,et al.  Accurate and robust stereovision with a large number of aerial images , 1995, Remote Sensing.

[13]  Sébastien Roy,et al.  Geo-consistency for wide multi-camera stereo , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[14]  Reinhard Koch,et al.  Self-calibration and metric reconstruction in spite of varying and unknown internal camera parameters , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[15]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  Sylvain Paris,et al.  Robust acquisition of 3D informations from short image sequences , 2003, Graph. Model..

[17]  Vladimir Kolmogorov,et al.  Generalized Multi-camera Scene Reconstruction Using Graph Cuts , 2003, EMMCVPR.

[18]  Frédéric Champagnat,et al.  An Improved Observation Model for Super-Resolution Under Affine Motion , 2006, IEEE Transactions on Image Processing.

[19]  Guy Demoment,et al.  Image reconstruction and restoration: overview of common estimation structures and problems , 1989, IEEE Trans. Acoust. Speech Signal Process..

[20]  P. Anandan,et al.  Hierarchical Model-Based Motion Estimation , 1992, ECCV.

[21]  Matthieu Cord,et al.  Building Detection and Reconstruction from Mid- and High-Resolution Aerial Imagery , 1998, Comput. Vis. Image Underst..

[22]  Takeo Kanade,et al.  A Multiple-Baseline Stereo , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Richard Szeliski,et al.  A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[24]  Andrew Calway,et al.  Interpolating Novel Views from Image Sequences by Probabilistic Depth Carving , 2004, ECCV.

[25]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[26]  Andrew W. Fitzgibbon,et al.  Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[27]  Tomoharu Nakahara,et al.  A Multiple-baseline Stereo Method , 2002 .

[28]  Harpreet S. Sawhney,et al.  Correlation-based estimation of ego-motion and structure from motion and stereo , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[29]  Richard Szeliski,et al.  Handling occlusions in dense multi-view stereo , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[30]  H. Ishikawa Global Optimization Using Embedded Graphs , 2000 .

[31]  Robert T. Collins,et al.  A space-sweep approach to true multi-image matching , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[32]  Richard Szeliski,et al.  Recovering 3D shape and motion from image streams using nonlinear least squares , 1993, Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[33]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[34]  Frédéric Champagnat,et al.  Dense optical flow by iterative local window registration , 2005, IEEE International Conference on Image Processing 2005.

[35]  Roberto Cipolla,et al.  A Statistical Consistency Check for the Space Carving Algorithm , 2000, BMVC.

[36]  Keith J. Hanna,et al.  Combining stereo and motion analysis for direct estimation of scene structure , 1993, 1993 (4th) International Conference on Computer Vision.

[37]  Richard Szeliski,et al.  Scene reconstruction from multiple cameras , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).

[38]  Steven M. Seitz,et al.  Photorealistic Scene Reconstruction by Voxel Coloring , 1997, International Journal of Computer Vision.

[39]  G. Winkler,et al.  Noise Reduction in Images: Some Recent Edge-Preserving Methods , 1998 .

[40]  Kiriakos N. Kutulakos,et al.  A Theory of Shape by Space Carving , 2000, International Journal of Computer Vision.

[41]  Pau Gargallo,et al.  Bayesian 3D modeling from images using multiple depth maps , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[42]  Ramakant Nevatia,et al.  Expandable Bayesian Networks for 3D Object Description from Multiple Views and Multiple Mode Inputs , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[43]  Ingemar J. Cox,et al.  A maximum-flow formulation of the N-camera stereo correspondence problem , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[44]  M. Sanfourche On the choice of the correlation term for multi-baseline stereo-vision , 2004 .

[45]  Larry S. Davis,et al.  A probabilistic framework for surface reconstruction from multiple images , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[46]  E. Baltsavias,et al.  Automatic Extraction of Man-Made Objects from Aerial and Space Images (II) , 1995 .

[47]  Yuichi Ohta,et al.  Occlusion detectable stereo-occlusion patterns in camera matrix , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.