OSSIM: An Object-Based Multiview Stereo Algorithm Using SSIM Index Matching Cost

Multiview stereo (MVS) is a crucial process in image-based automatic 3-D reconstruction and mapping applications. In a dense matching process, the matching cost is generally computed between image pairs, making the efficiency low due to the large number of stereo pairs. This paper presents a novel object-based MVS algorithm using structural similarity (SSIM) index matching cost in a coarse-to-fine workflow. As far as we know, this is the first time SSIM index is introduced to calculate the matching cost of MVS applications. In contrast to classical stereo methods, the proposed object-based structural similarity (OSSIM) method computes only a depth map for each image. Thus, the efficiency can be greatly improved when the overlap between images is large. To obtain an optimized depth map, the winner-take-all and semi-global matching strategies are implemented. Moreover, an object-based multiview consistency checking strategy is also proposed to eliminate wrong matches and perform pixelwise view selection. The proposed method was successfully applied on a close-range Fountain-P11 data set provided by EPFL and aerial data sets of Vaihingen and Zürich by the ISPRS. Experimental results demonstrate that the proposed method can deliver matches at high completeness and accuracy. For the Vaihingen data set, the correctness and completeness rate were 71.12% and 95.99% with an RMSE of 2.8 GSD. For the Foutain-P11 data set, the proposed method outperformed the other existing methods with the ratio of pixels less than 2 cm. Extensive comparison using Zürich data set shows that it can derive results comparable to the state-of-the-art software (PhotoScan, Pix4d, and Smart3D) in urban buildings areas.

[1]  I. Colomina,et al.  Unmanned aerial systems for photogrammetry and remote sensing: A review , 2014 .

[2]  Heiko Hirschmüller,et al.  Semi-Global Matching-Motivation, Developments and Applications , 2011 .

[3]  Nanning Zheng,et al.  Stereo Matching Using Belief Propagation , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Takeo Kanade,et al.  Stereo by Intra- and Inter-Scanline Search Using Dynamic Programming , 1985, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  N. Haala The Landscape of Dense Image Matching Algorithms , 2013 .

[6]  Ines Ernst,et al.  Mutual Information Based Semi-Global Stereo Matching on the GPU , 2008, ISVC.

[7]  Pascal Fua,et al.  Efficient large-scale multi-view stereo for ultra high-resolution image sets , 2011, Machine Vision and Applications.

[8]  Reinhard Koch,et al.  Automated reconstruction of 3D scenes from sequences of images , 2000 .

[9]  Michael Goesele,et al.  Multi-View Stereo for Community Photo Collections , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[10]  Fabio Remondino,et al.  Aerial multi-camera systems: Accuracy and block triangulation issues , 2015 .

[11]  Roberto Cipolla,et al.  Multi-view stereo via volumetric graph-cuts , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[12]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[13]  Charles K. Toth,et al.  Remote sensing platforms and sensors: A survey , 2016 .

[14]  M. Pierrot-Deseilligny,et al.  A MULTIRESOLUTION AND OPTIMIZATION-BASED IMAGE MATCHING APPROACH : AN APPLICATION TO SURFACE RECONSTRUCTION FROM SPOT 5-HRS STEREO IMAGERY , 2006 .

[15]  Vincent Lepetit,et al.  DAISY: An Efficient Dense Descriptor Applied to Wide-Baseline Stereo , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Li Yan,et al.  A Multi-View Dense Image Matching Method for High-Resolution Aerial Imagery Based on a Graph Network , 2016, Remote. Sens..

[17]  M. Rothermel,et al.  Oblique Image Data Processing – Potential , Experiences and Recommendations , 2013 .

[18]  G. Jóźków,et al.  Analysis of Point Cloud Generation from UAS Images , 2014 .

[19]  Radu Horaud,et al.  Topology-Adaptive Mesh Deformation for Surface Evolution, Morphing, and Multiview Reconstruction , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[20]  Shai Avidan,et al.  Semi-Global Matching: A Principled Derivation in Terms of Message Passing , 2014, GCPR.

[21]  M. Rothermel,et al.  QUALITY OF 3D POINT CLOUDS FROM HIGHLY OVERLAPPING UAV IMAGERY , 2013 .

[22]  Fabio Remondino,et al.  State of the art in high density image matching , 2014 .

[23]  Richard Szeliski,et al.  A Comparative Study of Energy Minimization Methods for Markov Random Fields with Smoothness-Based Priors , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Andrew Zisserman,et al.  A PLANE-SWEEP STRATEGY FOR THE 3D RECONSTRUCTION OF BUILDINGS FROM MULTIPLE IMAGES , 2000 .

[25]  A. Behan Digital Photogrammetry: A Practical Course (Third Edition) , 2010 .

[26]  Konrad Schindler,et al.  Massively Parallel Multiview Stereopsis by Surface Normal Diffusion , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[27]  Christian Früh,et al.  Automated texture mapping of 3D city models with oblique aerial imagery , 2004, Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004..

[28]  Zhou Wang,et al.  Why is image quality assessment so difficult? , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[29]  Vincent Lepetit,et al.  A fast local descriptor for dense matching , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[30]  H. Hirschmüller Accurate and Efficient Stereo Processing by Semi-Global Matching and Mutual Information , 2005, CVPR.

[31]  Christian Heipke,et al.  Information from imagery: ISPRS scientific vision and research agenda , 2016 .

[32]  Tomás Pajdla,et al.  Multi-view reconstruction preserving weakly-supported surfaces , 2011, CVPR 2011.

[33]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[34]  M. Downey,et al.  SEMI-GLOBAL MATCHING : AN ALTERNATIVE TO LIDAR FOR DSM GENERATION ? , 2010 .

[35]  Jan-Michael Frahm,et al.  Pixelwise View Selection for Unstructured Multi-View Stereo , 2016, ECCV.

[36]  J. P. Lewis Fast Normalized Cross-Correlation , 2010 .

[37]  Mathias Rothermel,et al.  Extracting 3D urban models from oblique aerial images , 2015, 2015 Joint Urban Remote Sensing Event (JURSE).

[38]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  M. Pierrot Deseilligny,et al.  APERO, AN OPEN SOURCE BUNDLE ADJUSMENT SOFTWARE FOR AUTOMATIC CALIBRATION AND ORIENTATION OF SET OF IMAGES , 2012 .

[40]  Heiko Hirschmüller,et al.  Stereo Processing by Semiglobal Matching and Mutual Information , 2008, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Alan C. Bovik,et al.  Blind Image Quality Assessment: From Natural Scene Statistics to Perceptual Quality , 2011, IEEE Transactions on Image Processing.

[42]  M. Rothermel,et al.  SURE – The ifp Software for Dense Image Matching , 2013 .

[43]  Alfred Schmitt,et al.  Real-Time Stereo by using Dynamic Programming , 2003, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[44]  Pascal Fua,et al.  On benchmarking camera calibration and multi-view stereo for high resolution imagery , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  Robert T. Collins,et al.  A space-sweep approach to true multi-image matching , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[46]  Xiaoyan Hu,et al.  Least Commitment, Viewpoint-Based, Multi-view Stereo , 2012, 2012 Second International Conference on 3D Imaging, Modeling, Processing, Visualization & Transmission.

[47]  F. Nex,et al.  OBLIQUE MULTI-CAMERA SYSTEMS - ORIENTATION AND DENSE MATCHING ISSUES , 2014 .

[48]  Enhua Wu,et al.  Constant Time Weighted Median Filtering for Stereo Matching and Beyond , 2013, 2013 IEEE International Conference on Computer Vision.

[49]  Jean-Philippe Pons,et al.  High Accuracy and Visibility-Consistent Dense Multiview Stereo , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[50]  Richard Szeliski,et al.  A Comparison and Evaluation of Multi-View Stereo Reconstruction Algorithms , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[51]  M. Rothermel,et al.  SURE : PHOTOGRAMMETRIC SURFACE RECONSTRUCTION FROM IMAGER Y , 2013 .

[52]  Mathias Rothermel,et al.  Fast and Robust Generation of Semantic Urban Terrain Models from UAV Video Streams , 2014, 2014 22nd International Conference on Pattern Recognition.

[53]  Radim Sara,et al.  Refinement of Surface Mesh for Accurate Multi-View Reconstruction , 2010, Int. J. Virtual Real..

[54]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[55]  Jan-Michael Frahm,et al.  PatchMatch Based Joint View Selection and Depthmap Estimation , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[56]  Alan C. Bovik,et al.  A Statistical Evaluation of Recent Full Reference Image Quality Assessment Algorithms , 2006, IEEE Transactions on Image Processing.