Efficient structure from motion for large-scale UAV images: A review and a comparison of SfM tools

Abstract Unmanned aerial vehicle (UAV) images have gained extensive attention in varying fields, and the Structure from Motion (SfM) technique has become the gold standard for aerial triangulation of UAV images. With increasing data volume caused by the use of multi-view and high-resolution imaging systems and the enhancement of UAV platform’s endurance, the capability for orientation of large-scale UAV images is becoming a prominent and necessary feature for SfM-based solutions. A classical SfM pipeline consists of three major steps, i.e., (i) feature extraction for an individual image, (ii) feature matching for each image pair, and (iii) parameter solving based on iterative bundle adjustment. Most of the time costs are consumed in the second and third steps. This can be explained from three main aspects. First, for feature matching the large number of images and high overlapping degrees cause high combinational complexity of match pairs. Second, the efficiency of commonly utilized techniques for outlier removal would be seriously degenerated because of high outlier ratios of initial matches. Third, for parameter solving of camera poses and scene structures, the iterative execution of bundle adjustment (BA) leads to high computational costs in the incremental SfM workflow. Thus, this paper gives a systematic survey of the state-of-the-art for match pair selection from both ordered and unordered datasets, for outlier removal of initial matches dominated by outliers, and for efficiency improvement of BA, and conducts an experimental evaluation for six well-known SfM-based software packages on UAV image orientation.

[1]  Olaf Hellwich,et al.  MODERN METHODS OF BUNDLE ADJUSTMENT ON THE GPU , 2016 .

[2]  Hideo Saito,et al.  Fast Line Description for Line-based SLAM , 2012, BMVC.

[3]  Cecilia Di Ruberto Generalized Hough Transform for Shape Matching , 2012 .

[4]  Jubai An,et al.  A Simple and Robust Feature Point Matching Algorithm Based on Restricted Spatial Order Constraints for Aerial Image Registration , 2012, IEEE Transactions on Geoscience and Remote Sensing.

[5]  Qingwu Hu,et al.  Robust feature matching via support-line voting and affine-invariant ratios , 2017 .

[6]  Jan-Michael Frahm,et al.  A Vote-and-Verify Strategy for Fast Spatial Verification in Image Retrieval , 2016, ACCV.

[7]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[8]  Dieter Fritsch,et al.  STRUCTURELESS BUNDLE ADJUSTMENT WITH SELF-CALIBRATION USING ACCUMULATED CONSTRAINTS , 2016 .

[9]  Bing-Yu Chen,et al.  Robust Feature Matching with Alternate Hough and Inverted Hough Transforms , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[10]  Fabio Remondino,et al.  AUTOMATIC ORIENTATION OF LARGE BLOCKS OF OBLIQUE IMAGES , 2013 .

[11]  Yongpeng Wu,et al.  RFVTM: A Recovery and Filtering Vertex Trichotomy Matching for Remote Sensing Image Registration , 2022, IEEE Transactions on Geoscience and Remote Sensing.

[12]  Bin Li,et al.  RSCJ: Robust Sample Consensus Judging Algorithm for Remote Sensing Image Registration , 2012, IEEE Geoscience and Remote Sensing Letters.

[13]  Richard Szeliski,et al.  Building Rome in a day , 2009, ICCV.

[14]  Maoteng Zheng,et al.  A New GPU Bundle Adjustment Method for Large-Scale Data , 2017 .

[15]  Subhashis Banerjee,et al.  Divide and conquer: A hierarchical approach to large-scale structure-from-motion , 2017, Comput. Vis. Image Underst..

[16]  Tat-Seng Chua,et al.  Tour the world: Building a web-scale landmark recognition engine , 2009, CVPR.

[17]  Jan-Michael Frahm,et al.  Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs , 2008, ECCV.

[18]  Bing-Yu Chen,et al.  Co-Segmentation Guided Hough Transform for Robust Feature Matching , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Pengjie Tao,et al.  Geometrical Consistency Voting Strategy for Outlier Detection in Image Matching , 2016 .

[20]  Long Quan,et al.  Graph-Based Consistent Matching for Structure-from-Motion , 2016, ECCV.

[21]  Zhizhong Kang,et al.  A Robust Image Matching Method based on Optimized BaySAC , 2014 .

[22]  Jan-Michael Frahm,et al.  Reconstructing the World* in Six Days *(As Captured by the Yahoo 100 Million Image Dataset) , 2015, CVPR 2015.

[23]  Shuigeng Zhou,et al.  A Novel Image Registration Algorithm for Remote Sensing Under Affine Transformation , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[24]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.

[25]  Zhengyou Zhang,et al.  Determining the Epipolar Geometry and its Uncertainty: A Review , 1998, International Journal of Computer Vision.

[26]  Shuhan Shen,et al.  Progressive Large-Scale Structure-from-Motion with Orthogonal MSTs , 2018, 2018 International Conference on 3D Vision (3DV).

[27]  Xin Liu,et al.  Hybrid Parallel Bundle Adjustment for 3D Scene Reconstruction with Massive Points , 2012, Journal of Computer Science and Technology.

[28]  Jiri Matas,et al.  Locally Optimized RANSAC , 2003, DAGM-Symposium.

[29]  Naif Alajlan,et al.  Shape retrieval using triangle-area representation and dynamic space warping , 2007, Pattern Recognit..

[30]  Hujun Bao,et al.  Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion , 2015, IEEE Transactions on Image Processing.

[31]  Gary R. Bradski,et al.  ORB: An efficient alternative to SIFT or SURF , 2011, 2011 International Conference on Computer Vision.

[32]  Qi Tian,et al.  SIFT Meets CNN: A Decade Survey of Instance Retrieval , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  S. Ullman The interpretation of structure from motion , 1979, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[34]  Gamini Dissanayake,et al.  Linear SFM: A hierarchical approach to solving structure-from-motion problems by decoupling the linear and nonlinear components , 2018 .

[35]  J. Gonçalves,et al.  UAV photogrammetry for topographic monitoring of coastal areas , 2015 .

[36]  Lei Yan,et al.  Line matching based on planar homography for stereo aerial images , 2015 .

[37]  Jie Jiang,et al.  A Robust Point-Matching Algorithm Based on Integrated Spatial Structure Constraint for Remote Sensing Image Registration , 2016, IEEE Geoscience and Remote Sensing Letters.

[38]  Zhuowen Tu,et al.  Robust Point Matching via Vector Field Consensus , 2014, IEEE Transactions on Image Processing.

[39]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[40]  Frank Dellaert,et al.  Incremental Light Bundle Adjustment: Probabilistic Analysis and Application to Robotic Navigation , 2015 .

[41]  S. Verykokou,et al.  Automatic Rough Georeferencing of Multiview Oblique and Vertical Aerial Image Datasets of Urban Scenes , 2016 .

[42]  Ken Sakurada,et al.  OpenVSLAM: A Versatile Visual SLAM Framework , 2019, ACM Multimedia.

[43]  Jiri Matas,et al.  Optimal Randomized RANSAC , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Amin Sedaghat,et al.  High-resolution image registration based on improved SURF detector and localized GTM , 2018, International Journal of Remote Sensing.

[45]  Shaojie Shen,et al.  VINS-Mono: A Robust and Versatile Monocular Visual-Inertial State Estimator , 2017, IEEE Transactions on Robotics.

[46]  George Vosselman,et al.  Minimal Camera Networks for 3D Image Based Modeling of Cultural Heritage Objects , 2014, Sensors.

[47]  Qingwu Hu,et al.  4FP-Structure: A Robust Local Region Feature Descriptor , 2017 .

[48]  P. J. Narayanan,et al.  Practical Time Bundle Adjustment for 3D Reconstruction on the GPU , 2010, ECCV Workshops.

[49]  Subhashis Banerjee,et al.  Divide and Conquer: Efficient Large-Scale Structure from Motion Using Graph Partitioning , 2014, ACCV.

[50]  Davide Scaramuzza,et al.  SVO: Fast semi-direct monocular visual odometry , 2014, 2014 IEEE International Conference on Robotics and Automation (ICRA).

[51]  Ming Zhao,et al.  Bi-SOGC: A Graph Matching Approach Based on Bilateral KNN Spatial Orders Around Geometric Centers for Remote Sensing Image Registration , 2013, IEEE Geoscience and Remote Sensing Letters.

[52]  Wanshou Jiang,et al.  Efficient match pair selection for oblique UAV images based on adaptive vocabulary tree , 2020 .

[53]  Andrew Zisserman,et al.  MLESAC: A New Robust Estimator with Application to Estimating Image Geometry , 2000, Comput. Vis. Image Underst..

[54]  Noah Snavely,et al.  Scene Reconstruction and Visualization from Internet Photo Collections: A Survey , 2011, IPSJ Trans. Comput. Vis. Appl..

[55]  Gamini Dissanayake,et al.  L2-SIFT: SIFT feature extraction and matching for large images in large-scale aerial photogrammetry , 2014 .

[56]  Richard Szeliski,et al.  Modeling the World from Internet Photo Collections , 2008, International Journal of Computer Vision.

[57]  Xiaoyu Chen,et al.  Unmanned Aerial Vehicle for Remote Sensing Applications - A Review , 2019, Remote. Sens..

[58]  Wanshou Jiang,et al.  Efficient Structure from Motion for Oblique UAV Images Based on Maximal Spanning Tree Expansions , 2017, ArXiv.

[59]  Yannis Avrithis,et al.  Speeded-up, relaxed spatial matching , 2011, 2011 International Conference on Computer Vision.

[60]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[61]  Feifei Zhang,et al.  Robust Image Registration Using Structure Features , 2014, IEEE Geoscience and Remote Sensing Letters.

[62]  Xia Li,et al.  A Feature Point Matching Based on Spatial Order Constraints Bilateral-Neighbor Vote , 2015, IEEE Transactions on Image Processing.

[63]  Guna Seetharaman,et al.  Fast Structure from Motion for Sequential and Wide Area Motion Imagery , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[64]  Cordelia Schmid,et al.  Evaluation of GIST descriptors for web-scale image search , 2009, CIVR '09.

[65]  Wanshou Jiang,et al.  Hierarchical Motion Consistency Constraint for Efficient Geometrical Verification in UAV Image Matching , 2018, ArXiv.

[66]  Frank Dellaert,et al.  Out-of-Core Bundle Adjustment for Large-Scale 3D Reconstruction , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[67]  Parvaneh Saeedi,et al.  Robust Weighted Graph Transformation Matching for Rigid and Nonrigid Image Registration , 2012, IEEE Transactions on Image Processing.

[68]  Olivier Stasse,et al.  MonoSLAM: Real-Time Single Camera SLAM , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[69]  F. Nex,et al.  OBLIQUE MULTI-CAMERA SYSTEMS - ORIENTATION AND DENSE MATCHING ISSUES , 2014 .

[70]  Reinhard Koch,et al.  An efficient and robust line segment matching approach based on LBD descriptor and pairwise geometric consistency , 2013, J. Vis. Commun. Image Represent..

[71]  S. Guha,et al.  Approximation Algorithms for Connected Dominating Sets , 1998, Algorithmica.

[72]  Zhou Zhang,et al.  Automated Ortho-Rectification of UAV-Based Hyperspectral Data over an Agricultural Field Using Frame RGB Imagery , 2016, Remote. Sens..

[73]  Lei Yan,et al.  RBA: Reduced Bundle Adjustment for oblique aerial photogrammetry , 2016 .

[74]  Xin Wang,et al.  Structure from motion for ordered and unordered image sets based on random k-d forests and global pose estimation , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[75]  Guna Seetharaman,et al.  Robust Camera Pose Refinement and Rapid SfM for Multiview Aerial Imagery—Without RANSAC , 2015, IEEE Geoscience and Remote Sensing Letters.

[76]  Ali Ridho Barakbah,et al.  Hierarchical K-means: an algorithm for centroids initialization for K-means , 2007 .

[77]  Marc Pollefeys,et al.  Fast robust large-scale mapping from video and internet photo collections , 2010 .

[78]  Zhihua Xu,et al.  EXTRACTION OF IMAGE TOPOLOGICAL GRAPH FOR RECOVERING THE SCENE GEOMETRY FROM UAV COLLECTIONS , 2014 .

[79]  Arko Lucieer,et al.  Direct Georeferencing of Ultrahigh-Resolution UAV Imagery , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[80]  Renaud Marlet,et al.  Virtual Line Descriptor and Semi-Local Graph Matching Method for Reliable Feature Correspondence , 2012, BMVC.

[81]  Shin'ichi Satoh,et al.  Geometric verification using semi-2D constraints for 3D object retrieval , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[82]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[83]  Jan-Michael Frahm,et al.  USAC: A Universal Framework for Random Sample Consensus , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[84]  Sei Ikeda,et al.  Visual SLAM algorithms: a survey from 2010 to 2016 , 2017, IPSJ Transactions on Computer Vision and Applications.

[85]  Wolfram Burgard,et al.  G2o: A general framework for graph optimization , 2011, 2011 IEEE International Conference on Robotics and Automation.

[86]  Jan-Michael Frahm,et al.  Structure-from-Motion Revisited , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[87]  Sunglok Choi,et al.  Performance Evaluation of RANSAC Family , 2009, BMVC.

[88]  Torsten Sattler,et al.  SCRAMSAC: Improving RANSAC's efficiency with a spatial consistency filter , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[89]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[90]  Wenbing Tao,et al.  A Constrained Radial Agglomerative Clustering Algorithm for Efficient Structure From Motion , 2018, IEEE Signal Processing Letters.

[91]  Yanning Zhang,et al.  Hierarchical Clustering-Aligning Framework Based Fast Large-Scale 3D Reconstruction Using Aerial Imagery , 2019, Remote. Sens..

[92]  Zhanyi Hu,et al.  MSLD: A robust descriptor for line matching , 2009, Pattern Recognit..

[93]  Daniel Cremers,et al.  LSD-SLAM: Large-Scale Direct Monocular SLAM , 2014, ECCV.

[94]  Andrew Zisserman,et al.  Efficient Visual Search of Videos Cast as Text Retrieval , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[95]  Manolis I. A. Lourakis,et al.  SBA: A software package for generic sparse bundle adjustment , 2009, TOMS.

[96]  Jan-Michael Frahm,et al.  Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs , 2008, International Journal of Computer Vision.

[97]  J. M. M. Montiel,et al.  ORB-SLAM: A Versatile and Accurate Monocular SLAM System , 2015, IEEE Transactions on Robotics.

[98]  Zhanyi Hu,et al.  Efficient Large-Scale Structure From Motion by Fusing Auxiliary Imaging Information , 2015, IEEE Transactions on Image Processing.

[99]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[100]  Fabio Remondino,et al.  Aerial multi-camera systems: Accuracy and block triangulation issues , 2015 .

[101]  Jan-Michael Frahm,et al.  A Comparative Analysis of RANSAC Techniques Leading to Adaptive Real-Time Random Sample Consensus , 2008, ECCV.

[102]  Amin Sedaghat,et al.  Remote Sensing Image Matching Based on Adaptive Binning SIFT Descriptor , 2015, IEEE Transactions on Geoscience and Remote Sensing.

[103]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[104]  Fabio Remondino,et al.  Oblique aerial imagery : a review , 2015 .

[105]  Z. Du,et al.  Reliable Spatial Relationship Constrained Feature Point Matching of Oblique Aerial Images , 2015 .

[106]  Mozhdeh Shahbazi,et al.  Evolutionary Optimization for Robust Epipolar-Geometry Estimation and Outlier Detection , 2017, Algorithms.

[107]  Mingwei Cao,et al.  Fast and robust feature tracking for 3D reconstruction , 2019, Optics & Laser Technology.

[108]  Wanshou Jiang,et al.  Efficient SfM for Oblique UAV Images: From Match Pair Selection to Geometrical Verification , 2018, Remote. Sens..

[109]  Liang Yang,et al.  UAV-Based Oblique Photogrammetry for Outdoor Data Acquisition and Offsite Visual Inspection of Transmission Line , 2017, Remote. Sens..

[110]  Kurt Konolige,et al.  Sparse Sparse Bundle Adjustment , 2010, BMVC.

[111]  Pascal Monasse,et al.  Adaptive Structure from Motion with a Contrario Model Estimation , 2012, ACCV.

[112]  Ran Wang,et al.  Skeletal camera network embedded structure-from-motion for 3D scene reconstruction from UAV images , 2016 .

[113]  Maoguo Gong,et al.  A Novel Point-Matching Algorithm Based on Fast Sample Consensus for Image Registration , 2015, IEEE Geoscience and Remote Sensing Letters.

[114]  Yong Zhang,et al.  Block Partitioning and Merging for Processing Large-Scale Structure From Motion Problems in Distributed Manner , 2019, IEEE Access.

[115]  Wanshou Jiang,et al.  Reliable image matching via photometric and geometric constraints structured by Delaunay triangulation , 2019, ISPRS Journal of Photogrammetry and Remote Sensing.

[116]  José Ruíz Ascencio,et al.  Visual simultaneous localization and mapping: a survey , 2012, Artificial Intelligence Review.

[117]  Jan-Michael Frahm,et al.  From single image query to detailed 3D reconstruction , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[118]  Jingtai Liu,et al.  Photography Constraint Aided Keyframe Selection and Matching Method for UAV 3D Reconstruction , 2018, 2018 Chinese Control And Decision Conference (CCDC).

[119]  Juan D. Tardós,et al.  ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras , 2016, IEEE Transactions on Robotics.

[120]  Andrea Fusiello,et al.  Hierarchical structure-and-motion recovery from uncalibrated images , 2015, Comput. Vis. Image Underst..

[121]  M. Havlena,et al.  Recent developments in large-scale tie-point matching , 2016 .

[122]  Arturo Espinosa-Romero,et al.  A robust Graph Transformation Matching for non-rigid registration , 2009, Image Vis. Comput..

[123]  Andrea Fusiello,et al.  Improving the efficiency of hierarchical structure-and-motion , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[124]  Ming Zhao,et al.  A Robust Delaunay Triangulation Matching for Multispectral/Multidate Remote Sensing Image Registration , 2015, IEEE Geoscience and Remote Sensing Letters.

[125]  Kai Zhang,et al.  A Robust Point-Matching Algorithm for Remote Sensing Image Registration , 2014, IEEE Geoscience and Remote Sensing Letters.

[126]  Lei Yan,et al.  ParallaxBA: bundle adjustment using parallax angle feature parametrization , 2015, Int. J. Robotics Res..

[127]  Xuelong Li,et al.  When Location Meets Social Multimedia , 2015, ACM Transactions on Intelligent Systems and Technology.