Recovering absolute scale for Structure from Motion using the law of free fall

Abstract Reconstructions of objects (or scenes) using Structure from Motion (SfM) based on monocular camera or multiple uncalibrated cameras are only possible up to an unknown scale. Most of existing approaches for recovering absolute scale for SfM rely on additional constraints in absolute scale quantified by either metric data of the object (or scene) or motion data of the camera(s). This paper presented a novel approach for recovering absolute scale for SfM based on motion constraints defined by a relative free fall motion between the camera and the object. The proposed approach is strict because the adopted relative free fall motion is directly reconstructed by SfM and is therefore not complicated by the intrinsic problem that the motion of the camera recorded by external sensors cannot truly represent the motion of the center of perspective of the camera. Experiments of the proposed approach based on a widely used SfM-MVS pipeline had derived scale factors with relative errors smaller than 5%, yet returned camera motions that did not approximate with a reasonable degree of accuracy to what the law of free fall predicted. It is suggested that this deviation is owing to that SfM processes are not constrained by physical laws, and might be moderated by fusing kinematic constraints of free fall motion into bundle adjustment. In addition, uncertainties related to camera properties and experiment designs should also be reduced so that the proposed closed-form analytical solution for recovering absolute scale can work properly in practice. To our knowledge, this paper for the first time shows the potential of recovering absolute scale for geometric SfM reconstruction using a universal physical law.

[1]  Roland Siegwart,et al.  Real-time monocular visual odometry for on-road vehicles with 1-point RANSAC , 2009, 2009 IEEE International Conference on Robotics and Automation.

[2]  Simon Lucey,et al.  Hand Waving Away Scale , 2014, ECCV.

[3]  Mark W. Smith,et al.  Structure from Motion in the Geosciences , 2016 .

[4]  Richard Szeliski,et al.  Building Rome in a day , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[5]  C. Derek Martin,et al.  RockFall analyst: A GIS extension for three-dimensional and spatially distributed rockfall hazard modeling , 2007, Comput. Geosci..

[6]  Maxime Lhuillier Incremental Fusion of Structure-from-Motion and GPS Using Constrained Bundle Adjustments , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Chenghu Zhou,et al.  Rockfall hazard analysis using LiDAR and spatial modeling , 2010 .

[8]  Helder Araújo,et al.  Robustified Structure from Motion with rolling-shutter camera using straightness constraint , 2018, Pattern Recognit. Lett..

[9]  Jan-Michael Frahm,et al.  Towards Urban 3D Reconstruction from Video , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[10]  Clark C. Guest,et al.  High Accuracy Monocular SFM and Scale Correction for Autonomous Driving , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Jan-Michael Frahm,et al.  Building Rome on a Cloudless Day , 2010, ECCV.

[12]  Roland Siegwart,et al.  Absolute scale in structure from motion from a single vehicle mounted camera by exploiting nonholonomic constraints , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Carlos Castillo,et al.  Image-based surface reconstruction in geomorphometry - merits, limits and developments , 2016 .

[14]  M. Favalli,et al.  Multiview 3D reconstruction in geosciences , 2012, Comput. Geosci..

[15]  R. Leine,et al.  Simulation of rockfall trajectories with consideration of rock shape , 2014 .

[16]  Feng Cheng,et al.  Research on three-dimensional model reconstruction of slope erosion based on sequence images , 2017, Optics & Laser Technology.

[17]  Michael Felsberg,et al.  Structure and motion estimation from rolling shutter video , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[18]  Cai Meng,et al.  Monocular Visual Odometry and Obstacle Detection System Based on Ground Constraints , 2012, ICSR.

[19]  Camillo J. Taylor,et al.  Camera trajectory estimation using inertial sensor measurements and structure from motion results , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20]  Maarten Vergauwen,et al.  Image-based 3D acquisition of archaeological heritage and applications , 2001, VAST '01.

[21]  Richard Szeliski,et al.  Building Rome in a day , 2009, ICCV.

[22]  Gianfranco Forlani,et al.  PERFORMANCE EVALUATION OF A STRUCTURE AND MOTION STRATEGY IN ARCHITECTURE AND CULTURAL HERITAGE , 2012 .

[23]  S. Robson,et al.  Straightforward reconstruction of 3D surfaces and topography with a camera: Accuracy and geoscience application , 2012 .

[24]  Michel Dhome,et al.  Real-time vehicle global localisation with a single camera in dense urban areas: Exploitation of coarse 3D city models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Arko Lucieer,et al.  Direct Georeferencing of Ultrahigh-Resolution UAV Imagery , 2014, IEEE Transactions on Geoscience and Remote Sensing.

[26]  M. Westoby,et al.  ‘Structure-from-Motion’ photogrammetry: A low-cost, effective tool for geoscience applications , 2012 .

[27]  Sanjiv Singh,et al.  Monocular Visual Odometry using a Planar Road Model to Solve Scale Ambiguity , 2011, ECMR.

[28]  Chen Xiao,et al.  A novel approach to improve the precision of monocular visual odometry , 2015, 2015 IEEE International Conference on Information and Automation.

[29]  Roberto Scopigno,et al.  Fast Metric Acquisition with Mobile Devices , 2016, VMV.

[30]  Gabriel Ramírez-Torres,et al.  On-Line Dense Point Cloud Generation from Monocular Images with Scale Estimation , 2014, MICAI.

[31]  Hengxing Lan,et al.  Probabilistic modeling of rockfall trajectories: a review , 2015, Bulletin of Engineering Geology and the Environment.

[32]  Frank Dellaert,et al.  Line-Based Structure from Motion for Urban Environments , 2006, Third International Symposium on 3D Data Processing, Visualization, and Transmission (3DPVT'06).

[33]  Christian Wöhler,et al.  Monocular 3D scene reconstruction at absolute scale , 2009 .

[34]  Michel Dhome,et al.  Fast Odometry Integration in Local Bundle Adjustment-Based Visual SLAM , 2010, 2010 20th International Conference on Pattern Recognition.

[35]  David W. Murray,et al.  Using Learning of Speed to Stabilize Scale in Monocular Localization and Mapping , 2017, 2017 International Conference on 3D Vision (3DV).

[36]  Josechu J. Guerrero,et al.  True scaled 6 DoF egocentric localisation with monocular wearable systems , 2016, Image Vis. Comput..