GPU-accelerated feature tracking for 3D reconstruction

Abstract 3D reconstruction based on structure from motion is one of the most techniques to produce sparse point-cloud model and camera parameter. However, this technique heavily relies on feature tracking method to obtain feature correspondences, then resulting in a heavy computation burden. To speed up 3D reconstruction, in this paper, we design a novel GPU-accelerated feature tracking (GFT) method for large-scale structure from motion (SFM)-based 3D reconstruction. The proposed GFT method consists of GPU-based Gaussian of image (DOG) keypoint detector, RootSIFT descriptor extractor, k nearest matching, and outlier removing. Firstly, our GPU-based DOG implementation can detect thousands of keypoints in real-time, whose speed is 30 times faster than that of the CPU version. Secondly, our GPU-based RootSIFT descriptor can compute thousands of descriptors in real-time. Thirdly, our GPU-based descriptor matching is 10 times faster than that of the state-of-the-art methods. Finally, we conduct thorough experiments on different datasets to evaluate the proposed method. Experimental results demonstrate the effectiveness and efficiency of the proposed method.

[1]  Long Quan,et al.  Fast Descriptors and Correspondence Propagation for Robust Global Point Cloud Registration , 2017, IEEE Transactions on Image Processing.

[2]  Andreas Lanitis,et al.  Model-based generation of personalized full-body 3D avatars from uncalibrated multi-view photographs , 2017, Multimedia Tools and Applications.

[3]  Antoine Manzanera,et al.  Video Extruder: a semi-dense point tracker for extracting beams of trajectories in real time , 2014, Journal of Real-Time Image Processing.

[4]  Wei Jia,et al.  Robust bundle adjustment for large-scale structure from motion , 2017, Multimedia Tools and Applications.

[5]  Wei Jia,et al.  Fast and robust absolute camera pose estimation with known focal length , 2018, Neural Computing and Applications.

[6]  Gang Wang,et al.  ROML: A Robust Feature Correspondence Approach for Matching Objects in A Set of Images , 2014, International Journal of Computer Vision.

[7]  Niloy J. Mitra,et al.  Coupled structure-from-motion and 3D symmetry detection for urban facades , 2014, ACM Trans. Graph..

[8]  Antoine Manzanera,et al.  Real Time Semi-dense Point Tracking , 2012, ICIAR.

[9]  Chao Peng,et al.  A GPU-Accelerated Approach for Feature Tracking in Time-Varying Imagery Datasets , 2017, IEEE Transactions on Visualization and Computer Graphics.

[10]  Jan-Michael Frahm,et al.  USAC: A Universal Framework for Random Sample Consensus , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Subhashis Banerjee,et al.  Divide and Conquer: Efficient Large-Scale Structure from Motion Using Graph Partitioning , 2014, ACCV.

[12]  Javier Civera,et al.  1-Point RANSAC for extended Kalman filtering: Application to real-time structure from motion and visual odometry , 2010 .

[13]  Peter Wonka,et al.  BigSUR , 2017, ACM Trans. Graph..

[14]  Zhanyi Hu,et al.  HSfM: Hybrid Structure-from-Motion , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15]  Huimin Lu,et al.  Deep adversarial metric learning for cross-modal retrieval , 2019, World Wide Web.

[16]  Venu Madhav Govindu,et al.  Robust Relative Rotation Averaging , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Srikumar Ramalingam,et al.  A Unifying Model for Camera Calibration , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[19]  Huchuan Lu,et al.  Robust Visual Tracking via Least Soft-Threshold Squares , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Muhamad Risqi U. Saputra,et al.  Visual SLAM and Structure from Motion in Dynamic Environments , 2018, ACM Comput. Surv..

[21]  Huchuan Lu,et al.  Inverse Sparse Tracker With a Locally Weighted Distance Metric , 2015, IEEE Transactions on Image Processing.

[22]  Andrew Owens,et al.  Discrete-continuous optimization for large-scale structure from motion , 2011, CVPR 2011.

[23]  Yee-Hong Yang,et al.  Robust multi-view L2 triangulation via optimal inlier selection and 3D structure refinement , 2014, Pattern Recognit..

[24]  Jiri Matas,et al.  MODS: Fast and robust method for two-view matching , 2015, Comput. Vis. Image Underst..

[25]  Hujun Bao,et al.  Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion , 2015, IEEE Transactions on Image Processing.

[26]  T. C. Hu,et al.  Multi-Terminal Network Flows , 1961 .

[27]  David G. Lowe,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[28]  Pascal Fua,et al.  Worldwide Pose Estimation Using 3D Point Clouds , 2012, ECCV.

[29]  ARNO KNAPITSCH,et al.  Tanks and temples , 2017, ACM Trans. Graph..

[30]  Jan-Michael Frahm,et al.  Feature tracking and matching in video using programmable graphics hardware , 2007, Machine Vision and Applications.