论文信息 - GPU-accelerated feature tracking for 3D reconstruction

GPU-accelerated feature tracking for 3D reconstruction

Abstract 3D reconstruction based on structure from motion is one of the most techniques to produce sparse point-cloud model and camera parameter. However, this technique heavily relies on feature tracking method to obtain feature correspondences, then resulting in a heavy computation burden. To speed up 3D reconstruction, in this paper, we design a novel GPU-accelerated feature tracking (GFT) method for large-scale structure from motion (SFM)-based 3D reconstruction. The proposed GFT method consists of GPU-based Gaussian of image (DOG) keypoint detector, RootSIFT descriptor extractor, k nearest matching, and outlier removing. Firstly, our GPU-based DOG implementation can detect thousands of keypoints in real-time, whose speed is 30 times faster than that of the CPU version. Secondly, our GPU-based RootSIFT descriptor can compute thousands of descriptors in real-time. Thirdly, our GPU-based descriptor matching is 10 times faster than that of the state-of-the-art methods. Finally, we conduct thorough experiments on different datasets to evaluate the proposed method. Experimental results demonstrate the effectiveness and efficiency of the proposed method.

[1] Long Quan,et al. Fast Descriptors and Correspondence Propagation for Robust Global Point Cloud Registration , 2017, IEEE Transactions on Image Processing.

[2] Andreas Lanitis,et al. Model-based generation of personalized full-body 3D avatars from uncalibrated multi-view photographs , 2017, Multimedia Tools and Applications.

[3] Antoine Manzanera,et al. Video Extruder: a semi-dense point tracker for extracting beams of trajectories in real time , 2014, Journal of Real-Time Image Processing.

[4] Wei Jia,et al. Robust bundle adjustment for large-scale structure from motion , 2017, Multimedia Tools and Applications.

[5] Wei Jia,et al. Fast and robust absolute camera pose estimation with known focal length , 2018, Neural Computing and Applications.

[6] Gang Wang,et al. ROML: A Robust Feature Correspondence Approach for Matching Objects in A Set of Images , 2014, International Journal of Computer Vision.

[7] Niloy J. Mitra,et al. Coupled structure-from-motion and 3D symmetry detection for urban facades , 2014, ACM Trans. Graph..

[8] Antoine Manzanera,et al. Real Time Semi-dense Point Tracking , 2012, ICIAR.

[9] Chao Peng,et al. A GPU-Accelerated Approach for Feature Tracking in Time-Varying Imagery Datasets , 2017, IEEE Transactions on Visualization and Computer Graphics.

[10] Jan-Michael Frahm,et al. USAC: A Universal Framework for Random Sample Consensus , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11] Subhashis Banerjee,et al. Divide and Conquer: Efficient Large-Scale Structure from Motion Using Graph Partitioning , 2014, ACCV.

[12] Javier Civera,et al. 1-Point RANSAC for extended Kalman filtering: Application to real-time structure from motion and visual odometry , 2010 .

[13] Peter Wonka,et al. BigSUR , 2017, ACM Trans. Graph..

[14] Zhanyi Hu,et al. HSfM: Hybrid Structure-from-Motion , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[15] Huimin Lu,et al. Deep adversarial metric learning for cross-modal retrieval , 2019, World Wide Web.

[16] Venu Madhav Govindu,et al. Robust Relative Rotation Averaging , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17] Srikumar Ramalingam,et al. A Unifying Model for Camera Calibration , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18] Takeo Kanade,et al. An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[19] Huchuan Lu,et al. Robust Visual Tracking via Least Soft-Threshold Squares , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[20] Muhamad Risqi U. Saputra,et al. Visual SLAM and Structure from Motion in Dynamic Environments , 2018, ACM Comput. Surv..

[21] Huchuan Lu,et al. Inverse Sparse Tracker With a Locally Weighted Distance Metric , 2015, IEEE Transactions on Image Processing.

[22] Andrew Owens,et al. Discrete-continuous optimization for large-scale structure from motion , 2011, CVPR 2011.

[23] Yee-Hong Yang,et al. Robust multi-view L2 triangulation via optimal inlier selection and 3D structure refinement , 2014, Pattern Recognit..

[24] Jiri Matas,et al. MODS: Fast and robust method for two-view matching , 2015, Comput. Vis. Image Underst..

[25] Hujun Bao,et al. Efficient Non-Consecutive Feature Tracking for Robust Structure-From-Motion , 2015, IEEE Transactions on Image Processing.

[26] T. C. Hu,et al. Multi-Terminal Network Flows , 1961 .

[27] David G. Lowe,et al. Distinctive Image Features from Scale-Invariant Keypoints , 2004, International Journal of Computer Vision.

[28] Pascal Fua,et al. Worldwide Pose Estimation Using 3D Point Clouds , 2012, ECCV.

[29] ARNO KNAPITSCH,et al. Tanks and temples , 2017, ACM Trans. Graph..

[30] Jan-Michael Frahm,et al. Feature tracking and matching in video using programmable graphics hardware , 2007, Machine Vision and Applications.