论文信息 - A computationally efficient pipeline for 3D point cloud reconstruction from video sequences

A computationally efficient pipeline for 3D point cloud reconstruction from video sequences

This paper presents a computationally efficient pipeline to achieve 3D point cloud reconstruction from video sequences. This pipeline involves a key frame selection step to improve the computational efficiency by generating reliable depth information from pair-wise frames. An outlier removal step is then applied in order to further improve the computational efficiency. The reconstruction is achieved based on a new absolute camera pose recovery approach in a computationally efficient manner. This pipeline is devised for both sparse and dense 3D reconstruction. The results obtained from video sequences exhibit higher computational efficiency and lower re-projection errors of the introduced pipeline compared to the existing pipelines.

Nasser Kehtarnavaz | Chih-Hsiang Chang | N. Kehtarnavaz | Chih-Hsiang Chang

[1] Daniel Cremers,et al. Robust odometry estimation for RGB-D cameras , 2013, 2013 IEEE International Conference on Robotics and Automation.

[2] David G. Lowe,et al. Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3] Masatoshi Okutomi,et al. Stable Two View Reconstruction Using the Six-Point Algorithm , 2012, ACCV.

[4] Vincent Lepetit,et al. BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[5] David G. Stork,et al. Pattern classification, 2nd Edition , 2000 .

[6] Roland Siegwart,et al. Versatile distributed pose estimation and sensor self-calibration for an autonomous MAV , 2012, 2012 IEEE International Conference on Robotics and Automation.

[7] Richard Szeliski,et al. Skeletal graphs for efficient structure from motion , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[8] Reinhard Koch,et al. Automated reconstruction of 3D scenes from sequences of images , 2000 .

[9] Matthias Nießner,et al. Combining Inertial Navigation and ICP for Real-time 3D Surface Reconstruction , 2014, Eurographics.

[10] Horst Bischof,et al. Efficient structure from motion with weak position and orientation priors , 2011, CVPR 2011 WORKSHOPS.

[11] Luc Van Gool,et al. Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[12] D. Lowe,et al. Fast Matching of Binary Features , 2012, 2012 Ninth Conference on Computer and Robot Vision.

[13] Marc Levoy,et al. Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[14] Simon Baker,et al. Lucas-Kanade 20 Years On: A Unifying Framework , 2004, International Journal of Computer Vision.

[15] Richard Szeliski,et al. Modeling the World from Internet Photo Collections , 2008, International Journal of Computer Vision.

[16] Tomás Pajdla,et al. Omnidirectional Camera Motion Estimation , 2008, VISAPP.

[17] James R. Bergen,et al. Visual odometry , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[18] Luc Van Gool,et al. Surviving Dominant Planes in Uncalibrated Structure and Motion Recovery , 2002, ECCV.

[19] Michal Havlena,et al. Measuring camera translation by the dominant apical angle , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[20] Juho Kannala,et al. DT-SLAM: Deferred Triangulation for Robust SLAM , 2014, 2014 2nd International Conference on 3D Vision.

[21] Andrew J. Davison,et al. DTAM: Dense tracking and mapping in real-time , 2011, 2011 International Conference on Computer Vision.

[22] Luc Van Gool,et al. SURF: Speeded Up Robust Features , 2006, ECCV.

[23] Michael Frankfurter,et al. Numerical Recipes In C The Art Of Scientific Computing , 2016 .

[24] S. Shankar Sastry,et al. c ○ 2000 Kluwer Academic Publishers. Manufactured in The Netherlands. Linear Differential Algorithm for Motion Recovery: A Geometric Approach , 2022 .

[25] Matthew N. Dailey,et al. Robust Key Frame Extraction for 3D Reconstruction from Video Streams , 2010, VISAPP.

[26] David Nistér,et al. Preemptive RANSAC for live structure and motion estimation , 2005, Machine Vision and Applications.

[27] Jan-Michael Frahm,et al. Real-Time Plane-Sweeping Stereo with Multiple Sweeping Directions , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[28] Kevin P. Murphy,et al. Machine learning - a probabilistic perspective , 2012, Adaptive computation and machine learning series.

[29] Nasser Kehtarnavaz,et al. Computationally efficient approach to three-dimensional point cloud reconstruction from video image sequences , 2014, J. Electronic Imaging.

[30] David W. Murray,et al. Video-rate Recognition and Localization for Wearable Cameras , 2007, BMVC.

[31] Paul A. Beardsley,et al. Sequential Updating of Projective and Affine Structure from Motion , 1997, International Journal of Computer Vision.

[32] Matthijs C. Dorst. Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[33] Manolis I. A. Lourakis,et al. The design and implementation of a generic sparse bundle adjustment software package based on the Le , 2004 .

[34] Pierre Vandergheynst,et al. FREAK: Fast Retina Keypoint , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[35] David Nistér,et al. An efficient solution to the five-point relative pose problem , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[36] Steven M. Seitz,et al. Photo tourism: exploring photo collections in 3D , 2006, ACM Trans. Graph..