Robust Key Frame Extraction for 3D Reconstruction from Video Streams

Automatic reconstruction of 3D models from video sequences requires selection of appropriate video frames for performing the reconstruction. We introduce a complete method for key frame selection that automatically avoids degeneracies and is robust to inaccurate correspondences caused by motion blur. Our method combines selection criteria based on the number of frame-to-frame point correspondences, Torr’s geometrical robust information criterion (GRIC) scores for the frame-to-frame homography and fundamental matrix, and the point-to-epipolar line cost for the frame-to-frame point correspondence set. In a series of experiments with real and synthetic data sets, we show that our method achieves robust 3D reconstruction in the presence of noise and degenerate motion.

[1]  P. Torr Geometric motion segmentation and model selection , 1998, Philosophical Transactions of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences.

[2]  Sang-Hoon Kim,et al.  3D Estimation and Key-Frame Selection for Match Move , 2003 .

[3]  Bernhard P. Wrobel,et al.  Multiple View Geometry in Computer Vision , 2001 .

[4]  Reinhard Koch,et al.  Visual Modeling with a Hand-Held Camera , 2004, International Journal of Computer Vision.

[5]  Andrew W. Fitzgibbon,et al.  Maintaining multiple motion model hypotheses over many views to recover matching and structure , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[6]  Jong-Soo Choi,et al.  Optimal keyframe selection algorithm for three-dimensional reconstruction in uncalibrated multiple images , 2008 .