Optimized selection of key frames for monocular videogrammetric surveying of civil infrastructure

Videogrammetry is an inexpensive and easy-to-use technology for spatial 3D scene recovery. When applied to large scale civil infrastructure scenes, only a small percentage of the collected video frames are required to achieve robust results. However, choosing the right frames requires careful consideration. Videotaping a built infrastructure scene results in large video files filled with blurry, noisy, or redundant frames. This is due to frame rate to camera speed ratios that are often higher than necessary; camera and lens imperfections and limitations that result in imaging noise; and occasional jerky motions of the camera that result in motion blur; all of which can significantly affect the performance of the videogrammetric pipeline. To tackle these issues, this paper proposes a novel method for automating the selection of an optimized number of informative, high quality frames. According to this method, as the first step, blurred frames are removed using the thresholds determined based on a minimum level of frame quality required to obtain robust results. Then, an optimum number of key frames are selected from the remaining frames using the selection criteria devised by the authors. Experimental results show that the proposed method outperforms existing methods in terms of improved 3D reconstruction results, while maintaining the optimum number of extracted frames needed to generate high quality 3D point clouds.

[1]  Fei Dai,et al.  Comparison of Image-Based and Time-of-Flight-Based Technologies for Three-Dimensional Reconstruction of Infrastructure , 2013 .

[2]  D. Nistér Automatic passive recovery of 3D from images and video , 2004 .

[3]  Zubair Ahmed Memon,et al.  An automatic project progress monitoring model by integrating autocad and digital photos , 2005 .

[4]  Jong-Soo Choi,et al.  Optimal keyframe selection algorithm for three-dimensional reconstruction in uncalibrated multiple images , 2008 .

[5]  Matthew N. Dailey,et al.  Robust Key Frame Extraction for 3D Reconstruction from Video Streams , 2010, VISAPP.

[6]  Patricio A. Vela,et al.  Comparison of Camera Motion Estimation Methods for 3D Reconstruction of Infrastructure , 2011 .

[7]  Jonathan T. Black,et al.  Dot-Projection Photogrammetry and Videogrammetry of Gossamer Space Structures , 2003 .

[8]  Gaurav S. Sukhatme,et al.  A survey and evaluation of promising approaches for automatic image-based defect detection of bridge structures , 2009 .

[9]  Stefan Winkler,et al.  A no-reference perceptual blur metric , 2002, Proceedings. International Conference on Image Processing.

[10]  Lina J. Karam,et al.  An improved perception-based no-reference objective image sharpness metric using iterative edge refinement , 2008, 2008 15th IEEE International Conference on Image Processing.

[11]  Steven K. Thompson,et al.  Simple Random Sampling , 2012 .

[12]  David Nistér,et al.  An efficient solution to the five-point relative pose problem , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13]  Andrew Zisserman,et al.  Multiple View Geometry , 1999 .

[14]  Mani Golparvar-Fard,et al.  Application of a Visualization Technique for Safety Management , 2009 .

[15]  Fei Dai,et al.  Assessing the Accuracy of Applying Photogrammetry to Take Geometric Measurements on Building Products , 2010 .

[16]  Jan-Michael Frahm,et al.  Detailed Real-Time Urban 3D Reconstruction from Video , 2007, International Journal of Computer Vision.

[17]  Richard Szeliski,et al.  Modeling the World from Internet Photo Collections , 2008, International Journal of Computer Vision.

[18]  Zhengyou Zhang,et al.  Flexible camera calibration by viewing a plane from unknown orientations , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[19]  Naruo Kano,et al.  Comparison of construction photograph and VR image in construction progress , 2008 .

[20]  S. B. Kang,et al.  Image deblurring using inertial measurement sensors , 2010, SIGGRAPH 2010.

[21]  Feniosky Peña-Mora,et al.  Integrated Sequential As-Built and As-Planned Representation with D4AR Tools in Support of Decision-Making Tasks in the AEC/FM Industry , 2011 .

[22]  Fernanda Leite,et al.  Evaluation of accuracy of as-built 3D modeling from photos taken by handheld digital cameras , 2012 .

[23]  Gustavo de Veciana,et al.  An information fidelity criterion for image quality assessment using natural scene statistics , 2005, IEEE Transactions on Image Processing.

[24]  Ioannis Brilakis,et al.  Innovative Stereo Vision-Based Approach to Generate Dense Depth Map of Transportation Infrastructure , 2011 .

[25]  Ioannis K. Brilakis,et al.  Automated sparse 3D point cloud generation of infrastructure using its distinctive visual features , 2011, Adv. Eng. Informatics.

[26]  Silvio Savarese,et al.  Application of D4AR - A 4-Dimensional augmented reality model for automating construction progress monitoring data collection, processing and communication , 2009, J. Inf. Technol. Constr..

[27]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[28]  Ioannis Brilakis,et al.  Comparison of Optical Sensor-Based Spatial Data Collection Techniques for Civil Infrastructure Modeling , 2009 .

[29]  Michael M. Kazhdan,et al.  Poisson surface reconstruction , 2006, SGP '06.

[30]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[31]  Jiaya Jia,et al.  High-quality motion deblurring from a single image , 2008, ACM Trans. Graph..

[32]  Dang Trung Kien A Review of 3 D Reconstruction from Video Sequences MediaMill 3 D technical reports series , 2005 .

[33]  Mohammad R. Jahanshahi,et al.  An innovative methodology for detection and quantification of cracks through incorporation of depth perception , 2011, Machine Vision and Applications.

[34]  Mani Golparvar-Fard,et al.  Automated Progress Monitoring Using Unordered Daily Construction Photographs and IFC-Based Building Information Models , 2015, J. Comput. Civ. Eng..

[35]  Giuseppe Patanè,et al.  Topology- and error-driven extension of scalar functions from surfaces to volumes , 2009, TOGS.

[36]  Gaurav S. Sukhatme,et al.  Multi-image stitching and scene reconstruction for evaluating defect evolution in structures , 2011 .

[37]  William T. Freeman,et al.  Removing camera shake from a single photograph , 2006, SIGGRAPH 2006.

[38]  Youssef M A Hashash,et al.  Digital image reasoning for tracking excavation activities , 2008 .

[39]  Sunghyun Cho,et al.  Fast motion deblurring , 2009, SIGGRAPH 2009.

[40]  Sarjinder Singh Simple Random Sampling , 2003 .

[41]  Sang-Hoon Kim,et al.  3D Estimation and Key-Frame Selection for Match Move , 2003 .

[42]  Sei-Wang Chen,et al.  A non-parametric blur measure based on edge analysis for image processing applications , 2004, IEEE Conference on Cybernetics and Intelligent Systems, 2004..

[43]  Jian-Feng Cai,et al.  Blind motion deblurring using multiple images , 2009, J. Comput. Phys..

[44]  Thorsten Thormählen,et al.  Keyframe Selection for Camera Motion and Structure Estimation from Multiple Views , 2004, ECCV.

[45]  Manolis I. A. Lourakis,et al.  SBA: A software package for generic sparse bundle adjustment , 2009, TOMS.

[46]  Mani Golparvar-Fard,et al.  Research in Visualization Techniques for Field Construction , 2011 .

[47]  Toby Howard,et al.  Accurate camera calibration for off-line, video-based augmented reality , 2002, Proceedings. International Symposium on Mixed and Augmented Reality.

[48]  Burcin Becerik-Gerber,et al.  Imaged-based verification of as-built documentation of operational buildings , 2012 .

[49]  Reinhard Koch,et al.  Visual Modeling with a Hand-Held Camera , 2004, International Journal of Computer Vision.

[50]  Ioannis Brilakis,et al.  Progressive 3D reconstruction of infrastructure with videogrammetry , 2011 .

[51]  Alan C. Bovik,et al.  No-reference image blur assessment using multiscale gradient , 2009, QOMEX 2009.

[52]  Mani Golparvar-Fard,et al.  Evaluation of image-based modeling and laser scanning accuracy for emerging automated performance monitoring techniques , 2011 .

[53]  Patricia Ladret,et al.  The blur effect: perception and estimation with a new no-reference perceptual blur metric , 2007, Electronic Imaging.

[54]  Andrew W. Fitzgibbon,et al.  Maintaining multiple motion model hypotheses over many views to recover matching and structure , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).