论文信息 - Alignment of continuous video onto 3D point clouds

Alignment of continuous video onto 3D point clouds

We propose a general framework for aligning continuous (oblique) video onto 3D sensor data. We align a point cloud computed from the video onto the point cloud directly obtained from a 3D sensor. This is in contrast to existing techniques where the 2D images are aligned to a 3D model derived from the 3D sensor data. Using point clouds enables the alignment for scenes full of objects that are difficult to model; for example, trees. To compute 3D point clouds from video, motion stereo is used along with a state-of-the-art algorithm for camera pose estimation. Our experiments with real data demonstrate the advantages of the proposed registration algorithm for texturing models in large-scale semi-urban environments. The capability to align video before a 3D model is built from the 3D sensor data offers new practical opportunities for 3D modeling. We introduce a novel modeling-through-registration approach that fuses 3D information from both the 3D sensor and the video. Initial experiments with real data illustrate the potential of the proposed approach.

Wenyi Zhao | David Nistér | Steven C. Hsu

[1] Ioannis Stamos,et al. Automatic registration of 2-D with 3-D imagery in urban environments , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[2] Paul A. Beardsley,et al. Sequential Updating of Projective and Affine Structure from Motion , 1997, International Journal of Computer Vision.

[3] David G. Lowe,et al. Fitting Parameterized Three-Dimensional Models to Images , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[4] Claus Brenner,et al. Generation Of 3D City Models From Airborne Laser Scanning Data , 1997 .

[5] Paul A. Viola,et al. Alignment by Maximization of Mutual Information , 1997, International Journal of Computer Vision.

[6] Jitendra Malik,et al. Modeling and Rendering Architecture from Photographs: A hybrid geometry- and image-based approach , 1996, SIGGRAPH.

[7] Petros G. Voulgaris,et al. On optimal ℓ∞ to ℓ∞ filtering , 1995, Autom..

[8] Suya You,et al. Augmented virtual environments (AVE): dynamic fusion of imagery and 3D models , 2003, IEEE Virtual Reality, 2003. Proceedings..

[9] Takeo Kanade,et al. Real-time 3-D pose estimation using a high-speed range sensor , 1993, Proceedings of the 1994 IEEE International Conference on Robotics and Automation.

[10] Marc Rioux,et al. Three-dimensional registration using range and intensity information , 1994, Other Conferences.

[11] Patrick J. Flynn,et al. A Survey Of Free-Form Object Representation and Recognition Techniques , 2001, Comput. Vis. Image Underst..

[12] Christian Früh,et al. Constructing 3D city models by merging ground-based and airborne views , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[13] Larry H. Matthies,et al. Kalman filter-based algorithms for estimating depth from image sequences , 1989, International Journal of Computer Vision.

[14] P. Anandan,et al. Parallax Geometry of Pairs of Points for 3D Scene Analysis , 1996, ECCV.

[15] Sunil Arya,et al. ANN: library for approximate nearest neighbor searching , 1998 .

[16] Bernhard P. Wrobel,et al. Multiple View Geometry in Computer Vision , 2001 .

[17] Paul J. Besl,et al. A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[18] David Nistér,et al. Reconstruction from Uncalibrated Sequences with a Hierarchy of Trifocal Tensors , 2000, ECCV.

[19] Ioannis Stamos,et al. Geometry and Texture Recovery of Scenes of Large Scale , 2002, Comput. Vis. Image Underst..

[20] Berthold K. P. Horn,et al. Closed-form solution of absolute orientation using unit quaternions , 1987 .

[21] Andrew W. Fitzgibbon,et al. Bundle Adjustment - A Modern Synthesis , 1999, Workshop on Vision Algorithms.

[22] Robert C. Bolles,et al. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[23] Claus Brenner,et al. Towards Fully Automated 3D City Model Generation , 2001 .

[24] Gérard G. Medioni,et al. Object modeling by registration of multiple range images , 1991, Proceedings. 1991 IEEE International Conference on Robotics and Automation.

[25] Christopher G. Harris,et al. A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[26] Trevor Darrell,et al. 3D pose tracking with linear depth and brightness constraints , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[27] Marc Levoy,et al. Efficient variants of the ICP algorithm , 2001, Proceedings Third International Conference on 3-D Digital Imaging and Modeling.

[28] Kia Ng,et al. Automated reconstruction of 3D models from real environments , 1999 .

[29] David Nistér,et al. An efficient solution to the five-point relative pose problem , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30] Yi-Ping Hung,et al. RANSAC-Based DARCES: A New Approach to Fast Automatic Registration of Partially Overlapping Range Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[31] Supun Samarasekera,et al. Video Flashlights: Real Time Rendering of Multiple Videosfor Immersive Model Visualization , 2002, Rendering Techniques.

[32] Katsuhiko Sakaue,et al. Registration and integration of multiple range images for 3-D model construction , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[33] S. P. Mudur,et al. Three-dimensional computer vision: a geometric viewpoint , 1993 .

[34] Andrew W. Fitzgibbon,et al. The Problem of Degeneracy in Structure and Motion Recovery from Uncalibrated Image Sequences , 1999, International Journal of Computer Vision.

[35] Zhengyou Zhang,et al. Iterative point matching for registration of free-form curves and surfaces , 1994, International Journal of Computer Vision.

[36] Carlo Tomasi,et al. Comparison of approaches to egomotion computation , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[37] Xavier Pennec,et al. Multi-scale EM-ICP: A Fast and Robust Approach for Surface Registration , 2002, ECCV.

[38] P. Anandan,et al. Direct recovery of shape from multiple views: a parallax based approach , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[39] Jake K. Aggarwal,et al. Matching Aerial Images to 3-D Terrain Maps , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[40] Supun Samarasekera,et al. Pose estimation, model refinement, and enhanced visualization using video , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).