Fusion of camera images and laser scans for wide baseline 3D scene alignment in urban environments

In this paper we address the problem of automatic laser scan registration in urban environments. This represents a challenging problem for two major reasons. First, two individual laser scans might be captured at significantly changed viewpoints (wide baseline) and have very little overlap. Second, man-made buildings usually contain many structures of similar appearances. This will result in considerable aliasing in the matching process. By sensor fusion of laser data with camera images, we propose a novel improvement to the existing 2D feature techniques to enable automatic 3D alignment between two widely separated scans. The key idea consists of extracting dominant planar structures from 3D point clouds and then utilizing the recovered 3D geometry to improve the performance of 2D image feature for wide baseline matching. The resulting feature descriptors become more robust to camera viewpoint changes after the procedure of viewpoint normalization. Moreover, the viewpoint normalized 2D features provide robust local feature information including patch scale and dominant orientation for effective repetitive structure matching in man-made environments. Comprehensive experimental evaluations with real data demonstrate the potential of the proposed method for automatic wide baseline 3D scan alignment in urban environments.

[1]  Diego González-Aguilera,et al.  An automatic procedure for co-registration of terrestrial laser scanners and digital cameras , 2009 .

[2]  Reinhard Koch,et al.  Perspectively Invariant Normal Features , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[3]  Atsushi Nakazawa,et al.  The Great Buddha Project: Digitally Archiving, Restoring, and Analyzing Cultural Heritage Objects , 2007, International Journal of Computer Vision.

[4]  Derek D. Lichti,et al.  ISPRS Journal of Photogrammetry and Remote Sensing theme issue “Terrestrial Laser Scanning” , 2006 .

[5]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[6]  Wei Zhang,et al.  Hierarchical building recognition , 2007, Image Vis. Comput..

[7]  Sagi Filin,et al.  REGISTRATION OF TERRESTRIAL LASER SCANS VIA IMAGE BASED FEATURES , 2007 .

[8]  Jan-Michael Frahm,et al.  3D model matching with Viewpoint-Invariant Patches (VIP) , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Z. Kang Automatic registration of terrestrial point cloud using panoramic reflectance images , 2008 .

[10]  Shi-Min Hu,et al.  Geometry and Convergence Analysis of Algorithms for Registration of 3D Shapes , 2006, International Journal of Computer Vision.

[11]  Luc Van Gool,et al.  3D Urban Scene Modeling Integrating Recognition and Reconstruction , 2008, International Journal of Computer Vision.

[12]  David Nister,et al.  Alignment of continuous video onto 3D point clouds , 2004, CVPR 2004.

[13]  Sang Wook Lee,et al.  ICP Registration Using Invariant Features , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Sang Wook Lee,et al.  Range data registration using photometric features , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[15]  Wolfgang Förstner,et al.  Plane Detection in Point Cloud Data , 2010 .

[16]  Jan Boehm Terrestrial laser scanning - a supplementary approach for 3D documentation and animation , 2005 .

[17]  Reinhard Koch,et al.  Visual Modeling with a Hand-Held Camera , 2004, International Journal of Computer Vision.

[18]  Jean Ponce,et al.  Accurate, Dense, and Robust Multiview Stereopsis , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Cordelia Schmid,et al.  A Performance Evaluation of Local Descriptors , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  George Wolberg,et al.  Multiview Geometry for Texture Mapping 2D Images Onto 3D Range Data , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[22]  T. Läbe AUTOMATIC RELATIVE ORIENTATION OF IMAGES , 2006 .

[23]  Robert C. Bolles,et al.  Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography , 1981, CACM.

[24]  Rasmus Fensholt,et al.  Remote Sensing , 2008, Encyclopedia of GIS.

[25]  Walterio W. Mayol-Cuevas,et al.  2nd International Symposium on Visual Computing , 2006 .

[26]  Michael Ying Yang,et al.  Robust alignment of wide baseline terrestrial laser scans via 3D viewpoint normalization , 2011, 2011 IEEE Workshop on Applications of Computer Vision (WACV).

[27]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[28]  Horst Bischof,et al.  Efficient Maximally Stable Extremal Region (MSER) Tracking , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[29]  Wolfgang Förstner,et al.  Robust Wide Baseline Scene Alignment Based on 3D Viewpoint Normalization , 2010, ISVC.

[30]  J. Rissanen,et al.  Modeling By Shortest Data Description* , 1978, Autom..

[31]  S. Filin,et al.  Keypoint based autonomous registration of terrestrial laser point-clouds , 2008 .

[32]  Cordelia Schmid,et al.  Scale & Affine Invariant Interest Point Detectors , 2004, International Journal of Computer Vision.