Accelerated parametric chamfer alignment using a parallel, pipelined GPU realization

Parametric chamfer alignment (PChA) is commonly employed for aligning an observed set of points with a corresponding set of reference points. PChA estimates optimal geometric transformation parameters that minimize an objective function formulated as the sum of the squared distances from each transformed observed point to its closest reference point. A distance transform enables efficient computation of the (squared) distances, and the objective function minimization is commonly performed via the Levenberg–Marquardt (LM) nonlinear least squares iterative optimization algorithm. The point-wise computations of the objective function, gradient, and Hessian approximation required for the LM iterations make PChA computationally demanding for large-scale datasets. We propose an acceleration of the PChA via a parallelized and pipelined realization that is particularly well suited for large-scale datasets and for modern GPU architectures. Specifically, we partition the observed points among the GPU blocks and decompose the expensive LM calculations in correspondence with the GPU’s single-instruction multiple-thread architecture to significantly speed up this bottleneck step for PChA on large-scale datasets. Additionally, by reordering computations, we propose a novel pipelining of the LM algorithm that offers further speedup by exploiting the low arithmetic latency of the GPU compared with its high global memory access latency. Results obtained on two different platforms for both 2D and 3D large-scale point datasets from our ongoing research demonstrate that the proposed PChA GPU implementation provides a significant speedup over its single CPU counterpart.

[1]  Richard Szeliski,et al.  Creating full view panoramic image mosaics and environment maps , 1997, SIGGRAPH.

[2]  Xiang Zhu,et al.  Efficient Parallel Levenberg-Marquardt Model Fitting towards Real-Time Automated Parametric Imaging Microscopy , 2013, PloS one.

[3]  Michael Felsberg,et al.  A Probabilistic Framework for Color-Based Point Set Registration , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Mongi A. Abidi,et al.  A new method for the registration of three-dimensional point-sets: The Gaussian Fields framework , 2010, Image Vis. Comput..

[5]  Ming-Hsuan Yang,et al.  Higher Dimensional Affine Registration and Vision Applications , 2008, ECCV.

[6]  Paul J. Besl,et al.  A Method for Registration of 3-D Shapes , 1992, IEEE Trans. Pattern Anal. Mach. Intell..

[7]  Tiow Seng Tan,et al.  Parallel Banding Algorithm to compute exact distance transform with the GPU , 2010, I3D '10.

[8]  Jie Cheng,et al.  Programming Massively Parallel Processors. A Hands-on Approach , 2010, Scalable Comput. Pract. Exp..

[9]  Zhengyou Zhang,et al.  Iterative point matching for registration of free-form curves and surfaces , 1994, International Journal of Computer Vision.

[10]  Gunilla Borgefors,et al.  Distance transformations in digital images , 1986, Comput. Vis. Graph. Image Process..

[11]  Angel Domingo Sappa,et al.  Correspondence free registration through a point-to-model distance minimization , 2011, 2011 International Conference on Computer Vision.

[12]  Andriy Myronenko,et al.  Point Set Registration: Coherent Point Drift , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Charles V. Stewart,et al.  Simultaneous Covariance Driven Correspondence (CDC) and Transformation Estimation in the Expectation Maximization Framework , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[14]  Gaurav Sharma,et al.  Automatic Registration of Wide Area Motion Imagery to Vector Road Maps by Exploiting Vehicle Detections , 2016, IEEE Transactions on Image Processing.

[15]  Gaurav Sharma,et al.  Vector Road Map Registration to Oblique Wide Area Motion Imagery by Exploiting Vehicle Movements , 2016 .

[16]  Nicolas David,et al.  Towards 3D lidar point cloud registration improvement using optimal neighborhood knowledge , 2013 .

[17]  Robert C. Bolles,et al.  Parametric Correspondence and Chamfer Matching: Two New Techniques for Image Matching , 1977, IJCAI.

[18]  Rama Chellappa,et al.  Fast directional chamfer matching , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  Bo Li,et al.  GPU Accelerated Non-rigid Registration for the Evaluation of Cardiac Function , 2008, MICCAI.

[20]  Markus H. Gross,et al.  Signed distance transform using graphics hardware , 2003, IEEE Visualization, 2003. VIS 2003..

[21]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[22]  Rodrigo Weber dos Santos,et al.  Comparing CUDA and OpenGL implementations for a Jacobi iteration , 2009, 2009 International Conference on High Performance Computing & Simulation.

[23]  Daniel P. Huttenlocher,et al.  Distance Transforms of Sampled Functions , 2012, Theory Comput..

[24]  Andrew W. Fitzgibbon Robust registration of 2D and 3D point sets , 2003, Image Vis. Comput..

[25]  Richard A. Robb,et al.  Image registration of multimodality 3-D medical images by chamfer matching , 1992, Electronic Imaging.

[26]  D K Smith,et al.  Numerical Optimization , 2001, J. Oper. Res. Soc..

[27]  Gaurav Sharma,et al.  Comparative analysis of homologous buildings using range imaging , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[28]  Ahmed Elliethy,et al.  Supplementary Material for “ Automatic Registration of Vector Road Maps with Wide Area Motion Imagery by Exploiting Vehicle Detections ” , 2016 .