A Parallel GPU Algorithm for Mutual Information Based 3D Nonrigid Image Registration

Many applications in biomedical image analysis require alignment or fusion of images acquired with different devices or at different times. Image registration geometrically aligns images allowing their fusion. Nonrigid techniques are usually required when the images contain anatomical structures of soft tissue. Nonrigid registration algorithms are very time consuming and can take hours for aligning a pair of 3D medical images on commodity workstation PCs. In this paper, we present parallel design and implementation of 3D non-rigid image registration for the Graphics Processing Units (GPUs). Existing GPU-based registration implementations are mainly limited to intra-modality registration problems. Our algorithm uses mutual information as the similarity metric and can process images of different modalities. The proposed design takes advantage of highly parallel and multi-threaded architecture of GPU containing large number of processing cores. The paper presents optimization techniques to effectively utilize high memory bandwidth provided by GPU using on-chip shared memory and co-operative memory update by multiple threads. Our results with optimized GPU implementation showed an average performance of 2.46 microseconds per voxel and achieved factor of 28 speedup over a CPU-based serial implementation. This improves the usability of nonrigid registration for some real world clinical applications and enables new ones, especially within intra-operative scenarios, where strict timing constraints apply.

[1]  Daniel Rueckert,et al.  Nonrigid registration using free-form deformations: application to breast MR images , 1999, IEEE Transactions on Medical Imaging.

[2]  Nicolas Courty,et al.  Accelerating 3D Non-Rigid Registration Using Graphics Hardware , 2008, Int. J. Image Graph..

[3]  Max A. Viergever,et al.  Mutual-information-based registration of medical images: a survey , 2003, IEEE Transactions on Medical Imaging.

[4]  John D. Owens,et al.  Fast Deformable Registration on the GPU: A CUDA Implementation of Demons , 2008, 2008 International Conference on Computational Sciences and Its Applications.

[5]  David R. Haynor,et al.  PET-CT image registration in the chest using free-form deformations , 2003, IEEE Transactions on Medical Imaging.

[6]  M. Unser,et al.  Spline Pyramids for Inter-Modal Image Registration Using Mutual Information , 1997 .

[7]  Michael A. Unser,et al.  Spline pyramids for intermodal image registration using mutual information , 1997, Optics & Photonics.

[8]  Richard Szeliski,et al.  Spline-Based Image Registration , 1997, International Journal of Computer Vision.

[9]  D. Hill,et al.  Non-rigid image registration: theory and practice. , 2004, The British journal of radiology.

[10]  Fumihiko Ino,et al.  Performance Study of Nonrigid Registration Algorithm for Investigating Lung Disease on Clusters , 2005, Sixth International Conference on Parallel and Distributed Computing Applications and Technologies (PDCAT'05).

[11]  D. Louis Collins,et al.  Design and construction of a realistic digital brain phantom , 1998, IEEE Transactions on Medical Imaging.

[12]  Michael Unser,et al.  Fast parametric elastic image registration , 2003, IEEE Trans. Image Process..

[13]  Torsten Rohlfing,et al.  Nonrigid image registration in shared-memory multiprocessor environments with application to brains, breasts, and bees , 2003, IEEE Transactions on Information Technology in Biomedicine.

[14]  Rüdiger Westermann,et al.  Non-rigid multi-modal registration on the GPU , 2007, SPIE Medical Imaging.

[15]  G C Sharp,et al.  GPU-based streaming architectures for fast cone-beam CT image reconstruction and demons deformable registration , 2007, Physics in medicine and biology.