Fast Parallel Unbiased Diffeomorphic Atlas Construction on Multi-Graphics Processing Units

Unbiased diffeomorphic atlas construction has proven to be a powerful technique for medical image analysis, particularly in brain imaging. The method operates on a large set of images, mapping them all into a common coordinate system, and creating an unbiased common template for studying intra-population variability and interpopulation differences. The technique has also proven effective in tissue and object segmentation via registration of anatomical labels. However, a major barrier to the use of this approach is its high computational cost. Especially with the increasing number of inputs and data size, it becomes impractical even with a fully optimized implementation on CPUs. Fortunately, the highly element-wise independence of the problem makes it well suited for parallel processing. This paper presents an efficient implementation of unbiased diffeomorphic atlas construction on the new parallel processing architecture based on Multi-Graphics Processing Units (Multi-GPUs). Our results show that the GPU implementation gives a substantial performance gain on the order of twenty to sixty times faster than a single CPU and provides an inexpensive alternative to large distributed-memory CPU clusters.

[1]  Rüdiger Westermann,et al.  Linear algebra operators for GPU implementation of numerical algorithms , 2003, SIGGRAPH Courses.

[2]  Pat Hanrahan,et al.  Stream computing on graphics hardware , 2005 .

[3]  Michael I. Miller,et al.  Individualizing Neuroanatomic Atlases Using a Massively Parallel Computer , 1996, Computer.

[4]  Raj Shekhar,et al.  FPGA-Accelerated Deformable Image Registration for Improved Target-Delineation During CT-Guided Interventions , 2007, IEEE Transactions on Biomedical Circuits and Systems.

[5]  Knut-Andreas Lie,et al.  How to Solve Systems of Conservation Laws Numerically Using the Graphics Processor as a High-Performance Computational Engine , 2007, Geometric Modelling, Numerical Simulation, and Optimization.

[6]  Nassir Navab,et al.  Fast Deformable Registration of 3D-Ultrasound Data Using a Variational Approach , 2006, MICCAI.

[7]  Martin Rumpf,et al.  Image Registration by a Regularized Gradient Flow. A Streaming Implementation in DX9 Graphics Hardware , 2004, Computing.

[8]  Steven G. Johnson,et al.  The Design and Implementation of FFTW3 , 2005, Proceedings of the IEEE.

[9]  Morten Bro-Nielsen,et al.  Fast Fluid Registration of Medical Images , 1996, VBC.

[10]  Martin Rumpf,et al.  Using Graphics Cards for Quantized FEM Computations , 2001, VIIP.

[11]  Peter Lorenzen,et al.  Unbiased Atlas Formation Via Large Deformations Metric Mapping , 2005, MICCAI.

[12]  Louis A. Hageman,et al.  Iterative Solution of Large Linear Systems. , 1971 .

[13]  Guido Gerig,et al.  Unbiased diffeomorphic atlas construction for computational anatomy , 2004, NeuroImage.

[14]  E ChristensenGary,et al.  Individualizing Neuroanatomical Atlases Using a Massively Parallel Computer , 1996 .

[15]  Ulf Grenander,et al.  General Pattern Theory: A Mathematical Study of Regular Structures , 1993 .

[16]  James Demmel,et al.  Applied Numerical Linear Algebra , 1997 .

[17]  P. Thomas Fletcher,et al.  Population Shape Regression From Random Design Data , 2007, ICCV.

[18]  Ross T. Whitaker,et al.  Interactive Visualization of Volumetric White Matter Connectivity in DT-MRI Using a Parallel-Hardware Hamilton-Jacobi Solver , 2007, IEEE Transactions on Visualization and Computer Graphics.

[19]  P. Thomas Fletcher,et al.  Population Shape Regression from Random Design Data , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[20]  Jostein R. Natvig,et al.  Solving the Euler Equations on Graphics Processing Units , 2006, International Conference on Computational Science.