Design and Implementation of Parallel Nonrigid Image Registration Using Off-the-Shelf Supercomputers

This paper presents a new parallel algorithm for nonrigid image registration using off-the-shelf supercomputers, or clusters of PCs. Our algorithm realizes scalable registration for high resolution three-dimensional (3-D) images by employing three techniques: (1) data distribution; (2) data-parallel processing; and (3) dynamic load balancing. The experimental results show that our parallel implementation on a cluster of 64 off-the-shelf PCs (with 128 processors) registers liver CT images of 512×512×159 voxels within 8 minutes while a sequential implementation takes 12 hours. Furthermore, our implementation allows processors to use less memory, and thereby enables us to align 1024×1024×590 voxel images, which is not easy for single processor systems due to the restrictions on the memory space and the processing time.

[1]  G. Amdhal,et al.  Validity of the single processor approach to achieving large scale computing capabilities , 1967, AFIPS '67 (Spring).

[2]  Message Passing Interface Forum MPI: A message - passing interface standard , 1994 .

[3]  Gerhard Goos,et al.  Open Hypermedia Systems and Structural Computing , 2002, Lecture Notes in Computer Science.

[4]  Hiroshi Tezuka,et al.  The design and implementation of zero copy MPI using commodity hardware with a high performance network , 1998, ICS '98.

[5]  Ronald L. Graham,et al.  Bounds on Multiprocessing Timing Anomalies , 1969, SIAM Journal of Applied Mathematics.

[6]  Ron Kikinis,et al.  Medical Image Computing and Computer-Assisted Intervention — MICCAI 2002 , 2002, Lecture Notes in Computer Science.

[7]  Wiro J. Niessen,et al.  Medical Image Computing and Computer-Assisted Intervention – MICCAI 2001 , 2001, Lecture Notes in Computer Science.

[8]  Siegfried Reich Introduction to OHS6 , 2000, OHS-6/SC-2.

[9]  Daniel Rueckert,et al.  Nonrigid registration using free-form deformations: application to breast MR images , 1999, IEEE Transactions on Medical Imaging.

[10]  Ron Kikinis,et al.  Real-Time Biomechanical Simulation of Volumetric Brain Deformation for Image Guided Neurosurgery , 2000, ACM/IEEE SC 2000 Conference (SC'00).

[11]  Kwan-Liu Ma,et al.  Parallel Volume Rendering Using Binary-Swap Image Composition , 2008, International Conference on Computer Graphics and Interactive Techniques.

[12]  Charles L. Seitz,et al.  Myrinet: A Gigabit-per-Second Local Area Network , 1995, IEEE Micro.

[13]  Rajkumar Buyya,et al.  High Performance Cluster Computing , 1999 .

[14]  Sébastien Ourselin,et al.  Robust Registration of Multi-modal Images: Towards Real-Time Clinical Applications , 2002, MICCAI.

[15]  Torsten Rohlfing,et al.  Nonrigid image registration in shared-memory multiprocessor environments with application to brains, breasts, and bees , 2003, IEEE Transactions on Information Technology in Biomedicine.

[16]  Haiying Liu,et al.  A Generic Framework for Non-rigid Registration Based on Non-uniform Multi-level Free-Form Deformations , 2001, MICCAI.