Accelerating Mutual-Information-Based Linear Registration on the Cell Broadband Engine Processor

Emerging multi-core processors are able to accelerate medical imaging applications by exploiting the parallelism available in their algorithms. We have implemented a mutual-information-based 3D linear registration algorithm on the Cell Broadband Enginetrade processor. By exploiting the highly parallel architecture and its high memory bandwidth, our implementation with two CBE processors can register a pair of 256x256x30 3D images in one second. This implementation is significantly faster than a conventional one on a traditional microprocessor or even faster than a previously reported custom-hardware implementation. In addition to parallelizing the code for multiple cores and organizing the data structure for reducing the amount of the memory traffic, it is also critical to optimize the code for the SIMD pipeline structure. We note that code optimization for the SIMD pipeline alone results in a 4.2x-8.7x acceleration for the computation of small kernels. Further, SIMD optimization alone results in a 4.5x end-end application speedup.

[1]  Hiroshi Inoue,et al.  REAL-TIME MUTUAL-INFORMATION-BASED LINEAR REGISTRATION ON THE CELL BROADBAND ENGINE PROCESSOR , 2007, 2007 4th IEEE International Symposium on Biomedical Imaging: From Nano to Macro.

[2]  Martin Rumpf,et al.  Image Registration by a Regularized Gradient Flow. A Streaming Implementation in DX9 Graphics Hardware , 2004, Computing.

[3]  David R. Haynor,et al.  PET-CT image registration in the chest using free-form deformations , 2003, IEEE Transactions on Medical Imaging.

[4]  Bradley J Erickson,et al.  Image Registration Improves Confidence and Accuracy of Image Interpretation , 2007, Cancer informatics.

[5]  Torsten Rohlfing,et al.  Nonrigid image registration in shared-memory multiprocessor environments with application to brains, breasts, and bees , 2003, IEEE Transactions on Information Technology in Biomedicine.

[6]  M. Knaup,et al.  Hyperfast Parallel--Beam Backprojection , 2006, 2006 IEEE Nuclear Science Symposium Conference Record.

[7]  Ron Kikinis,et al.  A High Performance Computing Approach to the Registration of Medical Imaging Data , 1998, Parallel Comput..

[8]  J.M. Jagadeesh,et al.  FAIR: a hardware architecture for real-time 3-D image registration , 2003, IEEE Transactions on Information Technology in Biomedicine.

[9]  H. Peter Hofstee,et al.  Introduction to the Cell multiprocessor , 2005, IBM J. Res. Dev..

[10]  Yaron Caspi,et al.  Scalability of Multimedia Applications on Next-Generation Processors , 2006, 2006 IEEE International Conference on Multimedia and Expo.