Performance-aware programming for intraoperative intensity-based image registration on graphics processing units

Intensity-based image registration has proven essential in many applications owing to its ability to resolve image misalignments. However, its long registration time prohibits its use in intraoperative navigation systems. Much prior work has accelerated the registration process by improving the algorithm's robustness, but the inherent computational cost of the registration algorithm itself remains unaddressed. Intensity-based registration methods involve operations with high arithmetic load and memory access demand, both of which graphics processing units (GPUs) are well suited to handle. Although GPUs are widespread and affordable, there is a lack of open-source GPU implementations optimized for non-rigid image registration. This paper demonstrates performance-aware programming techniques, which involve systematic exploitation of GPU features, by implementing the diffeomorphic log-demons algorithm. The computation bottlenecks in diffeomorphic log-demons are pinpointed, analyzed, and resolved through effective use of GPU computation units, memory, and data bandwidth. With these bottlenecks resolved, our implementation on an Nvidia GTX Titan X GPU achieves a ~95-fold speed-up over the CPU and registers a 1.3-M voxel image in 286 ms. Even for large 37-M voxel images, our implementation registers in 8.56 s, a ~258-fold speed-up. The proposed fast computation of basic image operations not only accelerates diffeomorphic log-demons but can also potentially be extended to speed up many other intensity-based approaches. Our implementation is open-source on GitHub at https://bit.ly/2PYZxQz.
