A GPU-Accelerated Multivoxel Update Scheme for Iterative Coordinate Descent (ICD) Optimization in Statistical Iterative CT Reconstruction (SIR)

Statistical iterative reconstruction (SIR) algorithms have shown great potential for improving image quality in reduced and low dose X-ray computed tomography (CT). However, high computational cost and long reconstruction times have so far prevented the use of SIR in practical applications. Various optimization algorithms have been proposed to make SIR parallelizable for execution on multicore computational platforms, whereas others have sought to improve its convergence rate. Parallelizing on a set of decoupled voxels within an iterative coordinate descent (ICD) optimization framework has shown good promise to achieve both of these premises. However, so far these types of frameworks come at the price of additional complexities or are limited to parallel beam geometry only. We improve on this prior research and present a framework, which also achieves parallelism by processing sets of independent voxels, but does not introduce additional complexities and has no restrictions on beam geometry. Our method uses a novel multivoxel update (MVU) scheme within a general ICD framework fully optimized for acceleration on commodity GPUs. We also investigate different GPU memory access patterns to increase cache hit-rates that result in improved time performance in the ICD framework. Experiments demonstrate speedups of two orders of magnitude for clinical datasets in cone-beam CT geometry, compared to the single-voxel update scheme native to conventional ICD-based SIR. Finally, since our MVU scheme operates on fully independent voxels, it maintains the fast convergence properties of ICD-based SIR. Consequently, the speedups achieved by parallel computing are not diminished by slower convergence of the iterative updates or by any additional overhead to decouple conflicting voxels.

[1]  Jeffrey A. Fessler,et al.  GPU Acceleration of 3 D Forward and Backward Projection Using Separable Footprints for X-ray CT Image Reconstruction , 2011 .

[2]  Charles A. Bouman,et al.  Model-based Iterative CT Image Reconstruction on GPUs , 2017, PPoPP.

[3]  Sungsoo Ha,et al.  An Algorithm to Compute Independent Sets of Voxels for Parallelization of ICD-based Statistical Iterative Reconstruction , .

[4]  Charles A. Bouman,et al.  High performance model based image reconstruction , 2016, PPoPP.

[5]  Jeffrey A. Fessler,et al.  Conjugate-gradient preconditioning methods for shift-variant PET image reconstruction , 1999, IEEE Trans. Image Process..

[6]  Jing Wang,et al.  Statistical image reconstruction for low-dose CT using nonlocal means-based regularization , 2014, Comput. Medical Imaging Graph..

[7]  Hakan Erdogan,et al.  Ordered subsets algorithms for transmission tomography. , 1999, Physics in medicine and biology.

[8]  Jeffrey A. Fessler,et al.  Axial block coordinate descent ( ABCD ) algorithm for X-ray CT image reconstruction , 2011 .

[9]  Freek J. Beekman,et al.  Accelerated iterative transmission CT reconstruction using an ordered subsets convex algorithm , 1998, IEEE Transactions on Medical Imaging.

[10]  Jeffrey A. Fessler,et al.  Fast X-Ray CT Image Reconstruction Using a Linearized Augmented Lagrangian Method With Ordered Subsets , 2014, IEEE Transactions on Medical Imaging.

[11]  Wei Xu,et al.  On the efficiency of iterative ordered subset reconstruction algorithms for acceleration on GPUs , 2010, Comput. Methods Programs Biomed..

[12]  Jeffrey A. Fessler,et al.  3D Forward and Back-Projection for X-Ray CT Using Separable Footprints , 2010, IEEE Transactions on Medical Imaging.

[13]  Jianhua Ma,et al.  Statistical image reconstruction for low-dose CT using nonlocal means-based regularization. Part II: An adaptive approach , 2015, Comput. Medical Imaging Graph..

[14]  Sungsoo Ha,et al.  Efficient Area-Based Ray Integration Using Summed Area Tables and Regression Models , .

[15]  J. Fessler,et al.  Relaxed Linearized Algorithms for Faster X-Ray CT Image Reconstruction. , 2016, IEEE transactions on medical imaging.

[16]  Jean-Baptiste Thibault,et al.  A three-dimensional statistical approach to improved image quality for multislice helical CT. , 2007, Medical physics.

[17]  Jeffrey A. Fessler,et al.  Grouped-coordinate ascent algorithms for penalized-likelihood transmission image reconstruction , 1997, IEEE Transactions on Medical Imaging.

[18]  Rui Liu,et al.  GPU-Based Branchless Distance-Driven Projection and Backprojection , 2017, IEEE Transactions on Computational Imaging.

[19]  Simon R. Cherry,et al.  Fast gradient-based methods for Bayesian reconstruction of transmission and emission PET images , 1994, IEEE Trans. Medical Imaging.

[20]  Fang Xu,et al.  Accelerating popular tomographic reconstruction algorithms on commodity PC graphics hardware , 2005, IEEE Transactions on Nuclear Science.

[21]  Jiang Hsieh,et al.  A study of four minimization approaches for iterative reconstruction in X-ray CT , 2005, IEEE Nuclear Science Symposium Conference Record, 2005.

[22]  Zhou Yu,et al.  Fast Model-Based X-Ray CT Reconstruction Using Spatially Nonhomogeneous ICD Optimization , 2011, IEEE Transactions on Image Processing.

[23]  Jeffrey A. Fessler,et al.  Combining Ordered Subsets and Momentum for Accelerated X-Ray CT Image Reconstruction , 2015, IEEE Transactions on Medical Imaging.

[24]  Thomas M. Benson,et al.  Block-based iterative coordinate descent , 2010, IEEE Nuclear Science Symposuim & Medical Imaging Conference.

[25]  Charles A. Bouman,et al.  Fast voxel line update for time-space image reconstruction , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[26]  C. Bouman,et al.  A recursive filter for noise reduction in statistical iterative tomographic imaging , 2006, Electronic Imaging.

[27]  Ken D. Sauer,et al.  A local update strategy for iterative reconstruction from projections , 1993, IEEE Trans. Signal Process..

[28]  Ken D. Sauer,et al.  A generalized Gaussian image model for edge-preserving MAP estimation , 1993, IEEE Trans. Image Process..

[29]  Klaus Mueller,et al.  IOP PUBLISHING PHYSICS IN MEDICINE AND BIOLOGY , 2007 .

[30]  Gengsheng L. Zeng,et al.  Total variation regulated EM algorithm [SPECT reconstruction] , 1999 .

[31]  K. Mueller,et al.  GPU-Accelerated Forward and Back-Projections With Spatially Varying Kernels for 3D DIRECT TOF PET Reconstruction , 2013, IEEE Transactions on Nuclear Science.

[32]  R. Siddon Fast calculation of the exact radiological path for a three-dimensional CT array. , 1985, Medical physics.

[33]  Jeffrey A. Fessler,et al.  Accelerating Ordered Subsets Image Reconstruction for X-ray CT Using Spatially Nonuniform Optimization Transfer , 2013, IEEE Transactions on Medical Imaging.

[34]  B. De Man,et al.  Distance-driven projection and backprojection in three dimensions. , 2004, Physics in medicine and biology.

[35]  Tinsu Pan,et al.  Preconditioning methods for improved convergence rates in iterative reconstructions , 1993, IEEE Trans. Medical Imaging.

[36]  Ken D. Sauer,et al.  A unified approach to statistical tomography using coordinate descent optimization , 1996, IEEE Trans. Image Process..

[37]  Donald Geman,et al.  Constrained Restoration and the Recovery of Discontinuities , 1992, IEEE Trans. Pattern Anal. Mach. Intell..