Fully 3D list-mode time-of-flight PET image reconstruction on GPUs using CUDA.

PURPOSE List-mode processing is an efficient way of dealing with the sparse nature of positron emission tomography (PET) data sets and is the processing method of choice for time-of-flight (ToF) PET image reconstruction. However, the massive amount of computation involved in forward projection and backprojection limits the application of list-mode reconstruction in practice, and makes it challenging to incorporate accurate system modeling. METHODS The authors present a novel formulation for computing line projection operations on graphics processing units (GPUs) using the compute unified device architecture (CUDA) framework, and apply the formulation to list-mode ordered-subsets expectation maximization (OSEM) image reconstruction. Our method overcomes well-known GPU challenges such as divergence of compute threads, limited bandwidth of global memory, and limited size of shared memory, while exploiting GPU capabilities such as fast access to shared memory and efficient linear interpolation of texture memory. Execution time comparison and image quality analysis of the GPU-CUDA method and the central processing unit (CPU) method are performed on several data sets acquired on a preclinical scanner and a clinical ToF scanner. RESULTS When applied to line projection operations for non-ToF list-mode PET, this new GPU-CUDA method is >200 times faster than a single-threaded reference CPU implementation. For ToF reconstruction, we exploit a ToF-specific optimization to improve the efficiency of our parallel processing method, resulting in GPU reconstruction >300 times faster than the CPU counterpart. For a typical whole-body scan with 75 × 75 × 26 image matrix, 40.7 million LORs, 33 subsets, and 3 iterations, the overall processing time is 7.7 s for GPU and 42 min for a single-threaded CPU. Image quality and accuracy are preserved for multiple imaging configurations and reconstruction parameters, with normalized root mean squared (RMS) deviation less than 1% between CPU and GPU-generated images for all cases. CONCLUSIONS A list-mode ToF OSEM library was developed on the GPU-CUDA platform. Our studies show that the GPU reformulation is considerably faster than a single-threaded reference CPU method especially for ToF processing, while producing virtually identical images. This new method can be easily adapted to enable more advanced algorithms for high resolution PET reconstruction based on additional information such as depth of interaction (DoI), photon energy, and point spread functions (PSFs).

[1]  Pablo Aguiar,et al.  Geometrical and Monte Carlo projectors in 3D PET reconstruction. , 2010, Medical physics.

[2]  A J Reader,et al.  Statistical list-mode image reconstruction for the high resolution research tomograph. , 2004, Physics in medicine and biology.

[3]  Yuchen Yan,et al.  A system for the 3D reconstruction of retracted-septa PET data using the EM algorithm , 1995 .

[4]  H. Malcolm Hudson,et al.  Accelerated image reconstruction using ordered subsets of projection data , 1994, IEEE Trans. Medical Imaging.

[5]  Fang Xu,et al.  High performance tomosynthesis enabled via a GPU-based iterative reconstruction framework , 2009, Medical Imaging.

[6]  J. Karp,et al.  Performance of Philips Gemini TF PET/CT scanner with special consideration for its time-of-flight imaging capabilities. , 2007, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[7]  Klaus Mueller,et al.  IOP PUBLISHING PHYSICS IN MEDICINE AND BIOLOGY , 2007 .

[8]  Joachim Hornegger,et al.  Comparing performance of many-core CPUs and GPUs for static and motion compensated reconstruction of C-arm CT data. , 2011, Medical physics.

[9]  Russell H. Taylor,et al.  Real-time dose computation: GPU-accelerated source modeling and superposition/convolution. , 2010, Medical physics.

[10]  R. Leahy,et al.  High-resolution 3D Bayesian image reconstruction using the microPET small-animal scanner. , 1998, Physics in medicine and biology.

[11]  Cheng-Yuan Liou,et al.  Dynamic Positron Emission Tomography Data-Driven Analysis Using Sparse Bayesian Learning , 2008, IEEE Transactions on Medical Imaging.

[12]  W. W. Moses,et al.  List-mode maximum-likelihood reconstruction applied to positron emission mammography (PEM) with irregular sampling , 2000, IEEE Transactions on Medical Imaging.

[13]  D. M. Hutton,et al.  The Art of Multiprocessor Programming , 2008 .

[14]  Joel S. Karp,et al.  Efficient 3-D TOF PET Reconstruction Using View-Grouped Histo-Images: DIRECT—Direct Image Reconstruction for TOF , 2009, IEEE Transactions on Medical Imaging.

[15]  H. Barrett,et al.  List-mode likelihood: EM algorithm and image quality estimation demonstrated on 2-D PET , 1998, IEEE Transactions on Medical Imaging.

[16]  Guillem Pratx,et al.  Online detector response calculations for high-resolution PET image reconstruction , 2011, Physics in medicine and biology.

[17]  Christoph Hoeschen,et al.  Fast implementation of the image reconstruction algorithm OPED , 2009, Medical Imaging.

[18]  K. Erlandsson,et al.  Fast accurate iterative reconstruction for low-statistics positron volume imaging. , 1998, Physics in medicine and biology.

[19]  P. Joseph An Improved Algorithm for Reprojecting Rays through Pixel Images , 1983, IEEE Transactions on Medical Imaging.

[20]  James F. Young,et al.  MicroPET: a high resolution PET scanner for imaging small animals , 1996, IEEE Nuclear Science Symposium Conference Record.

[21]  S. Surti,et al.  Efficient 3D TOF PET Reconstruction Using View-Grouped Histo-Images: DIRECT - Direct Image Reconstruction for TOF , 2006, 2006 IEEE Nuclear Science Symposium Conference Record.

[22]  Robert M. Lewitt,et al.  Practical considerations for 3-D image reconstruction using spherically symmetric volume elements , 1996, IEEE Trans. Medical Imaging.

[23]  Z. H. Cho,et al.  Ultra Fast Symmetry and SIMD-Based Projection-Backprojection (SSP) Algorithm for 3-D PET Image Reconstruction , 2007, IEEE Transactions on Medical Imaging.

[24]  Benoît Ozell,et al.  GPUMCD: A new GPU-oriented Monte Carlo dose calculation platform. , 2011, Medical physics.

[25]  L. Shepp,et al.  Maximum Likelihood Reconstruction for Emission Tomography , 1983, IEEE Transactions on Medical Imaging.

[26]  Jurgen Seidel,et al.  Performance evaluation of the GE healthcare eXplore VISTA dual-ring small-animal PET scanner. , 2006, Journal of nuclear medicine : official publication, Society of Nuclear Medicine.

[27]  Suleman Surti,et al.  Benefit of Time-of-Flight in PET: Experimental and Clinical Results , 2008, Journal of Nuclear Medicine.

[28]  R. Siddon Fast calculation of the exact radiological path for a three-dimensional CT array. , 1985, Medical physics.

[29]  Feng Zhang,et al.  Detection of gamma ray polarization using a 3-D position-sensitive CdZnTe detector , 2005, IEEE Transactions on Nuclear Science.

[30]  T. Yamashita,et al.  A high resolution PET for animal studies , 1991, Conference Record of the 1991 IEEE Nuclear Science Symposium and Medical Imaging Conference.

[31]  M. Werner,et al.  Dynamic Load Balancing on Distributed Listmode Time-of-Flight Image Reconstruction , 2006, 2006 IEEE Nuclear Science Symposium Conference Record.

[32]  Fang Xu,et al.  Accelerating popular tomographic reconstruction algorithms on commodity PC graphics hardware , 2005, IEEE Transactions on Nuclear Science.

[33]  S. Matej,et al.  Iterative image reconstruction using geometrically ordered subsets with list-mode data , 2004, IEEE Symposium Conference Record Nuclear Science 2004..

[34]  Joel S. Karp,et al.  Investigation of time-of-flight benefit for fully 3-DPET , 2006, IEEE Transactions on Medical Imaging.

[35]  Steven G. Ross,et al.  Application and Evaluation of a Measured Spatially Variant System Model for PET Image Reconstruction , 2010, IEEE Transactions on Medical Imaging.

[36]  Lei Xing,et al.  GPU computing in medical physics: a review. , 2011, Medical physics.

[37]  Long Zhang,et al.  Fast and memory-efficient Monte Carlo-based image reconstruction for whole-body PET. , 2010, Medical physics.

[38]  Craig S. Levin,et al.  Fast, Accurate and Shift-Varying Line Projections for Iterative Reconstruction Using the GPU , 2009, IEEE Transactions on Medical Imaging.

[39]  E. Veklerov,et al.  MLE reconstruction of a brain phantom using a Monte Carlo transition matrix and a statistical stopping rule , 1988 .