Forward-Projection Architecture for Fast Iterative Image Reconstruction in X-Ray CT

Iterative image reconstruction can dramatically improve the image quality in X-ray computed tomography (CT), but the computation involves iterative steps of 3D forward- and back-projection, which impedes routine clinical use. To accelerate forward-projection, we analyze the CT geometry to identify the intrinsic parallelism and data access sequence for a highly parallel hardware architecture. To improve the efficiency of this architecture, we propose a water-filling buffer to remove pipeline stalls, and an out-of-order sectored processing to reduce the off-chip memory access by up to three orders of magnitude. We make a floating-point to fixed-point conversion based on numerical simulations and demonstrate comparable image quality at a much lower implementation cost. As a proof of concept, a 5-stage fully pipelined, 55-way parallel separable-footprint forward-projector is prototyped on a Xilinx Virtex-5 FPGA for a throughput of 925.8 million voxel projections/s at 200 MHz clock frequency, 4.6 times higher than an optimized 16-threaded program running on an 8-core 2.8-GHz CPU. A similar architecture can be applied to back-projection for a complete iterative image reconstruction system. The proposed algorithm and architecture can also be applied to hardware platforms such as graphics processing unit and digital signal processor to achieve significant accelerations.

[1]  Scott Hauck,et al.  Impulse C vs. VHDL for Accelerating Tomographic Reconstruction , 2010, 2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines.

[2]  Jeffrey A. Fessler,et al.  Hardware acceleration of iterative image reconstruction for X-ray computed tomography , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[3]  Jeffrey A. Fessler,et al.  3D Forward and Back-Projection for X-Ray CT Using Separable Footprints , 2010, IEEE Transactions on Medical Imaging.

[4]  Jean-Baptiste Thibault,et al.  A three-dimensional statistical approach to improved image quality for multislice helical CT. , 2007, Medical physics.

[5]  Jeffrey A. Fessler,et al.  Image Reconstruction: Algorithms and Analysis , 2013 .

[6]  R. Geise Computed Tomography: Physical Principles, Clinical Applications, and Quality Control , 1995 .

[7]  O. Nalcioglu,et al.  Constrained Iterative Reconstruction by the Conjugate Gradient Method , 1985, IEEE Transactions on Medical Imaging.

[8]  J. Fessler Statistical Image Reconstruction Methods for Transmission Tomography , 2000 .

[9]  T. Pan Computed Tomography: from Photon Statistics to Modern Cone-Beam CT , 2009, Journal of Nuclear Medicine.

[10]  Iain Goddard,et al.  High-speed cone-beam reconstruction: an embedded systems approach , 2002, SPIE Medical Imaging.

[11]  P. Tseng,et al.  On the convergence of the coordinate descent method for convex differentiable minimization , 1992 .

[12]  Hakan Erdogan,et al.  Ordered subsets algorithms for transmission tomography. , 1999, Physics in medicine and biology.

[13]  L. Feldkamp,et al.  Practical cone-beam algorithm , 1984 .

[14]  J H Siewerdsen,et al.  Cone-beam computed tomography with a flat-panel imager: effects of image lag. , 1999, Medical physics.

[15]  Klaus Mueller,et al.  IOP PUBLISHING PHYSICS IN MEDICINE AND BIOLOGY , 2007 .

[16]  J. Wong,et al.  Flat-panel cone-beam computed tomography for image-guided radiation therapy. , 2002, International journal of radiation oncology, biology, physics.

[17]  Fang Xu,et al.  Accelerating popular tomographic reconstruction algorithms on commodity PC graphics hardware , 2005, IEEE Transactions on Nuclear Science.

[18]  Jie Cheng,et al.  CUDA by Example: An Introduction to General-Purpose GPU Programming , 2010, Scalable Comput. Pract. Exp..