Accelerating ring artifact correction for flat-detector CT using the CUDA framework

Ring artifacts often appear in flat-detector CT because of imperfect or defect detector elements or calibration. In high-spatial resolution CT images reducing such artifacts becomes a necessity. In this paper, we used the post-processing ring correction in polar coordinates (RCP)1 to eliminate the ring artifacts. The median filter is applied to the uncorrected images in polar coordinates and ring artifacts are extracted from the original images. The algorithm has a very high computational cost due to the time-expensive median filtering and coordinate transformation on CPUs. Graphics processing units (GPUs)ca n be seen as parallel co-processors with high computational power. All steps of the RCP algorithm were implemented with CUDA2(Compute Unified Device Architecture, NVIDIA). We introduced a new GPU-based branchless vectorized median (BVM)filter. 3, 4 This algorithm is based on minmax sorting and keeps track of a sorted array from which values are deleted and to which new values are inserted. For comparison purpose a modified pivot median filter5 on GPUs was presented, which compares a pivot element to all other values and recursively finds the median element. We evaluated the performance of the RCP method using 512 slices, each slice consisted of 512×512 pixels. This post-processing method efficiently reduces ring artifacts in the reconstructed images and improves image quality. Our CUDAbased RCP is up to 13.6 times faster than the optimized CPU-based (single core)r outine. Comparing our two GPU-based median filters showed a performance benefit by roughly 60% when switching from Pivot to BVM code. The main reason is that the BVM algorithm is branchless and makes use of data-level parallelism. The BVM method is better suited to the model of modern graphics processing. A multi-GPU solution showed that the performance scaled nearly linearly.

[1]  M. Kachelriess Branchless vectorized median filtering , 2009, 2009 IEEE Nuclear Science Symposium Conference Record (NSS/MIC).

[2]  Klaus Mueller,et al.  IOP PUBLISHING PHYSICS IN MEDICINE AND BIOLOGY , 2007 .

[3]  Jan Sijbers,et al.  Reduction of ring artefacts in high resolution micro-CT reconstructions. , 2004, Physics in medicine and biology.

[4]  Martti Juhola,et al.  Comparison of algorithms for standard median filtering , 1991, IEEE Trans. Signal Process..

[5]  L. Feldkamp,et al.  Practical cone-beam algorithm , 1984 .

[6]  Willi A. Kalender,et al.  A New Method to Identify and to Correct Circular Artifacts in X-Ray CT Images. , 2004 .

[7]  Wei Chen,et al.  High performance median filtering using commodity graphics hardware , 2009, 2009 IEEE Nuclear Science Symposium Conference Record (NSS/MIC).

[8]  Rüdiger Westermann,et al.  A fast and high-quality cone beam reconstruction pipeline using the GPU , 2007, SPIE Medical Imaging.

[9]  W. Kalender,et al.  Comparison of ring artifact correction methods for flat-detector CT , 2009, Physics in medicine and biology.

[10]  Ivan Viola,et al.  Hardware-based nonlinear filtering and segmentation using high-level shading languages , 2003, IEEE Visualization, 2003. VIS 2003..