Parallel computing for simultaneous iterative tomographic imaging by graphics processing units

In this paper, we address the problem of accelerating inversion algorithms for nonlinear acoustic tomographic imaging by parallel computing on graphics processing units (GPUs). Nonlinear inversion algorithms for tomographic imaging often rely on iterative algorithms for solving an inverse problem, thus computationally intensive. We study the simultaneous iterative reconstruction technique (SIRT) for the multiple-input-multiple-output (MIMO) tomography algorithm which enables parallel computations of the grid points as well as the parallel execution of multiple source excitation. Using graphics processing units (GPUs) and the Compute Unified Device Architecture (CUDA) programming model an overall improvement of 26.33x was achieved when combining both approaches compared with sequential algorithms. Furthermore we propose an adaptive iterative relaxation factor and the use of non-uniform weights to improve the overall convergence of the algorithm. Using these techniques, fast computations can be performed in parallel without the loss of image quality during the reconstruction process.

[1]  Yuanwei Jin,et al.  Abstract: GPU Accelerated Ultrasonic Tomography Using Propagation and Backpropagation Method , 2012, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis.

[2]  Yuanwei Jin,et al.  MIMO Nonlinear Ultrasonic Tomography by Propagation and Backpropagation Method , 2013, IEEE Transactions on Image Processing.

[3]  Michael Garland,et al.  Implementing sparse matrix-vector multiplication on throughput-oriented processors , 2009, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis.

[4]  Kevin L. Priddy,et al.  A study of multi-static ultrasonic tomography using propagation and back-propagation method , 2011, Defense + Commercial Sensing.

[5]  P. Sadayappan,et al.  Stencil-Aware GPU Optimization of Iterative Solvers , 2013, SIAM J. Sci. Comput..

[6]  Massimiliano Fatica,et al.  Implementing the Himeno benchmark with CUDA on GPU clusters , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[7]  Michael A. Jensen,et al.  A massively parallel computation strategy for FDTD: time and space parallelism applied to electromagnetics problems , 1995 .

[8]  Naoya Maruyama,et al.  Optimizing Stencil Computations for NVIDIA Kepler GPUs , 2014 .

[9]  Harold S. Stone,et al.  A Parallel Algorithm for the Efficient Solution of a General Class of Recurrence Equations , 1973, IEEE Transactions on Computers.

[10]  Yuanwei Jin,et al.  Graphics processing units accelerated MIMO tomographic image reconstruction using target sparseness , 2014, Sensing Technologies + Applications.

[11]  Yuanwei Jin,et al.  GPU accelerated MIMO ultrasonic imaging using propagation and back-propagation method , 2013, 2013 IEEE China Summit and International Conference on Signal and Information Processing.