GPU acceleration of nonlinear diffusion tensor estimation using CUDA and MPI

Diffusion MRI is a non-invasive magnetic resonance technique and has been increasingly used in imaging neuroscience. It is currently the only method capable of depicting the complex structure of white matter of the brain in vivo. One of the most popular techniques is diffusion tensor imaging (DTI) which is commonly used clinically to produce in vivo images of biological tissues with local microstructural characteristics such as water diffusion. Diffusion tensor maps are usually computed on a voxel-by-voxel basis by fitting the signal intensities of diffusion weighted images as a function of their corresponding data acquisition parameters (b-matrices). This processing is computation-intensive and time-consuming which can constraint the clinical practice of DTI. This study presents the application of using high performance GPU clusters in addition to CPUs for diffusion tensor estimation by accelerating the multivariate non-linear regression. The results are tested using both simulated and clinical diffusion datasets and show significant performance gain in tensor fitting. The proposed GPU implementation framework can significantly reduce the processing time of DTI data especially for the datasets with high spatial and temporal resolution, and hence further promote the clinical use of DTI. It also can be used to accelerate statistical analysis of DTI where Monte Carlo simulations are employed, be readily applied to quantitative assessment of DTI using bootstrap analysis, robust diffusion tensor estimation and should be applicable to other MR imaging techniques that use univariate or multivariate regression to fit MRI data to a model.

[1]  José M. García,et al.  Accelerating Fibre Orientation Estimation from Diffusion Weighted Magnetic Resonance Imaging Using GPUs , 2012, PDP.

[2]  J. Navarro-Pedreño Numerical Methods for Least Squares Problems , 1996 .

[3]  I. Peña,et al.  Matrix computations on graphics processors and clusters of gpus , 2011 .

[4]  Tim McGraw,et al.  Stochastic DT-MRI Connectivity Mapping on the GPU , 2007, IEEE Transactions on Visualization and Computer Graphics.

[5]  Arnaud Doucet,et al.  On the Utility of Graphics Cards to Perform Massively Parallel Simulation of Advanced Monte Carlo Methods , 2009, Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America.

[6]  Naga K. Govindaraju,et al.  A Survey of General‐Purpose Computation on Graphics Hardware , 2007 .

[7]  Sinisa Pajevic,et al.  Color schemes to represent the orientation of anisotropic tissues from diffusion tensor data: Application to white matter fiber tract mapping in the human brain , 1999, Magnetic resonance in medicine.

[8]  Eros Comunello,et al.  Performing Real-Time Interactive Fiber Tracking , 2011, Journal of Digital Imaging.

[9]  Cheng Guan Koay,et al.  A simple scheme for generating nearly uniform distribution of antipodally symmetric points on the unit sphere , 2011, J. Comput. Sci..

[10]  Jie Cheng,et al.  Programming Massively Parallel Processors. A Hands-on Approach , 2010, Scalable Comput. Pract. Exp..

[11]  Carlo Pierpaoli,et al.  Informed RESTORE: A method for robust estimation of diffusion tensor from low redundancy datasets in the presence of physiological noise artifacts , 2012, Magnetic resonance in medicine.

[12]  Jungsoo Lee,et al.  Divide et impera: Acceleration of DTI tractography using multi‐GPU parallel processing , 2013, Int. J. Imaging Syst. Technol..

[13]  Christopher Nimsky,et al.  Hybrid Visualization for White Matter Tracts using Triangle Strips and Point Sprites , 2006, IEEE Transactions on Visualization and Computer Graphics.

[14]  Derek K. Jones,et al.  RESTORE: Robust estimation of tensors by outlier rejection , 2005, Magnetic resonance in medicine.

[15]  Frederica Darema,et al.  The SPMD Model : Past, Present and Future , 2001, PVM/MPI.

[16]  Carlo Pierpaoli,et al.  Variance of estimated DTI‐derived parameters via first‐order perturbation methods , 2007, Magnetic resonance in medicine.

[17]  P. Basser,et al.  Estimation of the effective self-diffusion tensor from the NMR spin echo. , 1994, Journal of magnetic resonance. Series B.

[18]  P. Basser,et al.  Microstructural and physiological features of tissues elucidated by quantitative-diffusion-tensor MRI. , 1996, Journal of magnetic resonance. Series B.

[19]  Zhiwei Xu,et al.  Scalable Parallel Computing: Technology, Architecture, Programming , 1998 .

[20]  David Kaeli,et al.  Heterogeneous Computing with OpenCL , 2011 .

[21]  Steen Moeller,et al.  Multiband multislice GE‐EPI at 7 tesla, with 16‐fold acceleration using partial parallel imaging with application to high spatial and temporal whole‐brain fMRI , 2010, Magnetic resonance in medicine.

[22]  P. Basser,et al.  Diffusion tensor MR imaging of the human brain. , 1996, Radiology.

[23]  Kenneth Levenberg A METHOD FOR THE SOLUTION OF CERTAIN NON – LINEAR PROBLEMS IN LEAST SQUARES , 1944 .

[24]  Inanc Senocak,et al.  An MPI-CUDA Implementation for Massively Parallel Incompressible Flow Computations on Multi-GPU Clusters , 2010 .

[25]  Erkki Oja,et al.  Solving Large Regression Problems using an Ensemble of GPU-accelerated ELMs , 2010, ESANN.

[26]  Timothy Edward John Behrens,et al.  Accelerating Fibre Orientation Estimation from Diffusion Weighted Magnetic Resonance Imaging Using GPUs , 2012, 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing.

[27]  John E. Stone,et al.  GPU clusters for high-performance computing , 2009, 2009 IEEE International Conference on Cluster Computing and Workshops.

[28]  Anna Vilanova,et al.  CUDA-Accelerated Geodesic Ray-Tracing for Fiber Tracking , 2011, Int. J. Biomed. Imaging.

[29]  Eros Comunello,et al.  Diffusion tensor fiber tracking on graphics processing units , 2008, Comput. Medical Imaging Graph..

[30]  Heidi Johansen-Berg,et al.  Diffusion MRI at 25: Exploring brain tissue structure and function , 2012, NeuroImage.

[31]  D. Marquardt An Algorithm for Least-Squares Estimation of Nonlinear Parameters , 1963 .

[32]  P. Basser,et al.  The b matrix in diffusion tensor echo‐planar imaging , 1997, Magnetic resonance in medicine.

[33]  Anders Eklund,et al.  Medical image processing on the GPU - Past, present and future , 2013, Medical Image Anal..

[34]  Bin Li,et al.  Architecture comparisons between Nvidia and ATI GPUs: Computation parallelism and data communications , 2011, 2011 IEEE International Symposium on Workload Characterization (IISWC).

[35]  Ross T. Whitaker,et al.  Interactive Visualization of Volumetric White Matter Connectivity in DT-MRI Using a Parallel-Hardware Hamilton-Jacobi Solver , 2007, IEEE Transactions on Visualization and Computer Graphics.