Block stochastic gradient descent for large-scale tomographic reconstruction in a parallel network

Iterative algorithms have many advantages for linear tomographic image reconstruction when compared to back-projection based methods. However, iterative methods tend to have significantly higher computational complexity. To overcome this, parallel processing schemes that can utilise several computing nodes are desirable. Popular methods here are row action methods, which update the entire image simultaneously and column action methods, which require access to all measurements at each node. In large scale tomographic reconstruction with limited storage capacity of each node, data communication overheads between nodes becomes a significant performance limiting factor. To reduce this overhead, we proposed a row action method BSGD. The method is based on the stochastic gradient descent method but it does not update the entire image at each iteration, which reduces between node communication. To further increase convergence speeds, an importance sampling strategy is proposed. We compare BSGD to other existing stochastic methods and show its effectiveness and efficiency. Other properties of BSGD are also explored, including its ability to incorporate total variation (TV) regularization and automatic parameter tuning.

[1]  Thomas M. Benson,et al.  Block-based iterative coordinate descent , 2010, IEEE Nuclear Science Symposuim & Medical Imaging Conference.

[2]  Charles A. Bouman,et al.  Model-based Iterative CT Image Reconstruction on GPUs , 2017, PPoPP.

[3]  Jeffrey A. Fessler,et al.  Parallelizable algorithms for X-ray CT image reconstruction with spatially non-uniform updates , 2012 .

[4]  M.D. Jones,et al.  Hybrid MPI-OpenMP Programming for Parallel OSEM PET Reconstruction , 2006, IEEE Transactions on Nuclear Science.

[5]  Daniel Kolditz,et al.  Iterative reconstruction methods in X-ray CT. , 2012, Physica medica : PM : an international journal devoted to the applications of physics to medicine and biology : official journal of the Italian Association of Biomedical Physics.

[6]  Alfred O. Hero,et al.  A Convergent Incremental Gradient Method with a Constant Step Size , 2007, SIAM J. Optim..

[7]  Yushan Gao,et al.  A Joint Row and Column Action Method for Cone-Beam Computed Tomography , 2018, IEEE Transactions on Computational Imaging.

[8]  Alexander J. Smola,et al.  Efficient mini-batch training for stochastic optimization , 2014, KDD.

[9]  Y. Censor,et al.  BICAV: a block-iterative parallel algorithm for sparse systems with pixel-related weighting , 2001, IEEE Transactions on Medical Imaging.

[10]  Charles A. Bouman,et al.  Massively Parallel 3 D Image Reconstruction , 2017 .

[11]  Gumersindo Verdú Martín,et al.  Fast parallel algorithm for CT image reconstruction , 2012, 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society.

[12]  J. Borwein,et al.  Two-Point Step Size Gradient Methods , 1988 .

[13]  Shiqian Ma,et al.  Barzilai-Borwein Step Size for Stochastic Gradient Descent , 2016, NIPS.

[14]  Arindam Banerjee,et al.  Randomized Block Coordinate Descent for Online and Stochastic Optimization , 2014, ArXiv.

[15]  Fabian Pedregosa,et al.  ASAGA: Asynchronous Parallel SAGA , 2016, AISTATS.

[16]  Yair Censor,et al.  Component averaging: An efficient iterative parallel algorithm for large and sparse unstructured problems , 2001, Parallel Comput..

[17]  M Soleimani,et al.  Introduction: a brief overview of iterative algorithms in X-ray computed tomography , 2015, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[18]  Bruno De Man,et al.  Evaluation of convergence speed of a modified Nesterov gradient method for CT reconstruction , 2012, 2012 IEEE Nuclear Science Symposium and Medical Imaging Conference Record (NSS/MIC).

[19]  Jun Ni,et al.  Parallelism of iterative CT reconstruction based on local reconstruction algorithm , 2008, The Journal of Supercomputing.

[20]  Wu-Jun Li,et al.  Fast Asynchronous Parallel Stochastic Gradient Descent: A Lock-Free Approach with Convergence Guarantee , 2016, AAAI.

[21]  Tong Zhang,et al.  Accelerating Stochastic Gradient Descent using Predictive Variance Reduction , 2013, NIPS.

[22]  J. R. BILBAO-CASTRO,et al.  Performance of Parallel 3 D Iterative Reconstruction Algorithms , 2004 .

[23]  Bruno De Man,et al.  An outlook on x-ray CT research and development. , 2008, Medical physics.

[24]  Jeffrey A. Fessler,et al.  Iterative Helical CT Reconstruction in the Cloud for Ten Dollars in Five Minutes , 2013 .

[25]  Manuchehr Soleimani,et al.  TIGRE: a MATLAB-GPU toolbox for CBCT image reconstruction , 2016 .

[26]  Zhou Yu,et al.  Fast Model-Based X-Ray CT Reconstruction Using Spatially Nonhomogeneous ICD Optimization , 2011, IEEE Transactions on Image Processing.

[27]  Patrick L. Combettes,et al.  Signal Recovery by Proximal Forward-Backward Splitting , 2005, Multiscale Model. Simul..

[28]  Jacques Felblinger,et al.  CT image quality improvement using adaptive iterative dose reduction with wide-volume acquisition on 320-detector CT , 2012, European Radiology.

[29]  Mark W. Schmidt,et al.  Online Learning Rate Adaptation with Hypergradient Descent , 2017, ICLR.

[30]  L. Feldkamp,et al.  Practical cone-beam algorithm , 1984 .

[31]  Jan Sijbers,et al.  The ASTRA Toolbox: A platform for advanced algorithm development in electron tomography. , 2015, Ultramicroscopy.

[32]  Wei Xu,et al.  On the efficiency of iterative ordered subset reconstruction algorithms for acceleration on GPUs , 2010, Comput. Methods Programs Biomed..

[33]  David Isaacson,et al.  Adaptive Kaczmarz method for image reconstruction in electrical impedance tomography , 2013, Physiological measurement.

[34]  Mark W. Schmidt,et al.  Minimizing finite sums with the stochastic average gradient , 2013, Mathematical Programming.

[35]  William Pavlicek,et al.  Abdominal CT: comparison of low-dose CT with adaptive statistical iterative reconstruction and routine-dose CT with filtered back projection in 53 patients. , 2010, AJR. American journal of roentgenology.

[36]  Stephen J. Wright,et al.  Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent , 2011, NIPS.

[37]  Xue-Ping Guo Convergence studies on block iterative algorithms for image reconstruction , 2016, Appl. Math. Comput..

[38]  Sebastian Ruder,et al.  An overview of gradient descent optimization algorithms , 2016, Vestnik komp'iuternykh i informatsionnykh tekhnologii.

[39]  Charles A. Bouman,et al.  High performance model based image reconstruction , 2016, PPoPP.

[40]  Jens Gregor,et al.  Computational Analysis and Improvement of SIRT , 2008, IEEE Transactions on Medical Imaging.

[41]  James T. Kwok,et al.  Fast Distributed Asynchronous SGD with Variance Reduction , 2015, ArXiv.

[42]  Bjorn De Sutter,et al.  A Fast Algorithm to Calculate the Exact Radiological Path through a Pixel or Voxel Space , 1998 .

[43]  Thomas Rodet,et al.  The cone-beam algorithm of Feldkamp, Davis, and Kress preserves oblique line integrals. , 2004, Medical physics.

[44]  E. Hoffman,et al.  Quantitation in Positron Emission Computed Tomography: 1. Effect of Object Size , 1979, Journal of computer assisted tomography.

[45]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[46]  Willem Jan Palenstijn,et al.  A distributed SIRT implementation for the ASTRA Toolbox , 2015 .

[47]  Stephen P. Boyd,et al.  Distributed Optimization and Statistical Learning via the Alternating Direction Method of Multipliers , 2011, Found. Trends Mach. Learn..

[48]  Arnold M. R. Schilham,et al.  Iterative reconstruction techniques for computed tomography Part 1: Technical principles , 2013, European Radiology.

[49]  Neil S. O'Brien,et al.  Computed Laminography of CFRP Using an X-Ray Cone-Beam and Robotic Sample Manipulator Systems , 2019, IEEE Transactions on Nuclear Science.

[50]  Ge Wang,et al.  Review of Parallel Computing Techniques for Computed Tomography Image Reconstruction , 2006 .