Convolutional Analysis Operator Learning: Acceleration and Convergence

Convolutional operator learning is gaining attention in many signal processing and computer vision applications. Kernel learning has mostly relied on so-called patch-domain approaches that extract and store many overlapping patches across training signals. Because of these memory demands, patch-domain methods are limited when learning kernels from large datasets (particularly with multi-layered structures, e.g., convolutional neural networks) or when applying the learned kernels to high-dimensional signal recovery problems. The so-called convolution approach does not store many overlapping patches and thus overcomes these memory problems, particularly when combined with careful algorithm design; so far it has been studied within the “synthesis” signal model, e.g., convolutional dictionary learning. This paper proposes a new convolutional analysis operator learning (CAOL) framework that learns an analysis sparsifying regularizer from the convolution perspective, and develops a new convergent Block Proximal Extrapolated Gradient method using a Majorizer (BPEG-M) to solve the corresponding block multi-nonconvex problems. To learn diverse filters within the CAOL framework, this paper introduces an orthogonality constraint that enforces a tight-frame filter condition, and a regularizer that promotes diversity between filters. Numerical experiments show that, with sharp majorizers, BPEG-M converges significantly faster on CAOL problems than the state-of-the-art block proximal gradient (BPG) method. Numerical experiments for sparse-view computed tomography show that a convolutional sparsifying regularizer learned via CAOL significantly improves reconstruction quality compared to a conventional edge-preserving regularizer. Using more and wider kernels in a learned regularizer better preserves edges in reconstructed images.
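The tight-frame filter condition mentioned above can be illustrated with a small numerical sketch. It is not the paper's implementation: it assumes 1D real-valued signals, circular boundary conditions, and the scaling D Dᵀ = (1/R) I for an R × K filter matrix D with K ≥ R, none of which is stated in the abstract. The function names are hypothetical. Under those assumptions, the total energy of the filtered signals equals the energy of the input, and the filtering is done by shifting the signal rather than by storing overlapping patches.

```python
import numpy as np

def random_tight_frame_filters(R, K, seed=0):
    """Return an R x K real filter matrix D with D @ D.T == I / R (assumed scaling)."""
    if K < R:
        raise ValueError("need K >= R for the orthogonality constraint to hold")
    rng = np.random.default_rng(seed)
    # Reduced QR of a K x R Gaussian matrix gives Q (K x R) with orthonormal columns.
    Q, _ = np.linalg.qr(rng.standard_normal((K, R)))
    return Q.T / np.sqrt(R)

def circular_filter(x, d):
    """Compute y[n] = sum_r d[r] * x[(n + r) mod N] without building a patch matrix."""
    y = np.zeros(len(x), dtype=float)
    for r, tap in enumerate(d):
        y += tap * np.roll(x, -r)  # np.roll(x, -r)[n] == x[(n + r) mod N]
    return y

def filtered_energy(x, D):
    """Sum of squared l2 norms of the signal filtered by each column of D."""
    return sum(np.sum(circular_filter(x, D[:, k]) ** 2) for k in range(D.shape[1]))

# Illustrative check with arbitrary sizes: R = 5 taps, K = 8 filters, N = 64 samples.
D = random_tight_frame_filters(R=5, K=8)
x = np.random.default_rng(1).standard_normal(64)
print(np.allclose(D @ D.T, np.eye(5) / 5))                 # orthogonality constraint
print(np.isclose(filtered_energy(x, D), np.sum(x ** 2)))   # energy preservation
```

The energy identity follows because each sample appears in exactly R circular patches, so the factor 1/R in the constraint cancels the patch overlap.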
