论文信息 - Convolutional Analysis Operator Learning: Acceleration and Convergence

Convolutional Analysis Operator Learning: Acceleration and Convergence

Convolutional operator learning is gaining attention in many signal processing and computer vision applications. Learning kernels has mostly relied on so-called patch-domain approaches that extract and store many overlapping patches across training signals. Due to memory demands, patch-domain methods have limitations when learning kernels from large datasets – particularly with multi-layered structures, e.g., convolutional neural networks – or when applying the learned kernels to high-dimensional signal recovery problems. The so-called convolution approach does not store many overlapping patches, and thus overcomes the memory problems particularly with careful algorithmic designs; it has been studied within the “synthesis” signal model, e.g., convolutional dictionary learning. This paper proposes a new convolutional analysis operator learning (CAOL) framework that learns an analysis sparsifying regularizer with the convolution perspective, and develops a new convergent Block Proximal Extrapolated Gradient method using a Majorizer (BPEG-M) to solve the corresponding block multi-nonconvex problems. To learn diverse filters within the CAOL framework, this paper introduces an orthogonality constraint that enforces a tight-frame filter condition, and a regularizer that promotes diversity between filters. Numerical experiments show that, with sharp majorizers, BPEG-M significantly accelerates the CAOL convergence rate compared to the state-of-the-art block proximal gradient (BPG) method. Numerical experiments for sparse-view computational tomography show that a convolutional sparsifying regularizer learned via CAOL significantly improves reconstruction quality compared to a conventional edge-preserving regularizer. Using more and wider kernels in a learned regularizer better preserves edges in reconstructed images.

Jeffrey A. Fessler | Il Yong Chun | J. Fessler

[1] Jeffrey A. Fessler,et al. Momentum-Net: Fast and Convergent Iterative Neural Network for Inverse Problems , 2019, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2] David L. Donoho,et al. De-noising by soft-thresholding , 1995, IEEE Trans. Inf. Theory.

[3] I. Johnstone,et al. Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[4] Jeffrey A. Fessler,et al. Low-Rank Plus Sparse Tensor Models for Light-field Reconstruction from Focal Stack Data , 2018, 2018 IEEE 13th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP).

[5] Guillermo Sapiro,et al. Online dictionary learning for sparse coding , 2009, ICML '09.

[6] Marc Teboulle,et al. A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[7] Jean Ponce,et al. Sparse Modeling for Image and Vision Processing , 2014, Found. Trends Comput. Graph. Vis..

[8] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[9] Jian Sun,et al. Deep ADMM-Net for Compressive Sensing MRI , 2016, NIPS.

[10] Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[11] Klaus Diepold,et al. Analysis Operator Learning and its Application to Image Reconstruction , 2012, IEEE Transactions on Image Processing.

[12] Jeffrey A. Fessler,et al. Sparse-View X-Ray CT Reconstruction Using 𝓵1 Prior with Learned Transform , 2017, ArXiv.

[13] Gordon Wetzstein,et al. Fast and flexible convolutional sparse coding , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[14] Graham W. Taylor,et al. Deconvolutional networks , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[15] Quanzheng Li,et al. Iterative Low-Dose CT Reconstruction With Priors Trained by Artificial Neural Network , 2017, IEEE Transactions on Medical Imaging.

[16] Eero P. Simoncelli,et al. Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[17] Thierry Blu,et al. The SURE-LET Approach to Image Denoising , 2007, IEEE Transactions on Image Processing.

[18] Michael Elad,et al. The Little Engine That Could: Regularization by Denoising (RED) , 2016, SIAM J. Imaging Sci..

[19] A. Kruger. On Fréchet Subdifferentials , 2003 .

[20] Jian-Feng Cai,et al. Data-driven tight frame construction and image denoising , 2014 .

[21] Jeffrey A. Fessler,et al. Convolutional Dictionary Learning: Acceleration and Convergence , 2017, IEEE Transactions on Image Processing.

[22] Yoram Bresler,et al. Learning sparsifying filter banks , 2015, SPIE Optical Engineering + Applications.

[23] Stephen P. Boyd,et al. Monotonicity and restart in fast gradient methods , 2014, 53rd IEEE Conference on Decision and Control.

[24] Jeffrey A. Fessler,et al. Fast and convergent iterative image recovery using trained convolutional neural networks , 2018, 2018 56th Annual Allerton Conference on Communication, Control, and Computing (Allerton).

[25] Jeffrey A. Fessler,et al. Model-Based Image Reconstruction for MRI , 2010, IEEE Signal Processing Magazine.

[26] Yoram Bresler,et al. Learning overcomplete sparsifying transforms for signal processing , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[27] Rémi Gribonval,et al. Constrained Overcomplete Analysis Operator Learning for Cosparse Signal Modelling , 2012, IEEE Transactions on Signal Processing.

[28] Wotao Yin,et al. Online convolutional dictionary learning , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[29] Jeffrey A. Fessler,et al. Deep BCD-Net Using Identical Encoding-Decoding CNN Structures for Iterative Image Recovery , 2018, 2018 IEEE 13th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP).

[30] Chenglong Bao,et al. Convergence analysis for iterative data-driven tight frame construction scheme , 2015 .

[31] Wen Gao,et al. Image denoising via adaptive soft-thresholding based on non-local samples , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[32] Jeffrey A. Fessler,et al. Convolutional Analysis Operator Learning: Dependence on Training Data , 2019, IEEE Signal Processing Letters.

[33] D. Hunter,et al. Optimization Transfer Using Surrogate Objective Functions , 2000 .

[34] Il Yong Chun,et al. Incorporating Handcrafted Filters in Convolutional Analysis Operator Learning for Ill-Posed Inverse Problems , 2019, 2019 IEEE 8th International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP).

[35] Lei Zhang,et al. Low-Dose X-ray CT Reconstruction via Dictionary Learning , 2012, IEEE Transactions on Medical Imaging.

[36] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[37] Geoffrey E. Hinton,et al. Rectified Linear Units Improve Restricted Boltzmann Machines , 2010, ICML.

[38] Yann LeCun,et al. What is the best multi-stage architecture for object recognition? , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[39] Emmanuel J. Candès,et al. Adaptive Restart for Accelerated Gradient Schemes , 2012, Foundations of Computational Mathematics.

[40] Y. Nesterov. Gradient methods for minimizing composite objective function , 2007 .

[41] Jeffrey A. Fessler,et al. Improved low-count quantitative PET reconstruction with a variational neural network , 2019, ArXiv.

[42] Ben Adcock,et al. Compressed Sensing and Parallel Acquisition , 2016, IEEE Transactions on Information Theory.

[43] P. Tseng. Convergence of a Block Coordinate Descent Method for Nondifferentiable Minimization , 2001 .

[44] Michael Elad,et al. Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[45] Jian-Feng Cai,et al. Fast Sparsity-Based Orthogonal Dictionary Learning for Image Restoration , 2013, 2013 IEEE International Conference on Computer Vision.

[46] Jeffrey A. Fessler,et al. On complete-data spaces for PET reconstruction algorithms , 1993 .

[47] Yoram Bresler,et al. Automatic parameter tuning for image denoising with learned sparsifying transforms , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[48] Ben Adcock,et al. Uniform Recovery from Subgaussian Multi-Sensor Measurements , 2016, Applied and Computational Harmonic Analysis.

[49] J. Fessler,et al. Spatial resolution properties of penalized-likelihood image reconstruction: space-invariant tomographs , 1996, 5th IEEE EMBS International Summer School on Biomedical Imaging, 2002..

[50] Jeffrey A. Fessler,et al. BCD-Net for Low-dose CT Reconstruction: Acceleration, Convergence, and Generalization , 2019, MICCAI.

[51] Mark D. Plumbley,et al. Learning Incoherent Dictionaries for Sparse Approximation Using Iterative Projections and Rotations , 2013, IEEE Transactions on Signal Processing.

[52] C. Reinsch. Smoothing by spline functions , 1967 .

[53] C. Reinsch. Smoothing by spline functions. II , 1971 .

[54] Anders P. Eriksson,et al. Fast Convolutional Sparse Coding , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[55] Jeffrey A. Fessler,et al. An Expanded Theoretical Treatment of Iteration-Dependent Majorize-Minimize Algorithms , 2007, IEEE Transactions on Image Processing.

[56] Guillermo Sapiro,et al. Supervised Dictionary Learning , 2008, NIPS.

[57] R. Tibshirani,et al. Sparse Principal Component Analysis , 2006 .

[58] Brendt Wohlberg,et al. Efficient Algorithms for Convolutional Sparse Representations , 2016, IEEE Transactions on Image Processing.

[59] M. Elad,et al. $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[60] Hu Chen,et al. LEARN: Learned Experts’ Assessment-Based Reconstruction Network for Sparse-Data CT , 2017, IEEE Transactions on Medical Imaging.

[61] Andrew Y. Ng,et al. Learning Feature Representations with K-Means , 2012, Neural Networks: Tricks of the Trade.

[62] Jeffrey A. Fessler,et al. Convergent convolutional dictionary learning using Adaptive Contrast Enhancement (CDL-ACE): Application of CDL to image denoising , 2017, 2017 International Conference on Sampling Theory and Applications (SampTA).

[63] Karin Schnass,et al. Dictionary Identification—Sparse Matrix-Factorization via $\ell_1$ -Minimization , 2009, IEEE Transactions on Information Theory.

[64] Yoram Bresler,et al. $\ell_{0}$ Sparsifying Transform Learning With Efficient Optimal Updates and Convergence Guarantees , 2015, IEEE Transactions on Signal Processing.

[65] Martin Vetterli,et al. Adaptive wavelet thresholding for image denoising and compression , 2000, IEEE Trans. Image Process..

[66] Simon R. Arridge,et al. Solving inverse problems using data-driven models , 2019, Acta Numerica.

[67] Lars Eldèn. Solving Quadratically Constrained Least Squares Problems Using a Differential-Geometric Approach , 2002 .

[68] Yunjin Chen,et al. Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[69] Wangmeng Zuo,et al. Learning Deep CNN Denoiser Prior for Image Restoration , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[70] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[71] Jeffrey A. Fessler,et al. Application of trained Deep BCD-Net to iterative low-count PET image reconstruction , 2018, 2018 IEEE Nuclear Science Symposium and Medical Imaging Conference Proceedings (NSS/MIC).

[72] W P Segars,et al. Realistic CT simulation using the 4D XCAT phantom. , 2008, Medical physics.

[73] David J. Field,et al. Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[74] Wotao Yin,et al. A Globally Convergent Algorithm for Nonconvex Optimization Based on Block Coordinate Update , 2014, J. Sci. Comput..

[75] Stanley H. Chan,et al. Plug-and-Play Unplugged: Optimization Free Reconstruction using Consensus Equilibrium , 2017, SIAM J. Imaging Sci..

[76] Michael Elad,et al. Convolutional Neural Networks Analyzed via Convolutional Sparse Coding , 2016, J. Mach. Learn. Res..

[77] Jeffrey A. Fessler,et al. Convolutional analysis operator learning: Application to sparse-view CT : (Invited Paper) , 2018, 2018 52nd Asilomar Conference on Signals, Systems, and Computers.

[78] Wotao Yin,et al. A Block Coordinate Descent Method for Regularized Multiconvex Optimization with Applications to Nonnegative Tensor Factorization and Completion , 2013, SIAM J. Imaging Sci..