Learnable Descent Algorithm for Nonsmooth Nonconvex Image Reconstruction

We propose a general learning-based framework for solving nonsmooth and nonconvex image reconstruction problems. We model the regularization function as the composition of the $l_{2,1}$ norm and a smooth but nonconvex feature mapping parametrized as a deep convolutional neural network. We develop a provably convergent descent-type algorithm to solve the nonsmooth nonconvex minimization problem by leveraging Nesterov's smoothing technique and the idea of residual learning, and we learn the network parameters such that the outputs of the algorithm match the references in the training data. Our method is versatile, as one can incorporate various modern network structures into the regularization, and the resulting network inherits the guaranteed convergence of the algorithm. We also show that the proposed network is parameter-efficient and that its performance compares favorably to state-of-the-art methods in a variety of image reconstruction problems in practice.
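To make the smoothing step concrete, the following is a minimal sketch of a Nesterov-type (Huber-like) smoothing of the $l_{2,1}$ norm together with its gradient. It is an illustration under stated assumptions, not the implementation used in the paper: the feature mapping is replaced by a plain array `G` whose rows stand in for feature vectors, and the names `smoothed_l21`, `smoothed_l21_grad`, and the smoothing parameter `eps` are hypothetical.

```python
import numpy as np

def smoothed_l21(G, eps):
    """Smoothed l_{2,1} norm of G (rows are feature vectors; assumed layout).

    Each row norm ||g|| is replaced by
        ||g||^2 / (2*eps)   if ||g|| <= eps,
        ||g|| - eps/2       otherwise,
    which is the closed form of max_{||y||<=1} <g, y> - (eps/2)*||y||^2.
    """
    norms = np.linalg.norm(G, axis=1)
    vals = np.where(norms <= eps, norms**2 / (2.0 * eps), norms - eps / 2.0)
    return vals.sum()

def smoothed_l21_grad(G, eps):
    """Gradient of smoothed_l21 with respect to G.

    Row-wise this is g/eps when ||g|| <= eps and g/||g|| otherwise,
    so it is well defined (and zero) at g = 0.
    """
    norms = np.linalg.norm(G, axis=1, keepdims=True)
    return G / np.maximum(norms, eps)

# Toy usage: three 2-dimensional feature vectors (illustrative values only).
G = np.array([[3.0, 4.0], [0.0, 0.0], [0.06, 0.08]])
eps = 0.5
print(smoothed_l21(G, eps))
print(smoothed_l21_grad(G, eps))
```

The smoothed function is differentiable everywhere and underestimates the exact $l_{2,1}$ norm by at most `eps / 2` per row, which is what makes a gradient-based descent scheme applicable; in the paper's setting, `G` would be the output of the learned feature mapping rather than a fixed array.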
