论文信息 - From One-Trick Ponies to All-Rounders: On-Demand Learning for Image Restoration

From One-Trick Ponies to All-Rounders: On-Demand Learning for Image Restoration

While machine learning approaches to image restoration offer great promise, current methods risk training “onetrick ponies” that perform well only for image corruption of a particular level of difficulty—such as a certain level of noise or blur. First, we examine the weakness of a one-trick pony model and demonstrate that training general models to handle arbitrary levels of corruption is indeed non-trivial. Then, we propose an on-demand learning algorithm for training image restoration models with deep convolutional neural networks. The main idea is to exploit a feedback mechanism to self-generate training instances where they are needed most, thereby learning models that can generalize across difficulty levels. On four restoration tasks—image inpainting, pixel interpolation, image deblurring, and image denoising—and three diverse datasets, our approach consistently outperforms both the status quo training procedure and curriculum learning alternatives.

Kristen Grauman | Ruohan Gao | K. Grauman | Ruohan Gao

[1] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[2] Yu-Bin Yang,et al. Image Restoration Using Very Deep Convolutional Encoder-Decoder Networks with Symmetric Skip Connections , 2016, NIPS.

[3] Alexei A. Efros,et al. Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4] Sergey Ioffe,et al. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[5] Minh N. Do,et al. Semantic Image Inpainting with Perceptual and Contextual Losses , 2016, ArXiv.

[6] Joachim Denzler,et al. Selecting Influential Examples: Active Learning with Expected Model Output Changes , 2014, ECCV.

[7] Li Fei-Fei,et al. Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[8] Xin Li,et al. Multi-level Adaptive Active Learning for Scene Classification , 2014, ECCV.

[9] Bo Du,et al. Multi-label Active Learning Based on Maximum Correntropy Criterion: Towards Robust and Discriminative Labeling , 2016, ECCV.

[10] Bernhard Schölkopf,et al. A Machine Learning Approach for Non-blind Image Deconvolution , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[11] Thomas Brox,et al. Learning to generate chairs with convolutional neural networks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Yunjin Chen,et al. Trainable Nonlinear Reaction Diffusion: A Flexible Framework for Fast and Effective Image Restoration , 2015, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13] Martin J. Wainwright,et al. Image denoising using scale mixtures of Gaussians in the wavelet domain , 2003, IEEE Trans. Image Process..

[14] Xiaogang Wang,et al. Deep Learning Face Attributes in the Wild , 2014, 2015 IEEE International Conference on Computer Vision (ICCV).

[15] Richard Szeliski,et al. Noise Estimation from a Single Image , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[16] Stefan Roth,et al. Shrinkage Fields for Effective Image Restoration , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[17] Enhong Chen,et al. Image Denoising and Inpainting with Deep Neural Networks , 2012, NIPS.

[18] M. Elad,et al. $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[19] Ce Liu,et al. Deep Convolutional Neural Network for Image Deconvolution , 2014, NIPS.

[20] H. Sebastian Seung,et al. Natural Image Denoising with Convolutional Networks , 2008, NIPS.

[21] Geoffrey E. Hinton,et al. ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[22] Gregory Shakhnarovich,et al. Learning Representations for Automatic Colorization , 2016, ECCV.

[23] A. Bruckstein,et al. K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[24] Yong Jae Lee,et al. Learning the easy things first: Self-paced visual category discovery , 2011, CVPR 2011.

[25] Soumith Chintala,et al. Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[26] Xiaoou Tang,et al. Image Super-Resolution Using Deep Convolutional Networks , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[27] Yoshua Bengio,et al. Extracting and composing robust features with denoising autoencoders , 2008, ICML '08.

[28] In-So Kweon,et al. Natural Image Matting Using Deep Convolutional Neural Networks , 2016, ECCV.

[29] Ming-Hsuan Yang,et al. Learning Recursive Filters for Low-Level Vision via a Hybrid Neural Network , 2016, ECCV.

[30] Rob Fergus,et al. Visualizing and Understanding Convolutional Networks , 2013, ECCV.

[31] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[32] Guangyong Chen,et al. An Efficient Statistical Method for Image Noise Level Estimation , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[33] Alexei A. Efros,et al. Colorful Image Colorization , 2016, ECCV.

[34] Lei Zhang,et al. Weighted Nuclear Norm Minimization with Application to Image Denoising , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[35] Allen Y. Yang,et al. A Convex Optimization Framework for Active Learning , 2013, 2013 IEEE International Conference on Computer Vision.

[36] J. Elman. Learning and development in neural networks: the importance of starting small , 1993, Cognition.

[37] Rob Fergus,et al. Restoring an Image Taken through a Window Covered with Dirt or Rain , 2013, 2013 IEEE International Conference on Computer Vision.

[38] Rong Jin,et al. Active Learning by Querying Informative and Representative Examples , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39] Krista A. Ehinger,et al. SUN Database: Exploring a Large Collection of Scene Categories , 2014, International Journal of Computer Vision.

[40] Trevor Darrell,et al. Gaussian Processes for Object Categorization , 2010, International Journal of Computer Vision.

[41] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[42] Alessandro Foi,et al. Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[43] Kristen Grauman,et al. Large-scale live active learning: Training object detectors with crawled data and crowds , 2011, CVPR.

[44] Andrew L. Maas. Rectifier Nonlinearities Improve Neural Network Acoustic Models , 2013 .

[45] Xinhao Liu,et al. Single-Image Noise Level Estimation for Blind Denoising , 2013, IEEE Transactions on Image Processing.

[46] Stefan Harmeling,et al. Image denoising: Can plain neural networks compete with BM3D? , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[47] Christoph H. Lampert,et al. Curriculum learning of multiple tasks , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48] Tianqi Chen,et al. Empirical Evaluation of Rectified Activations in Convolutional Network , 2015, ArXiv.

[49] Jason Weston,et al. Curriculum learning , 2009, ICML '09.

[50] Joachim Denzler,et al. Active learning and discovery of object categories in the presence of unnameable instances , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[51] Daphne Koller,et al. Self-Paced Learning for Latent Variable Models , 2010, NIPS.