Con-Patch: When a Patch Meets Its Context

Measuring the similarity between the patches in images is a fundamental building block in various tasks. Naturally, the patch size has a major impact on the matching quality and on the consequent application performance. Under the assumption that our patch database is sufficiently sampled, using large patches (e.g., 21 × 21) should be preferred over small ones (e.g., 7 × 7). However, this dense-sampling assumption is rarely true; in most cases, large patches cannot find relevant nearby examples. This phenomenon is a consequence of the curse of dimensionality, stating that the database size should grow exponentially with the patch size to ensure proper matches. This explains the favored choice of small patch size in most applications. Is there a way to keep the simplicity and work with small patches while getting some of the benefits that large patches provide? In this paper, we offer such an approach. We propose to concatenate the regular content of a conventional (small) patch with a compact representation of its (large) surroundings-its context. Therefore, with a minor increase of the dimensions (e.g., with additional ten values to the patch representation), we implicitly/softly describe the information of a large patch. The additional descriptors are computed based on a self-similarity behavior of the patch surrounding. We show that this approach achieves better matches, compared with the use of conventional-size patches, without the need to increase the database-size. Also, the effectiveness of the proposed method is tested on three distinct problems: 1) external natural image denoising; 2) depth image super-resolution; and 3) motion-compensated frame-rate up conversion.

[1]  Jiejie Zhu,et al.  Context-constrained hallucination for image super-resolution , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[2]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[3]  Eli Shechtman,et al.  Matching Local Self-Similarities across Images and Videos , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[4]  Giacomo Boracchi,et al.  Foveated self-similarity in nonlocal image filtering , 2012, Electronic Imaging.

[5]  Michal Irani,et al.  Internal statistics of a single natural image , 2011, CVPR 2011.

[6]  Demin Wang,et al.  Motion-Compensated Frame Rate Up-Conversion—Part II: New Algorithms for Frame Interpolation , 2010, IEEE Transactions on Broadcasting.

[7]  Michael Elad,et al.  Multi-Scale Patch-Based Image Restoration , 2016, IEEE Transactions on Image Processing.

[8]  Anat Levin,et al.  Natural image denoising: Optimality and inherent bounds , 2011, CVPR 2011.

[9]  Alfred M. Bruckstein,et al.  Motion-Compensated Coding and Frame Rate Up-Conversion: Models and Analysis , 2014, IEEE Transactions on Image Processing.

[10]  Demin Wang,et al.  Motion-Compensated Frame Rate Up-Conversion—Part I: Fast Multi-Frame Motion Estimation , 2010, IEEE Transactions on Broadcasting.

[11]  Jitendra Malik,et al.  A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[12]  Aleksandra Pizurica,et al.  Context-Aware Patch-Based Image Inpainting Using Markov Random Field Modeling , 2015, IEEE Transactions on Image Processing.

[13]  Peyman Milanfar,et al.  A General Framework for Regularized, Similarity-Based Image Restoration , 2014, IEEE Transactions on Image Processing.

[14]  Yair Weiss,et al.  From learning models of natural image patches to whole image restoration , 2011, 2011 International Conference on Computer Vision.

[15]  Jean-Michel Morel,et al.  A non-local algorithm for image denoising , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[16]  Gabriel J. Brostow,et al.  Patch Based Synthesis for Single Depth Image Super-Resolution , 2012, ECCV.

[17]  Borko Furht,et al.  Motion estimation algorithms for video compression , 1996 .

[18]  Jiang Li,et al.  A low complexity motion compensated frame interpolation method , 2005, 2005 IEEE International Symposium on Circuits and Systems.

[19]  Michael Elad,et al.  Patch-disagreement as away to improve K-SVD denoising , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  D. Scharstein,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, Proceedings IEEE Workshop on Stereo and Multi-Baseline Vision (SMBV 2001).

[21]  Lei Zhang,et al.  Weighted Nuclear Norm Minimization with Application to Image Denoising , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Sung-Hoon Hong,et al.  Frame rate up-conversion using forward-backward jointing motion estimation and spatio-temporal motion vector smoothing , 2009, 2009 International Conference on Computer Engineering & Systems.

[23]  Michal Irani,et al.  Combining the power of Internal and External denoising , 2013, IEEE International Conference on Computational Photography (ICCP).

[24]  Frédo Durand,et al.  Patch Complexity, Finite Pixel Correlations and Optimal Denoising , 2012, ECCV.

[25]  Rae-Hong Park,et al.  Weighted-adaptive motion-compensated frame rate up-conversion , 2003, IEEE Trans. Consumer Electron..

[26]  Alessandro Foi,et al.  Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[27]  Michael Elad,et al.  Trainlets: Dictionary Learning in High Dimensions , 2016, IEEE Transactions on Signal Processing.

[28]  Sung-Jea Ko,et al.  New frame rate up-conversion using bi-directional motion estimation , 2000, IEEE Trans. Consumer Electron..

[29]  Michael Elad,et al.  Improving K-SVD denoising by post-processing its method-noise , 2013, 2013 IEEE International Conference on Image Processing.

[30]  Michael Elad,et al.  Boosting of Image Denoising Algorithms , 2015, SIAM J. Imaging Sci..

[31]  Truong Q. Nguyen,et al.  Adaptive Image Denoising by Targeted Databases , 2014, IEEE Transactions on Image Processing.

[32]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[33]  Michal Irani,et al.  Super-resolution from a single image , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[34]  Jie Dong,et al.  Adaptive Downsampling for High-Definition Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Marcelo Bertalmío,et al.  A Decomposition Framework for Image Denoising Algorithms , 2016, IEEE Transactions on Image Processing.

[36]  Lihi Zelnik-Manor,et al.  Context-aware saliency detection , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[37]  Jean-Michel Morel,et al.  A Review of Image Denoising Algorithms, with a New One , 2005, Multiscale Model. Simul..

[38]  Michael Elad,et al.  Image Denoising Via Sparse and Redundant Representations Over Learned Dictionaries , 2006, IEEE Transactions on Image Processing.

[39]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[40]  David G. Lowe,et al.  Scalable Nearest Neighbor Algorithms for High Dimensional Data , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[41]  Michael Elad,et al.  Single Image Interpolation Via Adaptive Nonlocal Sparsity-Based Modeling , 2014, IEEE Transactions on Image Processing.

[42]  A. Torralba,et al.  The role of context in object recognition , 2007, Trends in Cognitive Sciences.

[43]  James Hays,et al.  Super-resolution from internet-scale scene matching , 2012, 2012 IEEE International Conference on Computational Photography (ICCP).