Video denoising using separable 4D nonlocal spatiotemporal transforms

We propose a powerful video denoising algorithm that exploits temporal and spatial redundancy characterizing natural video sequences. The algorithm implements the paradigm of nonlocal grouping and collaborative filtering, where a higher-dimensional transform-domain representation is leveraged to enforce sparsity and thus regularize the data. The proposed algorithm exploits the mutual similarity between 3-D spatiotemporal volumes constructed by tracking blocks along trajectories defined by the motion vectors. Mutually similar volumes are grouped together by stacking them along an additional fourth dimension, thus producing a 4-D structure, termed group, where different types of data correlation exist along the different dimensions: local correlation along the two dimensions of the blocks, temporal correlation along the motion trajectories, and nonlocal spatial correlation (i.e. self-similarity) along the fourth dimension. Collaborative filtering is realized by transforming each group through a decorrelating 4-D separable transform and then by shrinkage and inverse transformation. In this way, collaborative filtering provides estimates for each volume stacked in the group, which are then returned and adaptively aggregated to their original position in the video. Experimental results demonstrate the effectiveness of the proposed procedure which outperforms the state of the art.

[1]  Hsueh-Ming Hang,et al.  Motion Estimation for Video Coding Standards , 1997, J. VLSI Signal Process..

[2]  Alessandro Foi,et al.  Image Denoising by Sparse 3-D Transform-Domain Collaborative Filtering , 2007, IEEE Transactions on Image Processing.

[3]  Jaakko Astola,et al.  From Local Kernel to Nonlocal Multiple-Model Image Denoising , 2009, International Journal of Computer Vision.

[4]  Mubarak Shah,et al.  Content based video matching using spatiotemporal volumes , 2008, Comput. Vis. Image Underst..

[5]  Jean-Michel Morel,et al.  Nonlocal Image and Movie Denoising , 2008, International Journal of Computer Vision.

[6]  Jeffrey C. Lagarias,et al.  Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions , 1998, SIAM J. Optim..

[7]  Alessandro Foi,et al.  JOINT IMAGE SHARPENING AND DENOISING BY 3 D TRANSFORM-DOMAIN COLLABORATIVE FILTERING , 2007 .

[8]  Jean-Michel Morel,et al.  A Review of Image Denoising Algorithms, with a New One , 2005, Multiscale Model. Simul..

[9]  Karen O. Egiazarian,et al.  Video denoising by sparse 3D transform-domain collaborative filtering , 2007, 2007 15th European Signal Processing Conference.

[10]  Abderrahim Elmoataz,et al.  Nonlocal video denoising, simplification and inpainting using discrete regularization on graphs , 2010, Signal Process..

[11]  Boracchi Giacomo,et al.  Multiframe Raw-Data Denoising Based On Block-Matching And 3-D Filtering For Low-Light Imaging And Stabilization , 2008 .

[12]  Karen O. Egiazarian,et al.  Image upsampling via spatially adaptive block-matching filtering , 2008, 2008 16th European Signal Processing Conference.

[13]  A. Foi,et al.  IMAGE AND VIDEO SUPER-RESOLUTION VIA SPATIALLY ADAPTIVE BLOCK-MATCHING FILTERING , 2008 .

[14]  John A. Nelder,et al.  A Simplex Method for Function Minimization , 1965, Comput. J..

[15]  Michael Elad,et al.  Image Sequence Denoising via Sparse and Redundant Representations , 2009, IEEE Transactions on Image Processing.

[16]  Karen O. Egiazarian,et al.  Image restoration by sparse 3D transform-domain collaborative filtering , 2008, Electronic Imaging.

[17]  Xin Li,et al.  Patch-Based Video Processing: A Variational Bayesian Approach , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Daniel DeMenthon,et al.  A Survey of Spatio-Temporal Grouping Techniques , 2002 .