Temporally Coherent Superresolution of Textured Video via Dynamic Texture Synthesis

This paper addresses the problem of hallucinating the missing high-resolution (HR) details of a low-resolution (LR) video while maintaining the temporal coherence of the reconstructed HR details using dynamic texture synthesis (DTS). Most existing multiframe-based video superresolution (SR) methods suffer from the problem of limited reconstructed visual quality due to inaccurate subpixel motion estimation between frames in an LR video. To achieve high-quality reconstruction of HR details for an LR video, we propose a texture-synthesis (TS)-based video SR method, in which a novel DTS scheme is proposed to render the reconstructed HR details in a temporally coherent way, which effectively addresses the temporal incoherence problem caused by traditional TS-based image SR methods. To further reduce the complexity of the proposed method, our method only performs the TS-based SR on a set of key frames, while the HR details of the remaining nonkey frames are simply predicted using the bidirectional overlapped block motion compensation. After all frames are upscaled, the proposed DTS-SR is applied to maintain the temporal coherence in the HR video. Experimental results demonstrate that the proposed method achieves significant subjective and objective visual quality improvement over state-of-the-art video SR methods.

[1]  William T. Freeman,et al.  Example-Based Super-Resolution , 2002, IEEE Computer Graphics and Applications.

[2]  Li-Wei Kang,et al.  Video super-resolution via dynamic texture synthesis , 2014, 2014 IEEE 16th International Workshop on Multimedia Signal Processing (MMSP).

[3]  Alan C. Bovik,et al.  Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos , 2010, IEEE Transactions on Image Processing.

[4]  Mads Nielsen,et al.  Video Super-Resolution Using Simultaneous Motion and Intensity Calculations , 2011, IEEE Transactions on Image Processing.

[5]  Thomas S. Huang,et al.  Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[6]  Lei Zhang,et al.  Nonlocal back-projection for adaptive image enlargement , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[7]  Sabine Süsstrunk,et al.  Higher Order SVD Analysis for Dynamic Texture Synthesis , 2008, IEEE Transactions on Image Processing.

[8]  Hsieh Hou,et al.  Cubic splines for image interpolation and digital filtering , 1978 .

[9]  Lei Zhang,et al.  Image Deblurring and Super-Resolution by Adaptive Sparse Domain Selection and Adaptive Regularization , 2010, IEEE Transactions on Image Processing.

[10]  Zhiwei Xiong,et al.  Robust Web Image/Video Super-Resolution , 2010, IEEE Transactions on Image Processing.

[11]  Byung Cheol Song,et al.  Video Super-Resolution Algorithm Using Bi-Directional Overlapped Block Motion Compensation and On-the-Fly Dictionary Training , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Irfan A. Essa,et al.  Graphcut textures: image and video synthesis using graph cuts , 2003, ACM Trans. Graph..

[13]  Michael Elad,et al.  Fast and robust multiframe super resolution , 2004, IEEE Transactions on Image Processing.

[14]  Muhammad Waqas Anwar,et al.  A Multifaceted Independent Performance Analysis of Facial Subspace Recognition Algorithms , 2013, PloS one.

[15]  David J. Fleet,et al.  Higher-order Autoregressive Models for Dynamic Textures , 2007, BMVC.

[16]  Chia-Hung Yeh,et al.  Self-learning-based single image super-resolution of a highly compressed image , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[17]  Thomas S. Huang,et al.  Coupled Dictionary Training for Image Super-Resolution , 2012, IEEE Transactions on Image Processing.

[18]  Halim Fathoni,et al.  DEPARTMENT OF COMPUTER SCIENCE AND INFORMATION ENGINEERING , 2008 .

[19]  Nipun Kwatra,et al.  Texture optimization for example-based synthesis , 2005, ACM Trans. Graph..

[20]  Edson M. Hung,et al.  Video Super-Resolution Using Codebooks Derived From Key-Frames , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Matti Pietikäinen,et al.  Dynamic Texture Recognition Using Local Binary Patterns with an Application to Facial Expressions , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[22]  Yu-Chiang Frank Wang,et al.  A Self-Learning Approach to Single Image Super-Resolution , 2013, IEEE Transactions on Multimedia.

[23]  Ling Shao,et al.  Subspace learning for silhouette based human action recognition , 2010, Visual Communications and Image Processing.

[24]  Moon Gi Kang,et al.  Super-resolution image reconstruction: a technical overview , 2003, IEEE Signal Process. Mag..

[25]  Richard Szeliski,et al.  Video textures , 2000, SIGGRAPH.

[26]  Mohammed Ghanbari,et al.  Standard Codecs: Image Compression to Advanced Video Coding , 2003 .

[27]  Jong-Seok Lee,et al.  On Designing Paired Comparison Experiments for Subjective Multimedia Quality Assessment , 2014, IEEE Transactions on Multimedia.

[28]  Michael Elad,et al.  Generalizing the Nonlocal-Means to Super-Resolution Reconstruction , 2009, IEEE Transactions on Image Processing.

[29]  Takeo Kanade,et al.  Limits on super-resolution and how to break them , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[30]  Debargha Mukherjee,et al.  Super-resolution of video using key frames and motion estimation , 2008, 2008 15th IEEE International Conference on Image Processing.

[31]  Jiawei Han,et al.  Orthogonal Laplacianfaces for Face Recognition , 2006, IEEE Transactions on Image Processing.

[32]  Michal Irani,et al.  Super-resolution from a single image , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[33]  Chia-Wen Lin,et al.  Image super-resolution via feature-based affine transform , 2011, 2011 IEEE 13th International Workshop on Multimedia Signal Processing.

[34]  Raanan Fattal,et al.  Image upsampling via texture hallucination , 2010, 2010 IEEE International Conference on Computational Photography (ICCP).

[35]  Jie Ren,et al.  Context-Aware Sparse Decomposition for Image Denoising and Super-Resolution , 2013, IEEE Transactions on Image Processing.

[36]  Stefano Soatto,et al.  Dynamic Textures , 2003, International Journal of Computer Vision.

[37]  Xiaofei He,et al.  Locality Preserving Projections , 2003, NIPS.

[38]  Narendra Ahuja,et al.  Maximum Margin Distance Learning for Dynamic Texture Recognition , 2010, ECCV.