An Enhanced Video Super Resolution System Using Group-Based Optimized Filter-Set with Shallow Convolutional Neural Network

Scaling up video resolution has conventionally been achieved via linear interpolation; however, this method occasionally introduces blurring in the output. Super-resolution (SR), an approach to preserving image quality in enlarged still images, has been used as a substitute for linear interpolation. However, its output at times exhibits worse image quality than linear interpolation produces, primarily because the original goal of SR is to preserve image quality when a still image is enlarged. In this context, this paper proposes a fast adaptive system for scaling up to other resolutions, such as X2 using an X3 model or X3 using an X2 model, by (1) first grouping frames that would use similar filter sets and (2) then fine-tuning a shallow CNN for SR on each frame group. In our experiments, the filter sets fine-tuned for each group significantly improved PSNR over both linear interpolation and conventional SR. In the fine-tuning stage for each group, 0.5K to 2.5K iterations were sufficient to improve PSNR by 10%. By fine-tuning instead of performing full training, the number of iterations required was reduced from 3,000K to a mere 0.5K to 2.5K.
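As a rough illustration of two ideas the abstract relies on, the sketch below computes PSNR between two frames and splits a frame sequence into consecutive groups. The abstract does not state how frames are grouped, so the criterion used here (mean absolute difference between neighbouring frames, compared against a fixed threshold) is purely an assumption for illustration, as are the names `psnr`, `mean_abs_diff`, `group_frames`, and `threshold`. Frames are represented as flat lists of 8-bit pixel values to keep the example dependency-free.

```python
import math


def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized frames."""
    if len(reference) != len(estimate) or not reference:
        raise ValueError("frames must be non-empty and the same size")
    # Mean squared error over all pixels.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(peak * peak / mse)


def mean_abs_diff(a, b):
    """Average absolute pixel difference between two frames."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def group_frames(frames, threshold=30.0):
    """Split frames into consecutive groups of frame indices.

    Hypothetical criterion (not taken from the paper): a frame joins the
    current group when it differs little from the previous frame;
    otherwise it starts a new group.
    """
    groups = []
    for index, frame in enumerate(frames):
        if index > 0 and mean_abs_diff(frames[index - 1], frame) <= threshold:
            groups[-1].append(index)
        else:
            groups.append([index])
    return groups


if __name__ == "__main__":
    # Four tiny 4-pixel "frames": two dark ones followed by two bright ones.
    frames = [
        [10, 10, 10, 10],
        [12, 11, 10, 9],
        [200, 210, 190, 205],
        [198, 209, 192, 204],
    ]
    print(group_frames(frames))  # [[0, 1], [2, 3]]
    print(psnr(frames[0], frames[1]))
```

In a system of the kind described, each resulting group would then receive its own fine-tuned filter set, and PSNR against the ground-truth high-resolution frames would measure the gain; the fine-tuning step itself is omitted here because the abstract gives no architectural details beyond "shallow CNN".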
