Simultaneously retargeting and super-resolution for stereoscopic video

This paper presents a novel approach that is able to resize stereoscopic video to fit various display environments with different aspect-ratios, while preserving the prominent content, keeping temporally consistent, adapting depth, as well as increasing the resolution. Our proposed approach can deal with retargeting and super-resolution problems simultaneously via replacing the down-sampling matrix appearing in super-resolution algorithm with a novel one, named as content-aware-sampling matrix, derived from retargeting method. The new matrix can sample the image into any resolution while preserving its important information as much as possible. Our approach can be roughly subdivided into three steps. In the first step, we calculate the overall saliency map for a shot, while considering the conspicuous information such as motion, depth, and structures. In the second step, given a certain resolution, we compute the retargeting parameters by a global optimization and formulate them into a matrix. Finally, we substitute the matrix into the objective function of super-resolution to achieve high visual quality images with expected resolution. In addition, we propose a novel single image super-resolution method inspired by a blind image deblurring method. The experimental results based on user studies verify the effectiveness of our approach. And the comparisons with the-state-of-the-art single image super-resolution methods validate the potential of our super-resolution method.

[1]  Takeshi Naemura,et al.  Seam carving for stereo images , 2010, 2010 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[2]  Luc Van Gool,et al.  A+: Adjusted Anchored Neighborhood Regression for Fast Super-Resolution , 2014, ACCV.

[3]  Jing Zhang,et al.  A simultaneous method for 3D video super-resolution and high-quality depth estimation , 2013, 2013 IEEE International Conference on Image Processing.

[4]  Jun Yu,et al.  3D facial motion tracking by combining online appearance model and cylinder head model in particle filtering , 2014, Science China Information Sciences.

[5]  Deqing Sun,et al.  A Bayesian approach to adaptive video super resolution , 2011, CVPR 2011.

[6]  Yanning Zhang,et al.  Single Image Super-resolution Using Deformable Patches , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Ce Liu,et al.  Exploring new representations and applications for motion analysis , 2009 .

[8]  Yung-Yu Chuang,et al.  Content-Aware Display Adaptation and Interactive Editing for Stereoscopic Images , 2011, IEEE Transactions on Multimedia.

[9]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[10]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[11]  O. Sorkine,et al.  Optimized scale-and-stretch for image resizing , 2008, SIGGRAPH 2008.

[12]  查正军,et al.  A Unified Scheme for Super-resolution and Depth Estimation from Asymmetric Stereoscopic Video , 2016 .

[13]  Jing Zhang,et al.  A new closed loop method of super-resolution for multi-view images , 2013, Machine Vision and Applications.

[14]  Ariel Shamir,et al.  Improved seam carving for video retargeting , 2008, SIGGRAPH 2008.

[15]  Luc Van Gool,et al.  Anchored Neighborhood Regression for Fast Example-Based Super-Resolution , 2013, 2013 IEEE International Conference on Computer Vision.

[16]  Michael Elad,et al.  On Single Image Scale-Up Using Sparse-Representations , 2010, Curves and Surfaces.

[17]  Jun Yu,et al.  A Video, Text, and Speech-Driven Realistic 3-D Virtual Head for Human–Machine Interface , 2015, IEEE Transactions on Cybernetics.

[18]  Wu-chi Feng,et al.  Aesthetics-Based Stereoscopic Photo Cropping for Heterogeneous Displays , 2012, IEEE Transactions on Multimedia.

[19]  Russell Zaretzki,et al.  Beta Process Joint Dictionary Learning for Coupled Feature Spaces with Application to Single Image Super-Resolution , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  S. Mukhopadhyay,et al.  MVEM-Based Fault Diagnosis of Automotive Engines Using Dempster–Shafer Theory and Multiple Hypotheses Testing , 2015, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[21]  Aline Roumy,et al.  Low-Complexity Single-Image Super-Resolution based on Nonnegative Neighbor Embedding , 2012, BMVC.

[22]  R. Molina,et al.  Bayesian Super-Resolution image reconstruction using an ℓ1 prior , 2009, 2009 Proceedings of 6th International Symposium on Image and Signal Processing and Analysis.

[23]  Wolfgang Effelsberg,et al.  Warping-based video retargeting for stereoscopic video , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[24]  Mário A. T. Figueiredo,et al.  New stopping criteria for iterative blind image deblurring based on residual whiteness measures , 2011, 2011 IEEE Statistical Signal Processing Workshop (SSP).

[25]  Ariel Shamir,et al.  Seam Carving for Content-Aware Image Resizing , 2007, ACM Trans. Graph..

[26]  Thomas S. Huang,et al.  Image Super-Resolution Via Sparse Representation , 2010, IEEE Transactions on Image Processing.

[27]  Luís B. Almeida,et al.  Blind and Semi-Blind Deblurring of Natural Images , 2010, IEEE Transactions on Image Processing.

[28]  Chih-Yuan Yang,et al.  Fast Direct Super-Resolution by Simple Functions , 2013, 2013 IEEE International Conference on Computer Vision.

[29]  Michael Elad,et al.  Fast and robust multiframe super resolution , 2004, IEEE Transactions on Image Processing.

[30]  Wolfgang Effelsberg,et al.  Seam carving for stereoscopic video , 2013, IVMSP 2013.

[31]  C. Lawrence Zitnick,et al.  Structured Forests for Fast Edge Detection , 2013, 2013 IEEE International Conference on Computer Vision.