Temporal Incoherence-Free Video Retargeting Using Foreground Aware Extrapolation

Video retargeting is a method of adjusting the aspect ratio of a given video to the target aspect ratio. However, temporal incoherence of video contents, which can occur frequently by video retargeting, is the most dominant factor that degrades the quality of retargeted videos. Current methods to maintain temporal coherence use the entire frames of the input videos; however, these methods cannot be implemented as on-time systems because of their tremendous computational complexity. As far as we know, there is no existing on-time video retargeting method that can avoid spatial distortion while perfectly maintaining temporal coherence. In this paper, we propose a novel on-time video retargeting method that can perfectly maintain temporal coherence and prevent the spatial distortion by using only two consecutive input frames. In our method, the maximum a posteriori-based foreground aware-block matching is used for the extrapolation that extends the side area of a given video to adjust its aspect ratio to the target. To maintain the temporal coherence of the extended area, the result of block matching for backward warping-based extrapolation of the start frame after the scene change occurs, is reused for the other frames until the next scene change occurs. In addition, we propose a scene scenario-adaptive fallback scheme to prevent severe distortions that can occur with reusing block matching results or extrapolation-based side extension. The simulation results showed that the proposed method greatly improved the bidirectional similarity value, which can measure the quality of video retargeting, by up to 10.26 compared with the existing on-time video retargeting methods.

[1]  Olga Sorkine-Hornung,et al.  A comparative study of image retargeting , 2010, ACM Trans. Graph..

[2]  Aljoscha Smolic,et al.  Algorithm and VLSI architecture for real-time 1080p60 video retargeting , 2012, EGGH-HPG'12.

[3]  Tihao Chiang,et al.  A hierarchical motion estimation algorithm using nonlinear pyramid for MPEG-2 , 1997, Proceedings of 1997 IEEE International Symposium on Circuits and Systems. Circuits and Systems in the Information Age ISCAS '97.

[4]  Suk-Ju Kang,et al.  Dual-dissimilarity measure-based statistical video cut detection , 2017, Journal of Real-Time Image Processing.

[5]  Young Hwan Kim,et al.  Census transform-based static caption detection for frame rate up-conversion , 2017, 2017 IEEE International Symposium on Circuits and Systems (ISCAS).

[6]  Min Xu,et al.  Context-Aware Video Retargeting via Graph Model , 2013, IEEE Transactions on Multimedia.

[7]  Yong-Jin Liu,et al.  Optimal-Scaling-Factor Assignment for Patch-wise Image Retargeting , 2013, IEEE Computer Graphics and Applications.

[8]  Denis Simakov,et al.  Summarizing visual data using bidirectional similarity , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[9]  Diego Gutierrez,et al.  Objective Quality Prediction of Image Retargeting Algorithms , 2017, IEEE Transactions on Visualization and Computer Graphics.

[10]  Young Serk Shim,et al.  A fast hierarchical motion vector estimation algorithm using mean pyramid , 1995, IEEE Trans. Circuits Syst. Video Technol..

[11]  Ariel Shamir,et al.  Cropping Scaling Seam carving Warping Multi-operator , 2009 .

[12]  Yoav Y. Schechner,et al.  Multiscale ultrawide foveated video extrapolation , 2011, 2011 IEEE International Conference on Computational Photography (ICCP).

[13]  Daniel Cohen-Or,et al.  Non-homogeneous Content-driven Video-retargeting , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[14]  Olga Sorkine-Hornung,et al.  Robust Image Retargeting via Axis‐Aligned Deformation , 2012, Comput. Graph. Forum.

[15]  Feiniu Yuan,et al.  Optimized Multioperator Image Retargeting Based on Perceptual Similarity Measure , 2017, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[16]  Xinfeng Zhang,et al.  Multiple-Level Feature-Based Measure for Retargeted Image Quality , 2018, IEEE Transactions on Image Processing.

[17]  Bo Yan,et al.  Matching-Area-Based Seam Carving for Video Retargeting , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Wen Gao,et al.  Spatiotemporal Grid Flow for Video Retargeting , 2014, IEEE Transactions on Image Processing.

[19]  O. Sorkine,et al.  Motion-based video retargeting with optimized crop-and-warp , 2010, ACM Trans. Graph..

[20]  Fabrice Urban,et al.  Extrafoveal Video Extension for an Immersive Viewing Experience , 2017, IEEE Transactions on Visualization and Computer Graphics.

[21]  Jiwon Choi,et al.  Sparse Seam-Carving for Structure Preserving Image Retargeting , 2016, J. Signal Process. Syst..

[22]  Tong-Yee Lee,et al.  Scalable and coherent video resizing with per-frame optimization , 2011, SIGGRAPH 2011.

[23]  Zhi Liu,et al.  Stretchability-aware block scaling for image retargeting , 2013, J. Vis. Commun. Image Represent..

[24]  Chao-Hung Lin,et al.  Content-Aware Video Retargeting Using Object-Preserving Warping , 2013 .

[25]  S. Avidan,et al.  Seam carving for content-aware image resizing , 2007, SIGGRAPH 2007.

[26]  Hans-Peter Seidel,et al.  Motion-aware temporal coherence for video resizing , 2009, ACM Trans. Graph..

[27]  Markus H. Gross,et al.  A system for retargeting of streaming video , 2009, ACM Trans. Graph..

[28]  Ariel Shamir,et al.  Improved seam carving for video retargeting , 2008, ACM Trans. Graph..

[29]  Weiming Dong,et al.  Optimized image resizing using seam carving and scaling , 2009, SIGGRAPH 2009.

[30]  Demin Wang,et al.  Motion-Compensated Frame Rate Up-Conversion—Part I: Fast Multi-Frame Motion Estimation , 2010, IEEE Transactions on Broadcasting.