A novel compact yet rich key frame creation method for compressed video summarization

Video summarization has great potential to enable rapid browsing and efficient video indexing in many applications. In this study, we propose a novel compact yet rich key frame creation method for compressed video summarization. First, we directly extract DC coefficients of I frame from a compressed video stream, and DC-based mutual information is computed to segment the long video into shots. Then, we select shots with static background and moving object according to the intensity and range of motion vector in the video stream. Detecting moving object outliers in each selected shot, the optimal object set is then selected by importance ranking and solving an optimum programming problem. Finally, we conduct an improved KNN matting approach on the optimal object outliers to automatically and seamlessly splice these outliers to the final key frame as video summarization. Previous video summarization methods typically select one or more frames from the original video as the video summarization. However, these existing key frame representation approaches for video summarization eliminate the time axis and lose the dynamic aspect of the video scene. The proposed video summarization preserves both compactness and considerably richer information than previous video summaries. Experimental results indicate that the proposed key frame representation not only includes abundant semantics but also is natural, which satisfies user preferences.

[1]  Chi-Keung Tang,et al.  KNN Matting , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[2]  Yael Pritch,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008 1 Non-Chronological Video , 2022 .

[3]  Shiyang Lu,et al.  Keypoint-Based Keyframe Selection , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Meng Wang,et al.  Event Driven Web Video Summarization by Tag Localization and Key-Shot Identification , 2012, IEEE Transactions on Multimedia.

[5]  Arnaldo de Albuquerque Araújo,et al.  VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method , 2011, Pattern Recognit. Lett..

[6]  Xiaowei Zhou,et al.  Moving Object Detection by Detecting Contiguous Outliers in the Low-Rank Representation , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Y. L. Liu,et al.  A Robust Image Hashing Algorithm Resistant Against Geometrical Attacks , 2013 .

[8]  Kristen Grauman,et al.  Diverse Sequential Subset Selection for Supervised Video Summarization , 2014, NIPS.

[9]  C. Schmid,et al.  Category-Specific Video Summarization , 2014, ECCV.

[10]  Yelena Yesha,et al.  Keyframe-based video summarization using Delaunay clustering , 2006, International Journal on Digital Libraries.

[11]  Chong-Wah Ngo,et al.  Motion-Based Video Representation for Scene Change Detection , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.

[12]  Marco Pellegrini,et al.  STIMO: STIll and MOving video storyboard for the web scenario , 2009, Multimedia Tools and Applications.

[13]  Georgios Tziritas,et al.  Equivalent Key Frames Selection Based on Iso-Content Principles , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Yael Pritch,et al.  Making a Long Video Short: Dynamic Video Synopsis , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[15]  Mohammad Rahmati,et al.  Content based video retrieval using information theory , 2013, 2013 8th Iranian Conference on Machine Vision and Image Processing (MVIP).

[16]  Yong Yu,et al.  Video summarization via transferrable structured learning , 2011, WWW.

[17]  Maneesh Agrawala,et al.  Interactive video cutout , 2005, SIGGRAPH 2005.

[18]  Aristidis Likas,et al.  Weighted multi-view key-frame extraction , 2016, Pattern Recognit. Lett..

[19]  Yongwei Nie,et al.  Compact Video Synopsis via Global Spatiotemporal Optimization , 2013, IEEE Trans. Vis. Comput. Graph..

[20]  Qiang Zhang,et al.  An Efficient Method of Key-Frame Extraction Based on a Cluster Algorithm , 2013, Journal of human kinetics.

[21]  Jianmin Jiang,et al.  A novel clustering method for static video summarization , 2017, Multimedia Tools and Applications.

[22]  John R. Kender,et al.  Video summaries and cross-referencing through mosaic-based representation , 2004, Comput. Vis. Image Underst..

[23]  Sung Wook Baik,et al.  Feature aggregation based visual attention model for video summarization , 2014, Comput. Electr. Eng..

[24]  Junaid Baber,et al.  Shot boundary detection from videos using entropy and local descriptor , 2011, 2011 17th International Conference on Digital Signal Processing (DSP).

[25]  Ioannis Pitas,et al.  Information theory-based shot cut/fade detection and video summarization , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[27]  Shaohui Mei,et al.  Video summarization via minimum sparse reconstruction , 2015, Pattern Recognit..

[28]  Jiebo Luo,et al.  Towards Scalable Summarization of Consumer Videos Via Sparse Dictionary Selection , 2012, IEEE Transactions on Multimedia.

[29]  Li Zhao,et al.  Key-frame extraction and shot retrieval using nearest feature line (NFL) , 2000, MULTIMEDIA '00.

[30]  Bin Zhao,et al.  Quasi Real-Time Summarization for Consumer Videos , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[31]  Wolfgang Effelsberg,et al.  Video abstracting , 1997, CACM.

[32]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[33]  Costas Panagiotakis,et al.  Video Synopsis Based on a Sequential Distortion Minimization Method , 2013, CAIP.

[34]  Luc Van Gool,et al.  Video summarization by learning submodular mixtures of objectives , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[35]  Shaogang Gong,et al.  Video Synopsis by Heterogeneous Multi-source Correlation , 2013, 2013 IEEE International Conference on Computer Vision.

[36]  P. R. Deshmukh,et al.  Keyframe Based Video Summarization Using Automatic Threshold & Edge Matching Rate , 2012 .

[37]  Mateu Sbert,et al.  Tsallis entropy-based information measures for shot boundary detection and keyframe selection , 2013, Signal Image Video Process..

[38]  Chih-Jen Lin,et al.  Large-Scale Video Summarization Using Web-Image Priors , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[39]  Luc Van Gool,et al.  Creating Summaries from User Videos , 2014, ECCV.