Unsupervised Video Summaries Using Multiple Features and Image Quality

It is important to generate both interesting and representative video summary for massive videos. This work proposes a new method to generate dynamic video summary using multiple features and image quality without human's involvement in the whole procedure. Specifically, we first split a video into several video clips. Second, a set of features including visual attention, exposure of light, saturation, hue, rule of thirds, contrast and directionality is computed and the qualities of video clips are also estimated. Then, the importance of each video clip is obtained based on these features and the estimated quality. Finally, based on the importance value, we sort clip values in descending order and select an optimal subset to generate video summary. Experimental results demonstrate that the proposed method enables to generate high-quality video summary.

[1]  Hideyuki Tamura,et al.  Textural Features Corresponding to Visual Perception , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  Delbert Dueck,et al.  Clustering by Passing Messages Between Data Points , 2007, Science.

[3]  Arnaldo de Albuquerque Araújo,et al.  VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method , 2011, Pattern Recognit. Lett..

[4]  Wayne H. Wolf,et al.  Key frame selection by motion analysis , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[5]  Ali Farhadi,et al.  Ranking Domain-Specific Highlights by Analyzing Edited Videos , 2014, ECCV.

[6]  Chinh T. Dang,et al.  Key frame extraction from consumer videos using epitome , 2012, 2012 19th IEEE International Conference on Image Processing.

[7]  W. Chu Studying Aesthetics in Photographic Images Using a Computational Approach , 2013 .

[8]  Adrian Ulges,et al.  Keyframe Extraction for Video Tagging & Summarization , 2008, Informatiktage.

[9]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Yong Jae Lee,et al.  Discovering important people and objects for egocentric video summarization , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Remco C. Veltkamp Multimedia Algorithmics , 2005, Multimedia Tools and Applications.

[12]  Xin Liu,et al.  Video summarization using singular value decomposition , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[13]  Hao Tang,et al.  Detecting highlights in sports videos: Cricket as a test case , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[14]  Sung Wook Baik,et al.  Efficient visual attention based framework for extracting key frames from videos , 2013, Signal Process. Image Commun..

[15]  Patricia Ladret,et al.  The blur effect: perception and estimation with a new no-reference perceptual blur metric , 2007, Electronic Imaging.

[16]  Luc Van Gool,et al.  Creating Summaries from User Videos , 2014, ECCV.

[17]  Joseph V. Mascelli The five C's of cinematography : motion picture filming techniques simplified , 1965 .