Selecting Video Key Frames Based on Relative Entropy and the Extreme Studentized Deviate Test

This paper studies relative entropy and its square root as distance measures between neighboring video frames for key frame extraction. We develop a novel approach that handles both common and wavelet video sequences, in which the extreme Studentized deviate test is used to identify shot boundaries and thereby segment a video sequence into shots. Each shot is then divided into sub-shots according to whether the change in video content is large, and key frames are extracted from the sub-shots. The proposed technique is general, effective, and efficient, and is designed to handle video sequences of any kind. It can also optionally provide multiscale summarizations of the video data, balancing greater detail against lower redundancy. Extensive experiments show very encouraging results in key frame extraction, in terms of both objective evaluation metrics and subjective visual perception.
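
The two ingredients named above, a relative-entropy distance between neighboring frames and an extreme Studentized deviate (ESD) statistic for flagging unusually large distances, can be illustrated with a minimal Python sketch. It is not the paper's implementation. It assumes each frame is summarized by an intensity histogram, adds a small smoothing constant `eps` to avoid division by zero, and uses hypothetical function names throughout. It computes only the ESD test statistics; comparing them with critical values (which needs t-distribution quantiles) is left out.

```python
import math


def normalize(hist, eps=1e-10):
    """Convert raw bin counts to a probability distribution.

    eps is an assumed smoothing constant that keeps every bin positive.
    """
    smoothed = [h + eps for h in hist]
    total = sum(smoothed)
    return [s / total for s in smoothed]


def relative_entropy(p, q):
    """Relative entropy (Kullback-Leibler divergence) D(p || q) in bits."""
    return sum(pi * math.log2(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def frame_distances(histograms, use_sqrt=False):
    """Distance between each pair of neighboring frame histograms."""
    dists = []
    for prev, curr in zip(histograms, histograms[1:]):
        d = relative_entropy(normalize(prev), normalize(curr))
        d = max(d, 0.0)  # guard against tiny negative round-off
        dists.append(math.sqrt(d) if use_sqrt else d)
    return dists


def esd_statistics(values, max_outliers):
    """Return [(index, R_i)] for up to max_outliers candidate outliers.

    At each step the value farthest from the sample mean is removed and
    R_i = |x - mean| / s is recorded (s is the sample standard deviation).
    Deciding which candidates are true outliers requires comparing R_i
    with critical values, which this sketch does not compute.
    """
    remaining = list(enumerate(values))
    stats = []
    for _ in range(max_outliers):
        if len(remaining) < 3:
            break
        xs = [v for _, v in remaining]
        mean = sum(xs) / len(xs)
        sd = math.sqrt(sum((x - mean) ** 2 for x in xs) / (len(xs) - 1))
        if sd == 0:
            break
        pos = max(range(len(xs)), key=lambda k: abs(xs[k] - mean))
        idx, val = remaining.pop(pos)
        stats.append((idx, abs(val - mean) / sd))
    return stats


# Toy histograms (made-up data): the content changes abruptly
# between the third and fourth frames.
toy_histograms = [[8, 1, 1], [8, 1, 1], [7, 2, 1], [1, 1, 8], [1, 2, 7], [1, 1, 8]]
toy_distances = frame_distances(toy_histograms)
toy_candidates = esd_statistics(toy_distances, max_outliers=2)
```

In this toy input the first candidate returned by `esd_statistics` is the distance at index 2, which corresponds to the abrupt change between the third and fourth frames. Passing `use_sqrt=True` switches to the square-root variant of the distance.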
