Enhancement of perceptual quality in static video summarization using minimal spanning tree approach

A video summarization technique is proposed in this work using minimal spanning tree (MST) of data points. The data points correspond to image frames of a shot in the video which is to be summarized. Correlation is chosen as a similarity metric for computing the edge weights of the MST. The representative frames for each shot are chosen by computing the density of each data point. A novel method for redundancy reduction is devised using SURF and GIST. The redundant frames are eliminated for concise representation of the video. The degree of reduction achieved by using the two approaches is also presented. The proposed method is assessed for perceptual quality against manual summarization. High values of precision and recall endorse the efficacy of the method. The system works well for different kinds of video without a priori knowledge about the type or content of it. Two datasets are considered for experimentation, one comprises short videos while the other consists of long videos. The method is found to provide satisfactory results for both the datasets.

[1]  Siddhartha Bhattacharyya,et al.  Towards redundancy reduction in storyboard representation for static video summarization , 2014, 2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI).

[2]  Siddhartha Bhattacharyya,et al.  Video Shot Segmentation Using Spatio-temporal Fuzzy Hostility Index and Automatic Threshold , 2014, 2014 Fourth International Conference on Communication Systems and Network Technologies.

[3]  Guillermo Cámara Chávez,et al.  A New Method for Static Video Summarization Using Local Descriptors and Video Temporal Segmentation , 2013, 2013 XXVI Conference on Graphics, Patterns and Images.

[4]  Sung Wook Baik,et al.  Adaptive key frame extraction for video summarization using an aggregation mechanism , 2012, J. Vis. Commun. Image Represent..

[5]  S. Padmakala,et al.  An effective content based video retrieval utilizing texture, color and optimal key frame features , 2011, 2011 International Conference on Image Information Processing.

[6]  Jan-Michael Frahm,et al.  Modeling and Recognition of Landmark Image Collections Using Iconic Scene Graphs , 2008, International Journal of Computer Vision.

[7]  Xu Chen,et al.  Motion Trajectory-Based Video Retrieval, Classification, and Summarization , 2010, Video Search and Mining.

[8]  D. Schonfeld,et al.  Content based Video Retrieval , Classification and Summarization : The State-ofthe-Art and the Future , 2009 .

[9]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[10]  Antonio Torralba,et al.  Small codes and large image databases for recognition , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[12]  George Economou,et al.  Key frame extraction in video sequences: a vantage points approach , 2007, 2007 IEEE 9th Workshop on Multimedia Signal Processing.

[13]  Alexei A. Efros,et al.  Scene completion using millions of photographs , 2007, SIGGRAPH 2007.

[14]  Nathalie Guyader,et al.  Video Summarization Based on Camera Motion and a Subjective Evaluation Method , 2007, EURASIP J. Image Video Process..

[15]  Dipti Prasad Mukherjee,et al.  Key Frame Estimation in Video Using Randomness Measure of Feature Point Pattern , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Ting Wang,et al.  An Approach to Video Key-frame Extraction Based on Rough Set , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[17]  Ba Tu Truong,et al.  Video abstraction: A systematic review and classification , 2007, TOMCCAP.

[18]  Kin-Man Lam,et al.  A new key frame representation for video segment retrieval , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Chong-Wah Ngo,et al.  Video summarization and scene detection by graph modeling , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Cordelia Schmid,et al.  A performance evaluation of local descriptors , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  G LoweDavid,et al.  Distinctive Image Features from Scale-Invariant Keypoints , 2004 .

[22]  Patrick Gros,et al.  A Geometrical Key-Frame Selection Method Exploiting Dominant Motion Estimation in Video , 2004, CIVR.

[23]  Michael R. Lyu,et al.  Video summarization by video structure analysis and graph optimization , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[24]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[25]  R. Narasimha,et al.  Key frame extraction using MPEG-7 motion descriptors , 2003, The Thrity-Seventh Asilomar Conference on Signals, Systems & Computers, 2003.

[26]  HongJiang Zhang,et al.  A model of motion attention for video skimming , 2002, Proceedings. International Conference on Image Processing.

[27]  HongJiang Zhang,et al.  A user attention model for video summarization , 2002, MULTIMEDIA '02.

[28]  Xin Liu,et al.  Video summarization using singular value decomposition , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[29]  Rainer Lienhart Dynamic video summarization of home video , 1999, Electronic Imaging.

[30]  Sang Uk Lee,et al.  Efficient video indexing scheme for content-based retrieval , 1999, IEEE Trans. Circuits Syst. Video Technol..

[31]  Alan Hanjalic,et al.  An integrated scheme for automated video abstraction based on unsupervised cluster-validity analysis , 1999, IEEE Trans. Circuits Syst. Video Technol..

[32]  Jeho Nam,et al.  Dynamic video summarization and visualization , 1999, MULTIMEDIA '99.

[33]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[34]  David S. Doermann,et al.  Video summarization by curve simplification , 1998, MULTIMEDIA '98.

[35]  Nuno Vasconcelos,et al.  A spatiotemporal motion model for video summarization , 1998, Proceedings. 1998 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.98CB36231).

[36]  Boon-Lock Yeo,et al.  Video visualization for compact presentation and fast browsing of pictorial content , 1997, IEEE Trans. Circuits Syst. Video Technol..

[37]  Michael A. Smith,et al.  Video skimming and characterization through the combination of image and language understanding techniques , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.