Key frame extraction for video summarization using local description and repeatability graph clustering

Abstract Due to video data exponential growth supported by quick advances in multimedia technology, it became difficult for the user to retrieve information from a large videos collection. Key frame extraction consists in providing an abstract of the entire video, containing the most informative frames. In this paper, we present an efficient key frame extraction method. This method is based on local interest points description and repeatability graph clustering via approaching modularity value. To minimize the data to be treated, the process will be applied only on a set candidate frames selected with a windowing rule. Indeed, the first step consists in extracting interest points on a set of candidate frames. After that, we compute repeatability values between each two frames from the candidate set. These values are represented by a repeatability direct graph. The selection of key frames is performed using graph clustering by approaching modularity principle. The experiments performed showed that the proposed method succeeds in extracting key frames that preserve the relevant video content without redundancy.

[1]  Ezzeddine Zagrouba,et al.  On-the-fly Extraction of Key Frames for Efficient Video Summarization , 2013 .

[2]  W. Sabbar,et al.  Video summarization using shot segmentation and local motion estimation , 2012, Second International Conference on the Innovative Computing Technology (INTECH 2012).

[3]  Yelena Yesha,et al.  Keyframe-based video summarization using Delaunay clustering , 2006, International Journal on Digital Libraries.

[4]  Ezzeddine Zagrouba,et al.  Key Frames Extraction Based on Local Features for Efficient Video Summarization , 2016, ACIVS.

[5]  Weisi Lin,et al.  Scene-Based Movie Summarization Via Role-Community Networks , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Marco Pellegrini,et al.  STIMO: STIll and MOving video storyboard for the web scenario , 2009, Multimedia Tools and Applications.

[7]  Mohammed Javed,et al.  An efficient method for video shot boundary detection and keyframe extraction using SIFT-point distribution histogram , 2016, International Journal of Multimedia Information Retrieval.

[8]  Xiao Liu,et al.  Joint shot boundary detection and key frame extraction , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[9]  E. Asadi,et al.  Video summarization using fuzzy c-means clustering , 2012, 20th Iranian Conference on Electrical Engineering (ICEE2012).

[10]  Ezzeddine Zagrouba,et al.  A Novel Key Frame Extraction Approach for Video Summarization , 2016, VISIGRAPP.

[11]  JinLong Li,et al.  Video shot segmentation and key frame extraction based on SIFT feature , 2012, 2012 International Conference on Image Analysis and Signal Processing.

[12]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[13]  Rudinei Goularte,et al.  KS-SIFT: A Keyframe Extraction Method Based on Local Features , 2014, 2014 IEEE International Symposium on Multimedia.

[14]  Mona Omidyeganeh,et al.  Group-based spatio-temporal video analysis and abstraction using wavelet parameters , 2013, Signal Image Video Process..

[15]  Arnaldo de Albuquerque Araújo,et al.  VSUMM: A mechanism designed to produce static video summaries and a novel evaluation method , 2011, Pattern Recognit. Lett..

[16]  Hana Gharbi,et al.  Robust interest points matching based on local description and spatial constraints , 2015, International Conference on Graphic and Image Processing.

[17]  Pierre Hansen,et al.  Improving heuristics for network modularity maximization using an exact algorithm , 2011, Discret. Appl. Math..

[18]  Boleslaw K. Szymanski,et al.  Community Detection via Maximization of Modularity and Its Variants , 2014, IEEE Transactions on Computational Social Systems.

[19]  Mohamed Haykel Boukadida,et al.  Création automatique de résumés vidéo par programmation par contraintes. (Automatic video summarization using constraint satisfaction programming) , 2015 .

[20]  Luc Van Gool,et al.  Creating Summaries from User Videos , 2014, ECCV.

[21]  Rishi Kumar,et al.  Techniques for key frame extraction: Shot segmentation and feature trajectory computation , 2016, 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence).

[22]  Shaogang Gong,et al.  Video Synopsis by Heterogeneous Multi-source Correlation , 2013, 2013 IEEE International Conference on Computer Vision.

[23]  Yu Huang,et al.  Video summarization with semantic concept preservation , 2011, MUM.

[24]  Thomas S. Huang,et al.  Relevance feedback techniques in interactive content-based image retrieval , 1997, Electronic Imaging.

[25]  Muhammad Shakir,et al.  Video Summarization: Techniques and Classification , 2012, ICCVG.

[26]  Alex Pentland,et al.  Video and Image Semantics: Advanced Tools for Telecommunications , 1994, IEEE Multim..

[27]  Xiang Li,et al.  Network Clustering via Maximizing Modularity: Approximation Algorithms and Theoretical Limits , 2015, 2015 IEEE International Conference on Data Mining.

[28]  Mateu Sbert,et al.  Tsallis entropy-based information measures for shot boundary detection and keyframe selection , 2013, Signal Image Video Process..

[29]  Hélio Pedrini,et al.  VISCOM: A robust video summarization approach using color co-occurrence matrices , 2016, Multimedia Tools and Applications.

[30]  Noémi Gaskó,et al.  Mixing Network Extremal Optimization for Community Structure Detection , 2015, EvoCOP.

[31]  Yoshinobu Tonomura,et al.  VideoMAP and VideoSpaceIcon: tools for anatomizing video content , 1993, INTERCHI.

[32]  Ruxandra Tapu,et al.  A complete framework for temporal video segmentation , 2011, 2011 IEEE International Conference on Consumer Electronics -Berlin (ICCE-Berlin).