Key frames extraction using graph modularity clustering for efficient video summarization

Keyframe extraction is one of the basic procedures relating to video retrieval and summary. It consists on presenting an abstract of the video with the most representative frames. This paper presents an efficient keyframe extraction approach based on local description and graph modularity clustering. The first step is to generate a set of candidate keyframes using a windowing rule in order to reduce the data to be examined. After that, detect interest points in these set of images. Then compute repeatability between each two images belonging to the candidate set and stocks these values in a matrix that we called repeatability matrix. Finally, the repeatability matrix is modelled by an oriented graph and we will select keyframes using graph modularity clustering principle. The experiments showed that this method succeeds in extracting keyframes while preserving the salient content of the video. Further, we found good values in term of precision, PSNR and compression rate.

[1]  Hong Chen,et al.  Multi-video summarization using complex graph clustering and mining , 2010, Comput. Sci. Inf. Syst..

[2]  Ezzeddine Zagrouba,et al.  A Novel Key Frame Extraction Approach for Video Summarization , 2016, VISIGRAPP.

[3]  Chong-Wah Ngo,et al.  Automatic video summarization by graph modeling , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[4]  George Economou,et al.  Combining graph connectivity & dominant set clustering for video summarization , 2009, Multimedia Tools and Applications.

[5]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[6]  Sung Wook Baik,et al.  Adaptive key frame extraction for video summarization using an aggregation mechanism , 2012, J. Vis. Commun. Image Represent..

[7]  Mateu Sbert,et al.  Key frame selection based on Jensen-Rényi divergence , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[8]  Yang Yi,et al.  Key frame extraction based on visual attention model , 2012, J. Vis. Commun. Image Represent..

[9]  Cordelia Schmid,et al.  Evaluation of Interest Point Detectors , 2000, International Journal of Computer Vision.

[10]  Mateu Sbert,et al.  Browsing and exploration of video sequences: A new scheme for key frame extraction and 3D visualization using entropy based Jensen divergence , 2014, Inf. Sci..

[11]  Mrityunjay Kumar,et al.  Key frame extraction from consumer videos using sparse representation , 2011, 2011 18th IEEE International Conference on Image Processing.

[12]  Luc Van Gool,et al.  Speeded-Up Robust Features (SURF) , 2008, Comput. Vis. Image Underst..

[13]  Satu Elisa Schaeffer,et al.  Graph Clustering , 2017, Encyclopedia of Machine Learning and Data Mining.

[14]  Marco Pellegrini,et al.  STIMO: STIll and MOving video storyboard for the web scenario , 2009, Multimedia Tools and Applications.

[15]  W. Sabbar,et al.  Video scene segmentation using the shot transition detection by local characterization of the points of interest , 2012, 2012 6th International Conference on Sciences of Electronics, Technologies of Information and Telecommunications (SETIT).

[16]  Xiao Liu,et al.  Joint shot boundary detection and key frame extraction , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[17]  Ruxandra Tapu,et al.  A complete framework for temporal video segmentation , 2011, 2011 IEEE International Conference on Consumer Electronics -Berlin (ICCE-Berlin).