Unsupervised video summarization via clustering validity index

Although many prior works address the representative selection problem in video summarization, the main difficulty remains determining the optimal number of representatives for raw, unannotated videos. In this paper, we propose an unsupervised video summarization method that combines motion-based frame selection with a novel clustering validity index to determine the optimal representatives of the original video. The proposed framework segments shots, selects candidate frames by evaluating their forward and backward motion, and automatically selects representatives that highlight all the significant visual properties. Shots are segmented uniformly, and the frame with the largest motion in each segment is extracted to form the candidate frame subset. Affinity Propagation, combined with the validity index, then automatically selects the optimal representatives from the candidate frame subset. Experimental results on several benchmark datasets demonstrate the robustness and effectiveness of the proposed method.
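The candidate-selection step can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define the motion measure, so the sketch assumes that a frame's motion is the mean absolute pixel difference to its previous frame (backward) plus that to its next frame (forward). The names `frame_diff`, `motion_scores`, `select_candidates`, and the parameter `segment_len` are hypothetical.

```python
from typing import List, Sequence

# Assumption: a frame is a flattened sequence of grayscale pixel values.
Frame = Sequence[float]


def frame_diff(a: Frame, b: Frame) -> float:
    """Mean absolute pixel difference between two equally sized frames."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def motion_scores(frames: List[Frame]) -> List[float]:
    """Score each frame as backward motion plus forward motion.

    Assumed measure (not from the paper): difference to the previous
    frame plus difference to the next frame. The first and last frames
    only have one neighbour, so the missing term is taken as 0.
    """
    n = len(frames)
    scores = []
    for i in range(n):
        backward = frame_diff(frames[i], frames[i - 1]) if i > 0 else 0.0
        forward = frame_diff(frames[i], frames[i + 1]) if i < n - 1 else 0.0
        scores.append(backward + forward)
    return scores


def select_candidates(frames: List[Frame], segment_len: int) -> List[int]:
    """Split the video into uniform segments of `segment_len` frames and
    return the index of the highest-motion frame in each segment."""
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    scores = motion_scores(frames)
    candidates = []
    for start in range(0, len(frames), segment_len):
        end = min(start + segment_len, len(frames))
        # On ties, max() keeps the earliest frame in the segment.
        best = max(range(start, end), key=lambda i: scores[i])
        candidates.append(best)
    return candidates


# Toy video: six frames of two pixels each.
toy = [[0, 0], [1, 1], [6, 6], [6, 6], [7, 7], [10, 10]]
print(select_candidates(toy, segment_len=3))  # prints [1, 4]
```

In the toy video, frame 1 sits next to the largest change in the first segment and frame 4 in the second, so these two indices form the candidate subset that a clustering stage would then reduce to the final representatives.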
