Multi-View Video Summarization

Previous video summarization studies have focused on monocular videos, and applying them directly to multi-view videos gives poor results because of problems such as redundancy across views. In this paper, we present a method for summarizing multi-view videos. We construct a spatio-temporal shot graph and formulate the summarization problem as a graph labeling task. The spatio-temporal shot graph is derived from a hypergraph whose hyperedges encode correlations among multi-view video shots with respect to different attributes. We then partition the shot graph and identify clusters of event-centered shots with similar content via random walks. The summary is generated by solving a multi-objective optimization problem based on shot importance, which is evaluated using a Gaussian entropy fusion scheme. Different summarization objectives, such as minimum summary length and maximum information coverage, can be accommodated within the framework. Moreover, multi-level summarization can be achieved easily by configuring the optimization parameters. We also propose the multi-view storyboard and the event board for presenting multi-view summaries. The storyboard naturally reflects correlations among summarized multi-view shots that describe the same important event. The event board assembles event-centered multi-view shots sequentially in temporal order. A single video summary, which facilitates quick browsing of the summarized multi-view video, can be easily generated from the event board representation.
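
The abstract does not say how the shot graph is derived from the hypergraph. As a minimal sketch, assuming a standard clique expansion (a generic technique, not necessarily the one used in the paper), each hyperedge can be turned into weighted pairwise links between the shots it contains. The function name, the incidence matrix, and the hyperedge weights below are all hypothetical and chosen only for illustration.

```python
import numpy as np

def hypergraph_to_shot_graph(incidence, edge_weights):
    """Clique expansion (assumed, not taken from the paper).

    incidence[i][e] is 1 if shot i belongs to hyperedge e, else 0.
    Returns a symmetric shot-to-shot affinity matrix
    A = H diag(w) H^T with the diagonal set to zero.
    """
    H = np.asarray(incidence, dtype=float)
    w = np.asarray(edge_weights, dtype=float)
    A = H @ np.diag(w) @ H.T
    np.fill_diagonal(A, 0.0)  # drop self-loops
    return A

# Four hypothetical shots and two hypothetical hyperedges,
# e.g. one per attribute shared by a group of shots.
H = [[1, 0],
     [1, 1],
     [0, 1],
     [0, 1]]
w = [0.5, 2.0]
A = hypergraph_to_shot_graph(H, w)
print(A)
```

Under this assumption, two shots are linked by the summed weight of every hyperedge they share, so shots correlated on several attributes end up with stronger edges in the resulting graph.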
