Comparison of content selection methods for skimming rushes video

We compare two methods for selecting segments to be included in a video skim, using lists of relevant as well as redundant segments created from different visual features as input. One approach is rule-based, and creates a weighted sum of the input relevances. The other is HMM based, using a model trained on the TRECVID 2007 rushes data. The redundant segments are created from detection of repeated takes and junk content, the selected segments from visual activity and face detection. The results show that the approaches create very short summaries which only contain a part of the relevant information in the video, but reach very high scores in terms of the usability measures non-duplicates, non-junk and pleasant tempo. The HMM based approach contains more information despite shorter duration of the summaries.

[1]  Werner Bailer,et al.  Detecting and Clustering Multiple Takes of One Scene , 2008, MMM.

[2]  Paul Over,et al.  The trecvid 2008 BBC rushes summarization evaluation , 2008, TVS '08.

[3]  Werner Bailer A Comparison of Distance Measures for Clustering Video Sequences , 2008, 2008 19th International Workshop on Database and Expert Systems Applications.

[4]  Dimitrios Gunopulos,et al.  Discovering similar multidimensional trajectories , 2002, Proceedings 18th International Conference on Data Engineering.

[5]  Werner Bailer,et al.  Skimming rushes video using retake detection , 2007, TVS '07.

[6]  Paul A. Viola,et al.  Fast and Robust Classification using Asymmetric AdaBoost and a Detector Cascade , 2001, NIPS.

[7]  Bernard Mérialdo,et al.  A collaborative approach to automatic rushes video summarization , 2008, 2008 15th IEEE International Conference on Image Processing.