Automatic scene/DVD chapter extraction in Hollywoodian movies

In this paper we tackle the issue of video scene/DVD chapter segmentation. We first review and analyze the most salient methods in the technical literature. We then propose a novel methodological framework for high-level temporal structuring and segmentation of video that extracts scenes/DVD chapters based on temporally constrained clustering, adaptive temporal lengths, neutralized shots and an adaptive thresholding mechanism. The output of our method is a structured video that facilitates user access to different parts of the image sequence. To validate the proposed technique, we use as low-level visual features the interest points extracted with the SURF descriptor. The experimental evaluation supports the proposed approach, returning an average F1 score of 82%.
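For readers unfamiliar with the reported measure, the following is a minimal sketch of how an F1 score can be computed for detected scene boundaries. It is not the evaluation protocol of the paper, which the abstract does not specify. The function name `f1_for_boundaries`, the representation of boundaries as shot indices, and the `tolerance` parameter are assumptions made for illustration only.

```python
def f1_for_boundaries(detected, reference, tolerance=0):
    """Return (precision, recall, F1) for detected scene boundaries.

    Assumption: boundaries are integer shot indices, and a detected
    boundary counts as correct if it lies within `tolerance` shots of a
    reference boundary that has not already been matched.
    """
    unmatched = list(reference)
    true_positives = 0
    for boundary in sorted(detected):
        # Greedily take the first unmatched reference boundary in range.
        match = next(
            (r for r in unmatched if abs(r - boundary) <= tolerance), None
        )
        if match is not None:
            unmatched.remove(match)
            true_positives += 1

    precision = true_positives / len(detected) if detected else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical boundaries, not data from the paper.
    detected = [10, 25, 40, 58]
    reference = [10, 24, 40, 60, 75]
    print(f1_for_boundaries(detected, reference, tolerance=1))
```

In this hypothetical example three of the four detected boundaries match a reference boundary, so precision is 0.75, recall is 0.6, and F1 is their harmonic mean.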
