Shot Boundary Detection In The Framework of Rough Indexing Paradigm

This paper presents the Shot Boundary Detection system developed by LaBRI in the context of “Rough Indexing” paradigm. We work on compressed streams and we use only I and P frames information, (DC coefficients of I-Frames, motion vectors of PFrames and DC coefficients of prediction error) which allow us to be faster than many equivalent systems (10 times faster than real-time on TRECVID2003 test set, and 3 times faster on 2004, because MPEG files structure is composed of only I and P frames). In this context the application was not developed to classify shot change transition effects, the initial goal was to allow a real-time and intelligent browsing in video content for common users. The detection is performed in two stages: - Robust Global Camera Motion Estimation - Detection of P-Frame peaks (computation of motion and frame statistics), and of I-Frames (measuring similarity on successive compensated I frames). As we work with two types of frames (I and P), we associate two statistical models which give us two sets of ratio and threshold to calibrate the detector. The first TRECVID participation of LaBRI implies an evolution of the application for transitions effects distinction, which induces two new thresholds to calibrate. We generally obtain equivalent values of Recall and Precision (0.72 on TRECVID 2003 test set). On TRECVID 2004 test set we obtain as best runs ri-3: 0.723(Recall) and 0.606(Precision); and ri-4: 0.703(Recall) and 0.635(Precision).

[1]  Keiichiro Hoashi,et al.  Shot Boundary Determination on MPEC Compressed Domain and Story Segmentation Experiments for TRECVID 2003 , 2003, TRECVID.

[2]  Denyse Baillargeon,et al.  Bibliographie , 1929 .

[3]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[4]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[5]  Lie Lu,et al.  MSR-Asia at TREC-11 Video Track , 2002 .

[6]  Rainer Lienhart,et al.  Reliable Transition Detection in Videos: A Survey and Practitioner's Guide , 2001, Int. J. Image Graph..

[7]  Jenny Benois-Pineau,et al.  Extraction of foreground objects from an MPEG2 video stream in rough-indexing framework , 2003, IS&T/SPIE Electronic Imaging.

[8]  Jenny Benois-Pineau,et al.  Scene similarity measure for video content segmentation in the framework of a rough indexing paradigm , 2006, Int. J. Intell. Syst..

[9]  Arding Hsu,et al.  Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.

[10]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, J. Electronic Imaging.

[11]  Donald A. Adjeroh,et al.  A principled approach to fast partitioning of uncompressed video , 1996, Proceedings of International Workshop on Multimedia Database Management Systems.

[12]  Alan Hanjalic,et al.  Shot-boundary detection: unraveled and resolved? , 2002, IEEE Trans. Circuits Syst. Video Technol..

[13]  Georges Quénot,et al.  CLIPS at TRECVID : Shot Boundary Detection and Feature Detection , 2003, TRECVID.