A unified model for techniques on video-shot transition detection

A first step required to allow video indexing and retrieval of visual data is to perform a temporal segmentation, that is, to find the location of camera-shot transitions, which can be either abrupt (i.e., cuts) or gradual (e.g., fades, dissolves, wipes). After a critical review of most approaches seeking to solve this problem, we propose a unified detection model (both for abrupt and all types of gradual transitions), as well as an implementation whose results improve upon those of all the inspected reports. The innovation of the approach presented here is centered on mapping the space of inter-frame distances onto a new space of decision better suited to achieving a sequence-independent thresholding. This mapping aims to consider frame ordering information within the thresholding process; it is based on the parametric modeling of the patterns that transitions generate on the distances' output. As opposed to most reviewed works, our results are detailed over a large and representative sample of more than 1500 cuts and 250 gradual transitions, which make up a significant part (200 min) of the MPEG-7 testing material; this ensures a high degree of confidence in the validity of our approach.

[1]  Hyeokman Kim,et al.  Detection of gradual scene changes for parsing of video data , 1997, Electronic Imaging.

[2]  Ralph M. Ford,et al.  Metrics for shot boundary detection in digital video sequences , 2000, Multimedia Systems.

[3]  Jinho Lee,et al.  An efficient graphical shot verifier incorporating visual rhythm , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[4]  Gozde Bozdagi Akar,et al.  Feature-based hierarchical video segmentation , 1997, Proceedings of International Conference on Image Processing.

[5]  Akio Nagasaka,et al.  Automatic Video Indexing and Full-Video Search for Object Appearances , 1991, VDB.

[6]  Wei Xiong,et al.  Efficient Scene Change Detection and Camera Motion Annotation for Video Classification , 1998, Comput. Vis. Image Underst..

[7]  Howard D. Wactlar,et al.  Automated video indexing of very large video libraries , 1997 .

[8]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying scene breaks , 1995, MULTIMEDIA '95.

[9]  Ramesh C. Jain,et al.  Production model based digital video segmentation , 1995, Multimedia Tools and Applications.

[10]  A. Murat Tekalp,et al.  A high-performance shot boundary detection algorithm using multiple cues , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[11]  Boon-Lock Yeo,et al.  A unified approach to temporal segmentation of motion JPEG and MPEG compressed video , 1995, Proceedings of the International Conference on Multimedia Computing and Systems.

[12]  Edoardo Ardizzone,et al.  Video indexing using optical flow field , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[13]  Ramesh C. Jain,et al.  Dynamic vision , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[14]  Patrick Bouthemy,et al.  A unified approach to shot change detection and camera motion characterization , 1999, IEEE Trans. Circuits Syst. Video Technol..

[15]  Alan Hanjalic,et al.  Optimal shot boundary detection based on robust statistical models , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[16]  Fernando Pereira,et al.  MPEG-7 the generic multimedia content description standard, part 1 - Multimedia, IEEE , 2001 .

[17]  Paul England,et al.  Comparison of automatic video segmentation algorithms , 1996, Other Conferences.

[18]  Arding Hsu,et al.  Image processing on encoded video sequences , 1994, Multimedia Systems.

[19]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[20]  Yoshinobu Tonomura,et al.  Projection Detecting Filter for Video Cut Detection , 1993, ACM Multimedia.

[21]  Yoshinobu Tonomura,et al.  Video tomography: an efficient method for camerawork extraction and motion analysis , 1994, MULTIMEDIA '94.

[22]  José Manuel Menéndez,et al.  Multidimensional comparison of shot detection algorithms , 2002, Proceedings. International Conference on Image Processing.

[23]  Behzad Shahraray,et al.  Scene change detection and content-based sampling of video sequences , 1995, Electronic Imaging.

[24]  Hong Heather Yu,et al.  Multiresolution video segmentation using wavelet transformation , 1997, Electronic Imaging.

[25]  William J. Christmas,et al.  Combining multiple experts for classifying shot changes in video sequences , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[26]  José Manuel Menéndez,et al.  Gradual shot transition detection based on multidimensional clustering , 2000, 4th IEEE Southwest Symposium on Image Analysis and Interpretation.

[27]  Stephen W. Smoliar,et al.  Content-based video browsing tools , 1995, Electronic Imaging.

[28]  Jie Wei,et al.  Illumination-invariant video segmentation by hierarchical robust thresholding , 1997, Electronic Imaging.

[29]  José Manuel Menéndez,et al.  Real time temporal segmentation of MPEG video , 2002, Proceedings. International Conference on Image Processing.

[30]  John S. Boreczky,et al.  Comparison of video shot boundary detection techniques , 1996, Electronic Imaging.

[31]  Chong-Wah Ngo,et al.  Camera break detection by partitioning of 2D spatio-temporal images in MPEG domain , 1999, Proceedings IEEE International Conference on Multimedia Computing and Systems.

[32]  Thomas D. C. Little,et al.  A digital on-demand video service supporting content-based queries , 1993, MULTIMEDIA '93.

[33]  Irena Koprinska,et al.  Temporal video segmentation: A survey , 2001, Signal Process. Image Commun..

[34]  Takafumi Miyatake,et al.  IMPACT: an interactive natural-motion-picture dedicated multimedia authoring system , 1991, CHI.

[35]  José Manuel Menéndez,et al.  A unified approach to gradual shot transition detection , 2000, Proceedings 2000 International Conference on Image Processing (Cat. No.00CH37101).