Intensity-driven dissolve detection adapted to synthetic video contents

Abstract. We approach the problem around several main directions of video temporal segmentation and propose an intensity-based dissolve detection approach that is able to perform on animated video contents. It uses the hypothesis that during a dissolve the amount of fading-out and fading-in pixels should be significant compared with other visual transitions. We use this information as a visual discontinuity function. Instead of just applying a global threshold to filter these values, as most of the existing approaches do, we use a twin-thresholding approach and the shape analysis of the discontinuity measure. This allows us to reduce false detections caused by steep intensity fluctuations as well as to retrieve dissolves caught up in other visual transitions (e.g., caused by movement, color effects, etc.). Experimental tests conducted on more than 452 dissolve transitions show that whether classic approaches tend to fail, the proposed method is able to provide good performance achieving average precision and recall ratios above 94% and 79.6%, respectively.

[1]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Patrick Lambert,et al.  Improved Cut Detection for the Segmentation of Animation Movies , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[3]  Kuo-Chin Fan,et al.  A motion-tolerant dissolve detection algorithm , 2005, IEEE Transactions on Multimedia.

[4]  Jie Zhao,et al.  Effective Dissolve Detection Based on Accumulating Histogram Difference and the Support Point , 2010, 2010 First International Conference on Pervasive Computing, Signal Processing and Applications.

[5]  Gennaro Percannella,et al.  Automated threshold selection for the detection of dissolves in MPEG video , 2000, 2000 IEEE International Conference on Multimedia and Expo. ICME2000. Proceedings. Latest Advances in the Fast Changing World of Multimedia (Cat. No.00TH8532).

[6]  Rainer Lienhart,et al.  Reliable Transition Detection in Videos: A Survey and Practitioner's Guide , 2001, Int. J. Image Graph..

[7]  Ramin Zabih,et al.  A feature-based algorithm for detecting and classifying production effects , 1999, Multimedia Systems.

[8]  Paul Over,et al.  High-level feature detection from video in TRECVid: a 5-year retrospective of achievements , 2009 .

[9]  Ling Guan,et al.  Enhancement of dissolved shot boundary detection with twin-windows amplification method , 2007 .

[10]  Rainer Lienhart,et al.  Comparison of automatic shot boundary detection algorithms , 1998, Electronic Imaging.

[11]  Marcel Worring,et al.  Multimodal Video Indexing : A Review of the State-ofthe-art , 2001 .

[12]  Wen Gao,et al.  Coarse-to-Fine Dissolve Detection Based on Image Quality Assessment , 2013 .

[13]  Shamik Sural,et al.  Detection of hard cuts and gradual transitions from video using fuzzy logic , 2008, Int. J. Artif. Intell. Soft Comput..

[14]  John S. Boreczky,et al.  A hidden Markov model framework for video segmentation using audio and image features , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[15]  Gary Vondran,et al.  GPU color space conversion , 2011, Electronic Imaging.

[16]  Patrick Lambert,et al.  Dissolve detection in abstract video contents , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[17]  Xinbo Gao,et al.  A Unified Framework for Shot Boundary Detection , 2005, CIS.

[18]  Samuel Moon-Ho Song,et al.  Automatic Dissolve Detection Scheme Based on Visual Rhythm Spectrum , 2005, PCM.

[19]  Ba Tu Truong,et al.  New enhancements to cut, fade, and dissolve detection processes in video segmentation , 2000, ACM Multimedia.

[20]  Patrick Lambert,et al.  A color-action perceptual approach to the classification of animated movies , 2011, ICMR '11.

[21]  Atreyi Kankanhalli,et al.  Automatic partitioning of full-motion video , 1993, Multimedia Systems.

[22]  Yang Yang,et al.  A Motion-Insensitive Dissolve Detection Method with SURF , 2009, 2009 Fifth International Conference on Image and Graphics.

[23]  Patrick Lambert,et al.  Tackling action-based video abstraction of animated movies for video browsing , 2010, J. Electronic Imaging.