Temporal recurrence hashing algorithm for mining commercials from multimedia streams

We propose a dual-stage algorithm for fully-unsupervised and super-fast TV commercial mining in this paper. The two stages involved in process include: 1) searching for recurring short segments, and 2) assembling these short segments into sets of long and complete commercial sequences. The first stage is achieved by frame hashing. Different from the related studies that depend on brute-force pairwise matching, we propose applying a second-stage hashing algorithm for the recurring segment assemblage, which is the key idea in this paper. A large-scale archive containing a 10-hour and a 1-month stream was used for the experimentation. The algorithm mined commercials from the 1-month stream in less than 50 minutes, which was ten times faster than that of related studies, with a 98.05% sequence-level and 97.39% frame-level accuracy. We demonstrate the performance consistency of the algorithm on both audio and video streams, and investigate the computational cost from both the theoretical and experimental viewpoints.

[1]  John M. Gauch,et al.  Finding and identifying unknown commercials using repeated video sequence detection , 2006, Comput. Vis. Image Underst..

[2]  Changsheng Xu,et al.  Segmentation, categorization, and identification of commercial clips from TV streams using multimodal analysis , 2006, MM '06.

[3]  Shin'ichi Satoh,et al.  Commercial film detection and identification based on a dual-stage temporal recurrence hashing algorithm , 2010, VLS-MCMR '10.

[4]  Jesse S. Jin,et al.  A Confidence Based Recognition System for TV Commercial Extraction , 2008, ADC.

[5]  Antonio Albiol,et al.  Detection of TV commercials , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Shin'ichi Satoh,et al.  Unified Approach to Detection and Identification of Commercial Films by Temporal Occurrence Pattern , 2010, 2010 20th International Conference on Pattern Recognition.

[7]  Cormac Herley,et al.  ARGOS: automatically extracting repeating objects from multimedia streams , 2006, IEEE Transactions on Multimedia.

[8]  Sid-Ahmed Berrani,et al.  A non-supervised approach for repeated sequence detection in TV broadcast streams , 2008, Signal Process. Image Commun..

[9]  Rainer Lienhart,et al.  Mining TV broadcasts for recurring video sequences , 2009, CIVR '09.

[10]  Ton Kalker,et al.  A Highly Robust Audio Fingerprinting System , 2002, ISMIR.

[11]  Lie Lu,et al.  Robust learning-based TV commercial detection , 2005, 2005 IEEE International Conference on Multimedia and Expo.