Commercial mining basedon temporal recurrence hashing algorithm and bag-of-fingerprints model

We propose two novel algorithms for fully-unsupervised, super-fast, and cross-channel TV commercial mining in this paper. The tasks involved in the process include: 1) mining commercial clusters from streams of individual channels, and 2) grouping identical commercial clusters across multiple channels. The first process is achieved with a dual-stage hashing algorithm, which searches for recurring short segments by hashing frames, and it assembles these short segments into sets of commercial sequences by hashing temporal recurrences. The algorithm mined commercials from a one-month stream in less than 42 minutes, which was ten times faster than that in related studies. A new bag-of-fingerprints model is proposed for the second process to encode the temporal clues of local fingerprints. The model is abundantly robust against framing and fingerprinting errors in recurring sequences, and discovers false matches of local fingerprints. A five-month database was used for comprehensively demonstrating the effectiveness and efficiency of the model.

[1]  Lie Lu,et al.  Robust learning-based TV commercial detection , 2005, 2005 IEEE International Conference on Multimedia and Expo.

[2]  Sid-Ahmed Berrani,et al.  A non-supervised approach for repeated sequence detection in TV broadcast streams , 2008, Signal Process. Image Commun..

[3]  John M. Gauch,et al.  Finding and identifying unknown commercials using repeated video sequence detection , 2006, Comput. Vis. Image Underst..

[4]  Changsheng Xu,et al.  Segmentation, categorization, and identification of commercial clips from TV streams using multimodal analysis , 2006, MM '06.

[5]  Michael J. Witbrock,et al.  Story segmentation and detection of commercials in broadcast news video , 1998, Proceedings IEEE International Forum on Research and Technology Advances in Digital Libraries -ADL'98-.

[6]  Jesse S. Jin,et al.  A Confidence Based Recognition System for TV Commercial Extraction , 2008, ADC.

[7]  Cormac Herley,et al.  ARGOS: automatically extracting repeating objects from multimedia streams , 2006, IEEE Transactions on Multimedia.

[8]  Rainer Lienhart,et al.  Mining TV broadcasts for recurring video sequences , 2009, CIVR '09.

[9]  Wolfgang Effelsberg,et al.  On the detection and recognition of television commercials , 1997, Proceedings of IEEE International Conference on Multimedia Computing and Systems.

[10]  Shin'ichi Satoh,et al.  Unified Approach to Detection and Identification of Commercial Films by Temporal Occurrence Pattern , 2010, 2010 20th International Conference on Pattern Recognition.

[11]  Ton Kalker,et al.  A Highly Robust Audio Fingerprinting System , 2002, ISMIR.

[12]  Shin'ichi Satoh,et al.  Commercial film detection and identification based on a dual-stage temporal recurrence hashing algorithm , 2010, VLS-MCMR '10.

[13]  Antonio Albiol,et al.  Detection of TV commercials , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.