Automatic detection of repetitive actions in a video

In this paper we propose a method to locate inloop repetitions in a video. An in-loop repetition consists in repeating the same action(s) many times consecutively. The proposed method adapts and uses the auto-correlation method YIN, originally proposed to find the fundamental frequency in audio signals. Based on this technique, we propose a method that generates a matrix where repetitions correspond to triangle-shaped zones of low values in this matrix (we called YIN-Matrix). Locating these triangles leads to locate video segments that enclose a repetition as well as to extract their parameters. In order to evaluate our method, we used a standard evaluation method that shows the error rates compared to ground-truth information. According to this evaluation method, our method shows promising results that nominate it to form a solid base for future works.

[1]  John M. Gauch,et al.  Identification of new commercials using repeated video sequence detection , 2005, IEEE International Conference on Image Processing 2005.

[2]  Ning Hu,et al.  Pattern Discovery Techniques for Music Audio , 2002, ISMIR.

[3]  Derek Hoiem,et al.  Computer vision for music identification , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[4]  Li Chen,et al.  Video copy detection: a comparative study , 2007, CIVR '07.

[5]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[6]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[7]  Shumeet Baluja,et al.  Advertisement Detection and Replacement using Acoustic and Visual Repetition , 2006, 2006 IEEE Workshop on Multimedia Signal Processing.

[8]  Xu Xiao Recognition of human actions , 2010 .

[9]  Z. Meral Özsoyoglu,et al.  Indexing large metric spaces for similarity search queries , 1999, TODS.

[10]  Patrick Pérez,et al.  Periodic motion detection and segmentation via approximate sequence alignment , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[11]  Pinar Duygulu Sahin,et al.  Comparison and combination of two novel commercial detection methods , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[12]  Jonathan Foote,et al.  Visualizing Musical Structure and Rhythm via Self-Similarity , 2001, ICMC.