Video Content Detection with Single Frame Level Accuracy Using Dynamic Thresholding Technique

This paper proposes a video retrieval method that detects frame sections that correspond to shots in a query (video segment) with single frame level accuracy. The method adopts the coarse-to-fine strategy to decrease the processing time and the memory consumption, dynamic threshold with initial ranges for small segments is proposed to detect the exact beginning and end of each corresponding frame section to each shot in a query. Experiments on real videos show that our method can achieve accurate video detection with exact frame position while reducing processing time and memory consumption.

[1]  Kota Iwamoto,et al.  Visual duplicate based topic linking using a robust video signature , 2013, 2013 IEEE International Conference on Consumer Electronics (ICCE).

[2]  Li Li,et al.  A Survey on Visual Content-Based Video Indexing and Retrieval , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[3]  Nobuyuki Yagi,et al.  Shot Boundary Detection at TRECVID 2007 , 2007, TRECVID.

[4]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[5]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[6]  Vasumathi Narayanan,et al.  A Survey of Content-Based Video Retrieval , 2008 .

[7]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[8]  Chong-Wah Ngo,et al.  A robust dissolve detector by support vector machine , 2003, ACM Multimedia.