Studying the impact of sequence clustering on near-duplicate video retrieval: an experimental comparison

In this paper, we propose studying the impact of clustering on near-duplicate video (NDV) retrieval. The aim is to reduce the search space at retrieval time through a pre-processing clustering step performed on the dataset off-line and retrieving NDVs based on the formed clusters. Our contribution is a novel clustering framework inspired by a bioinformatics technique, namely DNA multiple sequence alignment (MSA). A series of video keyframes in chronological order is represented as an alphabetical genome, analogous to a DNA sequence and MSA is employed to automatically partition the NDVs in a video collection into clusters. After discussing the advantages and shortcomings of the main state-of-the-art clustering approaches for video clustering in the theoretical part of the paper, we empirically evaluate the performance of the proposed MSA-based framework against five clustering algorithms representative of these mainstream approaches: Birch, Cure, Dbscan, Expectation-Maximization and Proclus. Also, we show that our clustering-based approach, while being significantly faster than non-clustering-based n-gram and edit distance NDV retrieval techniques, yields better mean average precision retrieval accuracy.

[1]  Dimitrios Gunopulos,et al.  Automatic subspace clustering of high dimensional data for data mining applications , 1998, SIGMOD '98.

[2]  Hung-Khoon Tan,et al.  Real-Time Near-Duplicate Elimination for Web Video Search With Content and Context , 2009, IEEE Transactions on Multimedia.

[3]  Yan Ke,et al.  An efficient parts-based near-duplicate and sub-image retrieval system , 2004, MULTIMEDIA '04.

[4]  Aggelos K. Katsaggelos,et al.  A Kd-Tree Based Dynamic Indexing Scheme for Video Retrieval and Geometry Matching , 2008, 2008 Proceedings of 17th International Conference on Computer Communications and Networks.

[5]  Bashar Tahayna,et al.  Near-duplicate video detection featuring coupled temporal and perceptual visual structures and logical inference based matching , 2012, Inf. Process. Manag..

[6]  Shuicheng Yan,et al.  Near-duplicate keyframe retrieval by nonrigid image matching , 2008, ACM Multimedia.

[7]  Xian-Sheng Hua,et al.  Robust video signature based on ordinal measure , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[8]  Xiao-Ping Zhang,et al.  Automatic identification of digital video based on shot-level sequence matching , 2005, MULTIMEDIA '05.

[9]  Fei Wang,et al.  Real-time large scale near-duplicate web video retrieval , 2010, ACM Multimedia.

[10]  R. Doolittle,et al.  Progressive sequence alignment as a prerequisitetto correct phylogenetic trees , 2007, Journal of Molecular Evolution.

[11]  Fei Wang,et al.  Million-scale near-duplicate video retrieval system , 2011, ACM Multimedia.

[12]  Qi Tian,et al.  Fast and robust search method for short video clips from large video collection , 2004, ICPR 2004.

[13]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[14]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[15]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[16]  Shu-Ching Chen,et al.  GeM-Tree: Towards a Generalized Multidimensional Index Structure Supporting Image and Video Retrieval , 2008, 2008 Tenth IEEE International Symposium on Multimedia.

[17]  Lei Chen,et al.  Monitoring near duplicates over video streams , 2010, ACM Multimedia.

[18]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[19]  Philip S. Yu,et al.  Fast algorithms for projected clustering , 1999, SIGMOD '99.

[20]  Guojun Lu,et al.  Techniques and data structures for efficient multimedia retrieval based on similarity , 2002, IEEE Trans. Multim..

[21]  Hung-Khoon Tan,et al.  Scalable detection of partial near-duplicate videos by visual-temporal consistency , 2009, ACM Multimedia.

[22]  Qi Tian,et al.  Fast and robust video clip search using index structure , 2004, MULTIMEDIA '04.

[23]  Philip S. Yu,et al.  Finding generalized projected clusters in high dimensional spaces , 2000, SIGMOD '00.

[24]  Sudipto Guha,et al.  CURE: an efficient clustering algorithm for large databases , 1998, SIGMOD '98.

[25]  Zi Huang,et al.  UQLIPS: A Real-time Near-duplicate Video Clip Detection System , 2007, VLDB.

[26]  Liang-Tien Chia,et al.  Stratification-based keyframe cliques for removal of near-duplicates in video search results , 2010, MIR '10.

[27]  Xian-Sheng Hua,et al.  Bayesian video search reranking , 2008, ACM Multimedia.

[28]  Tao Mei,et al.  Scalable clip-based near-duplicate video detection with ordinal measure , 2010, CIVR '10.

[29]  Yan Ke,et al.  Efficient Near-duplicate Detection and Sub-image Retrieval , 2004 .

[30]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[31]  Zi Huang,et al.  Multiple feature hashing for real-time large scale near-duplicate video retrieval , 2011, ACM Multimedia.

[32]  D. Higgins,et al.  T-Coffee: A novel method for fast and accurate multiple sequence alignment. , 2000, Journal of molecular biology.

[33]  G. McLachlan,et al.  The EM algorithm and extensions , 1996 .