A similarity measure between videos using alignment, graphical and speech features

A novel video similarity measure is proposed by using visual features, alignment distances and speech transcripts. First, video files are represented by a sequence of segments each of which contains colour histograms, starting time, and a set of phonemes. After, textual, alignment and visual features are extracted of these segments. The following step, bipartite matching and statistical features are applied to find correspondences between segments. Finally, a similarity is calculated between videos. Experiments have been carried out and promising results have been obtained.

[1]  Wei Xiong,et al.  Query by video clip , 1999, Multimedia Systems.

[2]  Avideh Zakhor,et al.  Efficient video similarity measurement with video signature , 2003, IEEE Trans. Circuits Syst. Video Technol..

[3]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[4]  Shih-Fu Chang,et al.  Detecting image near-duplicate by stochastic attributed relational graph matching with learning , 2004, MULTIMEDIA '04.

[5]  Stefan Eickeler,et al.  Content-based video indexing of TV broadcast news using hidden Markov models , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[6]  Hung-Khoon Tan,et al.  Near-Duplicate Keyframe Identification With Interest Point Matching and Pattern Learning , 2007, IEEE Transactions on Multimedia.

[7]  Chong-Wah Ngo,et al.  Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation , 2006, MM '06.

[8]  Jonathan Foote,et al.  An overview of audio information retrieval , 1999, Multimedia Systems.

[9]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[10]  Chong-Wah Ngo,et al.  Clip-based similarity measure for hierarchical video retrieval , 2004, MIR '04.

[11]  Marcel Worring,et al.  Optimization of interactive visual-similarity-based search , 2008, TOMCCAP.

[12]  Eli Shechtman,et al.  Space-time behavior based correlation , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[13]  Hsin-Min Wang,et al.  Fast min-hashing indexing and robust spatio-temporal matching for detecting video copies , 2010, TOMCCAP.

[14]  Ruud M. Bolle,et al.  Comparison of sequence matching techniques for video copy detection , 2001, IS&T/SPIE Electronic Imaging.