Probabilistic indexing of media sequences

Accurate and fast nearest neighbor search is often required in applications involving media sequences, such as duplicate detection in video collections, music retrieval in digital libraries, and event discovery in streaming documents. Among various related techniques, developing indexing scheme is probably most challenging because of its complexity. This paper documents a novel scheme called HMMH (Hidden Markov Model based Hashing) to facilitate scalable and efficient media sequence retrieval based on advanced hashing algorithm. Main conjecture of our approach is that media sequence's content is complex and the associated dynamic characteristics cannot be ignored. As such, we propose to use hidden Markov model (HMM) for comprehensive media sequence modeling and calculate HMM supervector to represent segments of media sequence. With the novel scheme, more discriminative information about temporal structure can be captured. In addition, the difference of two media sequences is approximated by the Euclidean distance between the associated HMM supervectors. The statistical property enables the proposed HMMH to enjoy good system flexibility - various hashing algorithms (e.g., LSH and SPH) can be applied on HMM supervectors for effective binary code calculation. Our experimental results using both large scale video and music collections demonstrate that the proposed scheme various kinds of advantages over existing techniques.

[1]  Jun Wang,et al.  Self-taught hashing for fast similarity search , 2010, SIGIR.

[2]  Meng Wang,et al.  Unified Video Annotation via Multigraph Learning , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Bingjun Zhang,et al.  CompositeMap: a novel framework for music similarity measure , 2009, SIGIR.

[4]  Piotr Indyk,et al.  Similarity Search in High Dimensions via Hashing , 1999, VLDB.

[5]  Sharad Mehrotra,et al.  The hybrid tree: an index structure for high dimensional feature spaces , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[6]  Pavel Zezula,et al.  M-tree: An Efficient Access Method for Similarity Search in Metric Spaces , 1997, VLDB.

[7]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[8]  Radford M. Neal Pattern Recognition and Machine Learning , 2007, Technometrics.

[9]  Ramesh C. Jain,et al.  Similarity indexing with the SS-tree , 1996, Proceedings of the Twelfth International Conference on Data Engineering.

[10]  Christian Böhm,et al.  Independent quantization: an index compression technique for high-dimensional data spaces , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[11]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[12]  Meng Wang,et al.  Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation , 2009, IEEE Transactions on Multimedia.

[13]  Antonio Torralba,et al.  Spectral Hashing , 2008, NIPS.

[14]  Fernando Diaz,et al.  Temporal profiles of queries , 2007, TOIS.