MSRA-MM: Bridging Research and Industrial Societies for Multimedia Information Retrieval

This report introduces a dataset named Microsoft Research Asia Multimedia (MSRA-MM)1 that aims to en- courage research in multimedia information retrieval and the related areas. The images and videos in the dataset are collected from Microsoft Live Search and the performance of state-of-the-art industrial techniques can be evaluated accordingly. Therefore, conducting research using the dataset is able to demonstrate the practical usefulness of the studied algorithms. The data have been comprehensively annotated, such as their relevance, concept, category and quality, and different research scenarios can be implemented on the dataset.

[1]  Diane J. Cook,et al.  Automatic Video Classification: A Survey of the Literature , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[2]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Marcel Worring,et al.  Content-Based Image Retrieval at the End of the Early Years , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  G. Griffin,et al.  Caltech-256 Object Category Dataset , 2007 .

[5]  Xian-Sheng Hua,et al.  Bayesian video search reranking , 2008, ACM Multimedia.

[6]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[7]  Shih-Fu Chang,et al.  Active Context-Based Concept Fusionwith Partial User Labels , 2006, 2006 International Conference on Image Processing.

[8]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[9]  Tao Mei,et al.  Home Video Visual Quality Assessment With Spatiotemporal Factors , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Tao Mei,et al.  Correlative multi-label video annotation , 2007, ACM Multimedia.

[11]  Shumeet Baluja,et al.  Pagerank for product image search , 2008, WWW.