论文信息 - EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval

EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval

In this article, we present a novel indexing technique called EMIF (Effective Music Indexing Framework) to facilitate scalable and accurate content based music retrieval. It is designed based on a "classification-and-indexing" principle and consists of two main functionality layers: 1) a novel semantic-sensitive classification to identify input music's category and 2) multiple indexing structures - one local indexing structure corresponds to one semantic category. EMIF's layered architecture not only enables superior search accuracy but also reduces query response time significantly. To evaluate the system, a set of comprehensive experimental studies have been carried out using large test collection and EMIF demonstrates promising performance over state-of-the-art approaches.

[1] Michael I. Jordan. Why the logistic function? A tutorial discussion on probabilities and neural networks , 1995 .

[2] David G. Stork,et al. Pattern Classification , 1973 .

[3] Sharad Mehrotra,et al. The hybrid tree: an index structure for high dimensional feature spaces , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[4] Shigeo Abe DrEng. Pattern Classification , 2001, Springer London.

[5] Anne H. H. Ngu,et al. Towards Effective Content-Based Music Retrieval With Multiple Acoustic Feature Combination , 2006, IEEE Transactions on Multimedia.

[6] François Pachet,et al. Content management for electronic music distribution , 2003, CACM.

[7] George Tzanetakis,et al. Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[8] D. Ruppert. The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[9] Tao Li,et al. A comparative study on content-based music genre classification , 2003, SIGIR.

[10] Christian Böhm,et al. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases , 2001, CSUR.

[11] Christos Faloutsos,et al. MidiFind: Similarity Search and Popularity Mining in Large MIDI Databases , 2013, CMMR.

[12] Xuelong Li,et al. QUC-Tree: Integrating Query Context Information for Efficient Music Retrieval , 2009, IEEE Transactions on Multimedia.

[13] Matti Karjalainen,et al. A computationally efficient multipitch analysis model , 2000, IEEE Trans. Speech Audio Process..

[14] D. Rubin,et al. Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15] Kian-Lee Tan,et al. A novel framework for efficient automated singer identification in large music databases , 2009, TOIS.

[16] Ernesto Damiani,et al. Combining multi-probe histogram and order-statistics based LSH for scalable audio content retrieval , 2010, ACM Multimedia.

[17] Cheng Yang,et al. Efficient acoustic index for music retrieval with various degrees of similarity , 2002, MULTIMEDIA '02.

[18] Christopher M. Bishop,et al. Neural Network for Pattern Recognition , 1995 .