EMIF: Towards a Scalable and Effective Indexing Framework for Large Scale Music Retrieval

In this article, we present a novel indexing technique called EMIF (Effective Music Indexing Framework) to facilitate scalable and accurate content based music retrieval. It is designed based on a "classification-and-indexing" principle and consists of two main functionality layers: 1) a novel semantic-sensitive classification to identify input music's category and 2) multiple indexing structures - one local indexing structure corresponds to one semantic category. EMIF's layered architecture not only enables superior search accuracy but also reduces query response time significantly. To evaluate the system, a set of comprehensive experimental studies have been carried out using large test collection and EMIF demonstrates promising performance over state-of-the-art approaches.

[1]  Michael I. Jordan Why the logistic function? A tutorial discussion on probabilities and neural networks , 1995 .

[2]  David G. Stork,et al.  Pattern Classification , 1973 .

[3]  Sharad Mehrotra,et al.  The hybrid tree: an index structure for high dimensional feature spaces , 1999, Proceedings 15th International Conference on Data Engineering (Cat. No.99CB36337).

[4]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[5]  Anne H. H. Ngu,et al.  Towards Effective Content-Based Music Retrieval With Multiple Acoustic Feature Combination , 2006, IEEE Transactions on Multimedia.

[6]  François Pachet,et al.  Content management for electronic music distribution , 2003, CACM.

[7]  George Tzanetakis,et al.  Musical genre classification of audio signals , 2002, IEEE Trans. Speech Audio Process..

[8]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[9]  Tao Li,et al.  A comparative study on content-based music genre classification , 2003, SIGIR.

[10]  Christian Böhm,et al.  Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases , 2001, CSUR.

[11]  Christos Faloutsos,et al.  MidiFind: Similarity Search and Popularity Mining in Large MIDI Databases , 2013, CMMR.

[12]  Xuelong Li,et al.  QUC-Tree: Integrating Query Context Information for Efficient Music Retrieval , 2009, IEEE Transactions on Multimedia.

[13]  Matti Karjalainen,et al.  A computationally efficient multipitch analysis model , 2000, IEEE Trans. Speech Audio Process..

[14]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[15]  Kian-Lee Tan,et al.  A novel framework for efficient automated singer identification in large music databases , 2009, TOIS.

[16]  Ernesto Damiani,et al.  Combining multi-probe histogram and order-statistics based LSH for scalable audio content retrieval , 2010, ACM Multimedia.

[17]  Cheng Yang,et al.  Efficient acoustic index for music retrieval with various degrees of similarity , 2002, MULTIMEDIA '02.

[18]  Christopher M. Bishop,et al.  Neural Network for Pattern Recognition , 1995 .