An efficient tree-based quantization for content based music retrieval system

In this paper, we have proposed and implemented a new music retrieval system based on content of music wave files. We have investigated different quantization methods by constructing them into the music data histograms as the feature vectors for the music files. There are three important aspects that will affect implementation of the system: audio feature extraction, quantization and distance computation. The proposed new system can allow the users to search the waveform data based on the query music samples. Without converting the audio wave file into a MIDI format, histograms are generated from the query data by vector quantization directly. By using the vector quantization, we can achieve the high accuracy in audio retrieval rate. In the proposed CBMR system, the use of 128 clusters in Kmeans-clustering quantization algorithm can achieve 87% retrieval accuracy and 90% high retrieval accuracy rate with the tree-based quantization.

[1]  Jonathan Foote,et al.  An overview of audio information retrieval , 1999, Multimedia Systems.

[2]  Douglas Keislar,et al.  Content-Based Classification, Search, and Retrieval of Audio , 1996, IEEE Multim..

[3]  Cheng Yang Peer-to-peer architecture for content-based music retrieval on acoustic data , 2003, WWW '03.

[4]  Jonathan Foote,et al.  Content-based retrieval of music and audio , 1997, Other Conferences.

[5]  Harvey F. Silverman,et al.  A model distance measure for talker clustering and identification , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.