论文信息 - Content-based retrieval of MP3 songs based on query by singing

Content-based retrieval of MP3 songs based on query by singing

With the growth of multimedia in the Internet, content analysis of multimedia plays an important role for humanistic management. We investigate the content-based retrieval of MP3 songs based on the interface of query by singing. MDCT (modified DCT) spectral coefficients are directly used to represent the tonic characteristics of a short-term sound. This spectral profile is used for detailed matching between two audio segments. Perceptual features are also computed from MDCT coefficients for audio classification. Two pre-stages based on SVM and k-means classifications are used to remove incorrect (or noisy) segment candidates and to speed up the subsequent matching process. On the other hand, exponential key-scaling schemes and time-warping techniques are developed to overcome key difference and tempo variation between different singers. Experiments show that the retrieval probability of our design can achieve up to 76% among the top 5 out of a total of 114 excerpts in the database.

Wen-Nung Lie | Chen-Kang Su

[1] Guodong Guo,et al. Content-based audio classification and retrieval by support vector machines , 2003, IEEE Trans. Neural Networks.

[2] C.-C. Jay Kuo,et al. Audio content analysis for online audiovisual data segmentation and classification , 2001, IEEE Trans. Speech Audio Process..

[3] Mario Vento,et al. A Neural Multi-expert Classification System for MPEG Audio Segmentation , 2001, ICAPR.

[4] Jyh-Shing Roger Jang,et al. A Query-by-Singing System based on Dynamic Programming , 2000 .

[5] Lie Lu,et al. Digital Object Identifier (DOI) 10.1007/s00530-002-0065-0 Multimedia Systems , 2003 .

[6] Lie Lu,et al. A new approach to query by humming in music retrieval , 2001, IEEE International Conference on Multimedia and Expo, 2001. ICME 2001..

[7] Noel E. O'Connor,et al. Speech-music discrimination from MPEG-1 bitstream , 2001 .

[8] Lie Lu,et al. Music type classification by spectral contrast feature , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.