Automatic Vocal Segments Detection in Popular Music

We propose a technique for the automatic vocal segments detection in an acoustical polyphonic music signal. We use a combination of several characteristics specific to singing voice as the feature and employ a Gaussian Mixture Model (GMM) classifier for vocal and non-vocal classification. We have employed a pre-processing of spectral whitening and archived a performance of 81.3% over the RWC popular music dataset.

[1]  Daniel P. W. Ellis,et al.  USING VOICE SEGMENTS TO IMPROVE ARTIST CLASSIFICATION OF MUSIC , 2002 .

[2]  Hsin-Min Wang,et al.  Blind Clustering of Popular Music Recordings Based on Singer Voice Characteristics , 2004, Computer Music Journal.

[3]  Masataka Goto,et al.  Development of the RWC Music Database , 2004 .

[4]  Joseph Picone,et al.  Signal modeling techniques in speech recognition , 1993, Proc. IEEE.

[5]  Changsheng Xu,et al.  Singing voice detection using twice-iterated composite Fourier transform , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[6]  Anssi Klapuri,et al.  Multiple Fundamental Frequency Estimation by Summing Harmonic Amplitudes , 2006, ISMIR.

[7]  Liang Gu,et al.  Robust singing detection in speech/music discriminator design , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[8]  Hsin-Min Wang,et al.  Automatic singer identification of popular music recordings via estimation and modeling of solo vocal signal , 2003, INTERSPEECH.

[9]  Youngmoo E. Kim,et al.  Singer Identification in Popular Music Recordings Using Voice Coding Features , 2002 .

[10]  Christian Dittmar,et al.  EFFECTIVE SINGING VOICE DETECTION IN POPULAR MUSIC USING ARMA FILTERING , 2007 .

[11]  T. Zhang System and Method for Automatic Singer Identification , 2003 .

[12]  Ye Wang,et al.  Singing voice detection in popular music , 2004, MULTIMEDIA '04.