Pitch Oriented Automatic Singer Identification in Pop Music

In this paper, we proposed two novel methods used to distinguish the singer of a pop music. We focused on a single singer and single track case. These two methods are “Pitch Extraction” method and “1/12 OFCC” method. The Pitch Extraction method is composed of three stages and they are Singing pitch estimation stage, Exact pitch calculation stage and GMM classification stage. “1/12 OFCC” method is composed of “Pitch Feature Calculation” and GMM classification. We also compare these two methods with OFCC [1] based method. With “Pitch Extraction” and “1/12 OFCC” method, we have some improvement on works of singer identification using single feature.

[1]  Ye Wang,et al.  Automatic Detection Of Vocal Segments In Popular Songs , 2004, ISMIR.

[2]  Ye Wang,et al.  Singing voice detection for karaoke application , 2005, Visual Communications and Image Processing.

[3]  George Martin The Opera Companion , 1961 .

[4]  A. Chanrungutai,et al.  Singing Voice Separation in Mono-Channel Music , 2008, 2008 International Symposium on Communications and Information Technologies.

[5]  Wei-Ho Tsai,et al.  Automatic Identification of Simultaneous Singers in Duet Recordings , 2008, ISMIR.

[6]  Hsin-Min Wang,et al.  Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[7]  Haizhou Li,et al.  Singing voice detection in pop songs using co-training algorithm , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[8]  DeLiang Wang,et al.  Separation of Singing Voice From Music Accompaniment for Monaural Recordings , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[9]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[10]  Hsin-Min Wang,et al.  On the extraction of vocal-related information to facilitate the management of popular music collections , 2005, Proceedings of the 5th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL '05).

[11]  Alex Acero,et al.  Spoken Language Processing: A Guide to Theory, Algorithm and System Development , 2001 .

[12]  T. Subba Rao,et al.  Classification, Parameter Estimation and State Estimation: An Engineering Approach Using MATLAB , 2004 .

[13]  Gert R. G. Lanckriet,et al.  Towards musical query-by-semantic-description using the CAL500 data set , 2007, SIGIR.

[14]  Haizhou Li,et al.  Exploring Vibrato-Motivated Acoustic Features for Singer Identification , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[15]  Kian-Lee Tan,et al.  Towards efficient automated singer identification in large music databases , 2006, SIGIR.