Effect of GSM speech coding on the performance of Speaker Recognition System

This paper investigates the influence of GSM speech coding on the performance of a text independent Speaker Recognition System (SRS). The SRS developed perform recognition on reconstructed speech waveform from the coded parameters using Gaussian Mixture Models (GMM) technique. The performance evaluation due to the use of the GSM speech coding namely the GSMEFR (Global System Mobile Enhanced Full Rate) codec was conducted, using three transcoded databases, obtained by passing the local ARADIGIT database through the GSM coder/decoder. The recognition evaluation was also conducted using original ARADIGIT sampled at 16 KHz and its 8 KHz downsampled version. The ARADIGIT database consists of 60 speakers (31 male speakers and 29 female speakers) pronouncing the ten Arabic digits five time each. Several experiments were conducted in order to evaluate the degradation introduced by different aspects of the simulated coder.

[1]  Richard J. Mammone,et al.  Speaker recognition - general classifier approaches and data fusion methods , 2002, Pattern Recognit..

[2]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[3]  Douglas A. Reynolds,et al.  An overview of automatic speaker recognition technology , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Mark C. Huggins,et al.  Confidence metrics for speaker identification , 2002, INTERSPEECH.

[5]  Fausto Pellandini,et al.  GSM speech coding and speaker recognition , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[6]  Sadaoki Furui,et al.  Recent advances in speaker recognition , 1997, Pattern Recognit. Lett..

[7]  John H. L. Hansen,et al.  Discrete-Time Processing of Speech Signals , 1993 .

[8]  Til T. Phan,et al.  Text-Independent Speaker Identification , 1999 .

[9]  Alex Waibel,et al.  Robust speaker recognition , 2007 .

[10]  Fausto Pellandini,et al.  Influence of GSM speech coding on the performance of text-independent speaker recognition , 2000, 2000 10th European Signal Processing Conference.