Biometric identification through speaker verification over telephone lines

In this paper, the identity of a remote user is verified through his voice by means of a simple telephone in order to gain access to a specific system or service. We have used state-of-the-art text-independent speaker modeling algorithms, likelihood normalization in the verification process, and channel normalization techniques. Several experiments are presented showing the negative effects of channel variability and temporal lapse between training and testing recordings, but we will show that using the appropriate parameterization, channel compensation and multisession training will allow us to obtain less than 1% of joint false acceptances and false rejections from single utterances of the speaker passport number.

[1]  Javier Ortega-Garcia,et al.  AHUMADA: a large speech corpus in Spanish for speaker identification and verification , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[2]  Douglas A. Reynolds,et al.  The NIST speaker recognition evaluation - Overview, methodology, systems, results, perspective , 2000, Speech Commun..

[3]  Jean-Claude Junqua,et al.  Robustness in Automatic Speech Recognition , 1996 .

[4]  Douglas A. Reynolds,et al.  Robust text-independent speaker identification using Gaussian mixture speaker models , 1995, IEEE Trans. Speech Audio Process..

[5]  Javier Ortega-Garcia,et al.  Robust speaker recognition through acoustic array processing and spectral normalization , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  Carmen García-Mateo,et al.  On the use of acoustic segmentation in speaker identification , 1997, EUROSPEECH.

[7]  Lawrence G. Bahler,et al.  Speaker verification using randomized phrase prompting , 1991, Digit. Signal Process..