Speaker Verification using 3-D ROC Curves for Increasing Imposter Rejections

A speaker verification system (SVS) is developed based on a bank of radial basis function (RBF) neural modules (BRBFNM). The output thresholds of the RBF networks are set using 3-dimensional receiver operating characteristic (ROC) curves. Moreover, a customized cepstral-based feature extraction pre-processing approach is used at each module to select only those features of the speakers that will contribute to enhancing the SVS's performance. These feature vectors are used to train and test the BRBFNM. Each of four speakers was asked to speak 10 different key words 12 times. From the 479 signals (one speaker missed one word), 279 signals were used for training and 200 for testing. The BRBFNM can achieve a 90.5% correct verification rate (CVR), thus reducing false acceptances (i.e., increasing imposter rejections).

[1]  Hugo Leonardo Rufiner,et al.  Automatic speaker identification by means of Mel cepstrum, wavelets and wavelet packets , 2000, Proceedings of the 22nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Cat. No.00CH37143).

[2]  Sun-Yuan Kung,et al.  Biometric Authentication: A Machine Learning Approach , 2004 .

[3]  Say Wei Foo,et al.  Speaker recognition using adaptively boosted classifier , 2001, Proceedings of IEEE Region 10 International Conference on Electrical and Electronic Technology. TENCON 2001 (Cat. No.01CH37239).

[4]  R. P. Ramachandran,et al.  Robust speaker recognition: a feature-based approach , 1996, IEEE Signal Processing Magazine.

[5]  Juan Carlos,et al.  Review of "Discrete-Time Speech Signal Processing - Principles and Practice", by Thomas Quatieri, Prentice-Hall, 2001 , 2003 .

[6]  Geoffrey E. Hinton,et al.  Adaptive Mixtures of Local Experts , 1991, Neural Computation.

[7]  Robert N. McDonough,et al.  Detection of signals in noise , 1971 .

[8]  Young-Chan Lee,et al.  Classification of infrasound events using radial basis function neural networks , 2005, Proceedings. 2005 IEEE International Joint Conference on Neural Networks, 2005..

[9]  Ivica Kostanic,et al.  Principles of Neurocomputing for Science and Engineering , 2000 .

[10]  C. Goose,et al.  Glossary of Terms , 2004, Machine Learning.

[11]  Nils J. Nilsson,et al.  Learning Machines: Foundations of Trainable Pattern-Classifying Systems , 1965 .

[12]  Thomas Quatieri,et al.  Discrete-Time Speech Signal Processing: Principles and Practice , 2001 .

[13]  M. Inal,et al.  Self organizing map and associative memory model hybrid classifier for speaker recognition , 2002, 6th Seminar on Neural Network Applications in Electrical Engineering.