On the use of score pruning in speaker verification for speaker dependent threshold estimation

A priori speaker-dependent thresholds have been shown to be convenient for speaker verification. However, their estimation is strongly affected by the difficulty of obtaining impostor data, by mismatched conditions, by the scarcity of data in real applications, and by the need to set the threshold a priori, during enrollment. In this context, possible outliers, i.e., client scores that lie far from the mean in terms of Log-Likelihood Ratio (LLR), can lead to wrong estimates of the client mean and variance. To overcome this problem, we propose several methods based on pruning LLR scores according to different statistical criteria. Score pruning removes outliers before the threshold is estimated and thereby improves the subsequent estimates. To address the lack of impostor data, we also suggest a speaker-dependent threshold estimation that uses only client data. Text-dependent and text-independent experiments have been carried out using a multisession telephone database in Spanish with 184 speakers, which was recorded by the authors.
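The general idea of pruning client LLR scores before estimating a threshold can be illustrated with a minimal sketch. The abstract does not specify the statistical criteria, so everything concrete here is an assumption made for illustration: the pruning rule (discard scores farther than `k` standard deviations from the mean, repeated until no score is discarded), the threshold form (pruned mean minus `alpha` times the pruned standard deviation), and the names `prune_scores`, `client_threshold`, `k`, and `alpha`. It should not be read as the authors' method.

```python
import statistics


def prune_scores(scores, k=2.0, max_iter=10):
    """Iteratively drop scores farther than k standard deviations from the mean.

    Assumed pruning criterion, chosen for illustration only.
    """
    kept = list(scores)
    for _ in range(max_iter):
        if len(kept) < 3:
            break  # too few scores left for a meaningful estimate
        mu = statistics.mean(kept)
        sigma = statistics.stdev(kept)
        if sigma == 0:
            break  # all remaining scores are identical
        survivors = [s for s in kept if abs(s - mu) <= k * sigma]
        if len(survivors) == len(kept):
            break  # nothing was pruned in this pass
        kept = survivors
    return kept


def client_threshold(scores, alpha=1.0, k=2.0):
    """Threshold from client scores only: pruned mean minus alpha * pruned std.

    Hypothetical threshold form; alpha is an assumed tuning constant.
    """
    kept = prune_scores(scores, k=k)
    mu = statistics.mean(kept)
    sigma = statistics.stdev(kept) if len(kept) > 1 else 0.0
    return mu - alpha * sigma


# Invented client LLR scores with one distant outlier (-5.0).
client_llr = [2.0, 2.1, 1.9, 2.2, 1.8, 2.0, -5.0]
pruned = prune_scores(client_llr)
threshold = client_threshold(client_llr)
```

In this toy example the single distant score inflates the unpruned variance and drags the mean down, so a threshold computed without pruning would sit far below the bulk of the client scores. After pruning, the estimate reflects only the tightly clustered scores.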
