论文信息 - Revisiting Doddington"s Zoo: A Systematic Method to Assess User-dependent Variabilities

Revisiting Doddington"s Zoo: A Systematic Method to Assess User-dependent Variabilities

A systematic analysis of user-dependent performance variability in the context of automatic speaker verification was first studied by Doddington \etal (1998). Different categories of users were distinguished and were called by animal names such as sheep, goats, lambs and wolves. Although such distinctions are important, it does not directly discriminate ``well-behaved'' users from ``badly behaved'' users. In our context, the badly behaved users are those who will bring the performance down when added to the system. We then extend such a study to formulate a user-specific score normalization (called F-norm's variant) and show that the user-dependent variability can be reduced to obtain an enhanced performance. By introducing some constraints, the proposed framework can also provide a stable user-dependent performance in terms of DET despite the fact that few (genuine) samples are available. In the context of multimodal biometrics, we show that it is possible to decide whether or not fusing the output of several systems is better than selecting any one of them, on a per user basis. This strategy is called an ``OR-switcher''. Based on 15 multimodal fusion experiments, the performance of OR-switcher is significantly better than the state-of-the-art score-level fusion algorithms.

[1] Samy Bengio,et al. F-ratio client dependent normalisation for biometric authentication tasks , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[2] John Daugman,et al. High Confidence Visual Recognition of Persons by a Test of Statistical Independence , 1993, IEEE Trans. Pattern Anal. Mach. Intell..

[3] Samy Bengio,et al. Why do multi-stream, multi-band and multi-modal approaches work on biometric user authentication tasks? , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4] Heekuck Oh,et al. Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[5] Samy Bengio,et al. Database, protocols and tools for evaluating score-level fusion algorithms in biometric authentication , 2006, Pattern Recognit..

[6] Samy Bengio,et al. A unified framework for score normalization techniques applied to text-independent speaker verification , 2005, IEEE Signal Processing Letters.

[7] S. Buxbaum. Sheep , 2004 .

[8] Samy Bengio,et al. A Bayesian Framework for Score Normalization Techniques Applied to Text Independent Speaker Verification , 2004 .

[9] S. Furui,et al. Cepstral analysis technique for automatic speaker verification , 1981 .

[10] Douglas A. Reynolds,et al. SHEEP, GOATS, LAMBS and WOLVES A Statistical Analysis of Speaker Performance in the NIST 1998 Speaker Recognition Evaluation , 1998 .

[11] Eric R. Ziegel,et al. The Elements of Statistical Learning , 2003, Technometrics.