Confidence measures in multiple pronunciations modeling for speaker verification

This paper investigates the use of multiple pronunciations modeling for user-customized password speaker verification (UCP-SV). The main characteristic of UCP-SV is that the system has no a priori knowledge about the password used by the speaker. Our aim is to exploit information about how the speaker pronounces the password in the decision process. This information is extracted automatically using a speaker-independent speech recognizer. We investigate and compare several techniques, some of which combine confidence scores estimated by different models. In this context, we propose a new confidence measure, based on a log-likelihood ratio, that uses acoustic information extracted during speaker enrollment. These techniques yield a significant improvement (15.7% relative in terms of equal error rate) over a UCP-SV baseline system in which the speaker is modeled by a single model (corresponding to one utterance).
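For illustration only (the notation below is ours, not taken from the paper), a log-likelihood ratio confidence measure for an enrollment or test utterance $X = (x_1, \dots, x_T)$ typically compares a pronunciation-specific model $\lambda_{\mathrm{pron}}$ against a background (anti-)model $\bar{\lambda}$, normalized by the utterance length:

\[
\mathrm{CM}_{\mathrm{LLR}}(X) \;=\; \frac{1}{T}\Big(\log p(X \mid \lambda_{\mathrm{pron}}) \;-\; \log p(X \mid \bar{\lambda})\Big),
\]

where the utterance is accepted as the claimed speaker's password when $\mathrm{CM}_{\mathrm{LLR}}(X)$ exceeds a decision threshold. The specific models and normalization used in the proposed measure are defined in the body of the paper.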