A speaker model in speaker recognition system is to be trained from a large data set gathered in multiple sessions. Large data set requires large amount of memory and computation, and moreover it's practically hard to make users utter the data in several sessions. Recently the incremental adaptation methods are proposed to cover the problems. However, the data set gathered from multiple session is vulnerable to the outliers from the irregular utterance variations and the presence of noise, which result in inaccurate speaker model. In this paper, we propose an incremental robust adaptation method to minimize the influence of outliers on Gaussian Mixture Model based speaker model. The robust adaptation is abtained from an incremental verstion of Mestimation. Speaker model is initially trained from small amount of data and it is adapted recursively with the data available in each session. Experimental results from the data set gathered over seven months show that the proposed method is robust against outliers.
[1]
Hanseok Ko,et al.
Speaker adaptations in sparse training data for improved speaker verification
,
2000
.
[2]
Douglas A. Reynolds,et al.
Robust text-independent speaker identification using Gaussian mixture speaker models
,
1995,
IEEE Trans. Speech Audio Process..
[3]
S. Furui,et al.
Cepstral analysis technique for automatic speaker verification
,
1981
.
[4]
Chin-Hui Lee,et al.
Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
,
1994,
IEEE Trans. Speech Audio Process..
[5]
Chafic Mokbel,et al.
Behavior of a Bayesian adaptation method for incremental enrollment in speaker verification
,
2000,
2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).