Ambiguity reduction in speaker identification by the relaxation labeling process

A nonlinear probabilistic model of the relaxation labeling (RL) process is implemented in the speaker identification task in order to disambiguate the labeling of the speech feature vectors. In this proposed algorithm, the deterministic labeling of the vector quantization (VQ)-based speaker identification is relaxed by means of introducing initial probabilistic weights to the labeling process of the speech feature vectors. This process is then iteratively updated until no further significant improvement is found. Experimental results on speaker identification using a commercial speech corpus show that the relaxation labeling outperforms the conventional VQ method.

[1]  Hideo Ogawa A fuzzy relaxation technique for partial shape matching , 1994, Pattern Recognit. Lett..

[2]  J. Y. S. Luh,et al.  Relaxation labeling algorithm for information integration and its convergence , 1995, Pattern Recognit..

[3]  Günther Palm,et al.  A text-independent speaker identification system based on neural networks , 1994, ICSLP.

[4]  Jack-Gérard Postaire,et al.  A relaxation scheme for improving a convexity based clustering method , 1994, Pattern Recognit. Lett..

[5]  G.R. Doddington,et al.  Speaker recognition—Identifying people by their voices , 1985, Proceedings of the IEEE.

[6]  Michael J. Carey,et al.  Discriminative phonemes for speaker identification , 1994, ICSLP.

[7]  Milan Sonka,et al.  Image Processing, Analysis and Machine Vision , 1993, Springer US.

[8]  Shmuel Peleg,et al.  Determining Compatibility Coefficients for Curve Enhancement Relaxation Processes , 1978 .

[9]  Fabio Cocurullo,et al.  A new algorithm for vector quantization , 1995, Proceedings DCC '95 Data Compression Conference.

[10]  Lionel Pelkowitz,et al.  A continuous relaxation labeling algorithm for Markov random fields , 1990, IEEE Trans. Syst. Man Cybern..

[11]  Qin Chen,et al.  Ambiguity reduction by relaxation labeling , 1994, Pattern Recognit..

[12]  Sadaoki Furui,et al.  An Overview of Speaker Recognition Technology , 1996 .

[13]  Azriel Rosenfeld,et al.  Scene Labeling by Relaxation Operations , 1976, IEEE Transactions on Systems, Man, and Cybernetics.

[14]  Gérard Chollet,et al.  Combining methods to improve speaker verification decision , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[15]  Biing-Hwang Juang,et al.  A vector quantization approach to speaker recognition , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[16]  E. V. Krishnamurthy,et al.  Relaxation Processes forScene Labeling: Convergence, Speed, andStability , 1978 .