Speaker recognition by separating phonetic space and speaker space
暂无分享,去创建一个
In speaker recognition, it is a problem that speech f e a-ture varies depending on sentences and time diierence. This variation is mainly attributed to the variation of phonetic information and speaker information included in speech data. If these two kinds of information are separated each other, robust speaker recognition will be realized. In this study, w e propose a speaker identiica-tion and speaker veriication method by separating the phonetic information and speaker information by a sub-space method, under the assumption that a space with large within-speaker variance is a \phonetic space" and a space with small within-speaker variance is a \speaker space". We carried out comparative experiments of the proposed method with a conventional method based on GMM in an observation space as well as in a space transformed by L D A. As a result, we could construct a robust speaker model with a few model parameters using a few training data by the proposed method.
[1] Yasuo Ariki,et al. Speaker recognition and speaker normalization by projection to speaker subspace , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.
[2] Yasuo Ariki,et al. Speaker recognition based on subspace methods , 1994, ICSLP.