Fully Bayesian inference of multi-mixture Gaussian model and its evaluation using speaker clustering
暂无分享,去创建一个
This study aims to verify effective optimization methods for estimating parametric, fully Bayesian models in speech processing. For that purpose, we investigate the impact of the difference in optimization methods for the multi-scale Gaussian mixture model, which is suitable for speaker clustering, on the clustering accuracy. The Markov chain Monte Carlo (MCMC)-based method was compared with the variational Bayesian method in the speaker clustering experiment; with a small amount of data, the MCMC-based method was more effective; with large scale data (more than one million samples), the difference between these methods in terms of the clustering accuracy decreased and the MCMC-based method was computationally efficient.
[1] Michael I. Jordan,et al. Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..
[2] Tetsuji Ogawa,et al. Speaker Clustering Based on Utterance-Oriented Dirichlet Process Mixture Model , 2011, INTERSPEECH.
[3] Mark Steyvers,et al. Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.
[4] Fabio Valente,et al. Variational Bayesian speaker clustering , 2004, Odyssey.