Audio segmentation, classification and clustering in a broadcast news task
This paper describes the development of an audio segmentation, classification and clustering system applied to a broadcast news task for European Portuguese. We developed a new audio segmentation algorithm that is accurate and uses fewer computational resources than other approaches. Our speaker clustering module uses a modified BIC (Bayesian information criterion) algorithm, which performs substantially better than the standard symmetric Kullback-Leibler distance (KL2) and is much faster than the full BIC. Finally, we developed a scheme for tagging certain speaker clusters (anchors) using trained cluster models. A series of tests was conducted, showing the advantage of the new algorithms. The system is part of a prototype that processes the main news show of the national Portuguese broadcaster every day.
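For orientation, the two baseline measures that the clustering module is compared against can be sketched as follows. This is a minimal illustration of the standard full-covariance ΔBIC merge test and the symmetric Kullback-Leibler distance (KL2) between two segments, each modelled by a single Gaussian. It is not the modified BIC algorithm of the paper, whose details are not given in the abstract. The function names, the `penalty_weight` parameter and the use of NumPy are assumptions made for this example.

```python
import numpy as np

def gaussian_stats(x):
    """Mean and maximum-likelihood covariance of a (frames x dims) segment."""
    mean = x.mean(axis=0)
    cov = np.atleast_2d(np.cov(x, rowvar=False, bias=True))
    return mean, cov

def logdet(cov):
    """Log-determinant of a covariance matrix, computed stably."""
    _, value = np.linalg.slogdet(cov)
    return value

def delta_bic(x1, x2, penalty_weight=1.0):
    """Standard delta-BIC between two segments (not the paper's modified version).

    Positive values favour two separate Gaussians (different speakers);
    negative values favour a single shared Gaussian (merge the segments).
    penalty_weight is a hypothetical tuning factor for the model penalty.
    """
    n1, d = x1.shape
    n2 = x2.shape[0]
    n = n1 + n2
    _, cov1 = gaussian_stats(x1)
    _, cov2 = gaussian_stats(x2)
    _, cov = gaussian_stats(np.vstack([x1, x2]))
    # Extra free parameters of the two-model hypothesis: one mean + one covariance.
    penalty = 0.5 * (d + 0.5 * d * (d + 1)) * np.log(n)
    gain = 0.5 * (n * logdet(cov) - n1 * logdet(cov1) - n2 * logdet(cov2))
    return gain - penalty_weight * penalty

def kl2(x1, x2):
    """Symmetric Kullback-Leibler distance between the segments' Gaussians."""
    m1, c1 = gaussian_stats(x1)
    m2, c2 = gaussian_stats(x2)
    d = m1.shape[0]
    inv1, inv2 = np.linalg.inv(c1), np.linalg.inv(c2)
    diff = m1 - m2
    return 0.5 * (np.trace(inv2 @ c1) + np.trace(inv1 @ c2) - 2 * d
                  + diff @ (inv1 + inv2) @ diff)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic 4-dimensional "feature" segments; real systems would use
    # cepstral features extracted from the audio.
    a = rng.normal(0.0, 1.0, size=(500, 4))
    b = rng.normal(0.0, 1.0, size=(500, 4))   # same source as a
    c = rng.normal(5.0, 1.0, size=(500, 4))   # clearly different source
    print("delta_bic(a, b):", delta_bic(a, b))
    print("delta_bic(a, c):", delta_bic(a, c))
    print("kl2(a, b):", kl2(a, b))
    print("kl2(a, c):", kl2(a, c))
```

In this sketch KL2 needs only the per-segment statistics, whereas ΔBIC also requires the covariance of the pooled frames for every candidate pair, which is one reason a full BIC search is comparatively expensive.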