Paper Information - An Experimental Study of Semi-Supervised EM algorithms in Audio Classification and Speaker Identification

Most pattern recognition techniques assume the existence of large quantities of carefully labeled data for training classifiers. However, the generation of this labeled data is an expensive and time-consuming task. Large amounts of data are generated daily, and labeling this data to refine classifiers becomes impossible. In recent years, a new body of techniques has emerged that explores how to take advantage of vast quantities of unlabeled data, i.e., data with no class assignment information. In this paper we study the applicability of these techniques to various audio classification tasks. We show very promising results that demonstrate a halving of audio classification and speaker identification error rates.

Shivani Agarwal | Pedro J. Moreno
