Kernel Based Clustering for Multiclass Data
暂无分享,去创建一个
In this paper we address the issues in construction of discrete hidden Markov models (HMMs) in the feature space of Mercer kernels. The kernel space HMMs are suitable for complex pattern recognition tasks that involve varying length patterns as in speech recognition. The main issues addressed are related to clustering in the kernel feature space for large data sets consisting of the data of multiple classes. Convergence of kernel based clustering method [1] is slow when the size of the data set is large. We consider an approach in which the multiclass data set is partitioned into subsets, and clustering for the data in each subset is done separately. We consider two methods for partitioning the multiclass data. In the all-class-data method, the partitioning is done in such a way that each subset contains a part of the total data set of each class. In the class-wise-data method, a subset contains the total data of only one class. We study the performance of the two methods on kernel based clustering used to build discrete HMMs in the kernel feature space for recognition of spoken utterances of letters in E-set of English alphabet.
[1] Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .
[2] Gunnar Rätsch,et al. An introduction to kernel-based learning algorithms , 2001, IEEE Trans. Neural Networks.
[3] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.
[4] Mark A. Girolami,et al. Mercer kernel-based clustering in feature space , 2002, IEEE Trans. Neural Networks.