The trended HMM with discriminative training for phonetic classification

The authors extend the maximum likelihood (ML) training algorithm to the minimum classification error (MCE) training algorithm for optimal estimation of the state-dependent polynomial coefficients in the trended HMM. The problem of automatic speech recognition is viewed as a discriminative dynamic data-fitting problem, where relative (not absolute) closeness in fitting an array of dynamic speech models to the unknown speech data sequence provides the recognition decision. In this view, the properties of the MCE formulation for training the trended HMM are analyzed by fitting raw speech data using MCE-trained trended HMMs, contrasting the poor discriminative fitting using the ML-trained models. Comparisons between the phonetic classification as well as data-fitting results obtained with ML and with MCE training algorithms demonstrate the effectiveness of the discriminatively trained trended HMMs.