Minimum classification error rate method using genetic algorithms

Hidden Markov models (HMM) is one the most common statistical matching methods used for speech recognition, especially for continuous speech utterances. One major problem in HMM is that the training process that aims to generate a set of HMM models (recognizer) for matching the speech source usually is based on a set of limited training data. The minimum classification error (MCE) training approach (Juang et al., 1997) is regarded as a discriminative method that is proven to be superior to other traditional probability distribution estimation approaches, such as the maximum likelihood (ML) approach. The main drawback in the MCE is to formulate the error rate estimate function as a smooth loss function for applying a gradient search technique that subsequently leads to a local optimal solution. In this paper, a genetic algorithm based approach (GA-MCE) for the MCE is proposed to solve these problems. In our experiments, the results demonstrated that the GA-MCE is superior to the original MCE method. It can also significantly increase the performance of voice input systems.

[1]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[2]  Sam Kwong,et al.  Optimization of HMM by a genetic algorithm , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Lawrence. Davis,et al.  Handbook Of Genetic Algorithms , 1990 .

[4]  Louis A. Liporace,et al.  Maximum likelihood estimation for multivariate observations of Markov sources , 1982, IEEE Trans. Inf. Theory.

[5]  Fang Sun,et al.  Speech recognition based on genetic algorithm for training HMM , 1998 .

[6]  Sam Kwong,et al.  A maximum model distance approach for HMM-based speech recognition , 1998, Pattern Recognit..

[7]  Lalit R. Bahl,et al.  A Maximum Likelihood Approach to Continuous Speech Recognition , 1983, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[9]  A. Nadas,et al.  A decision theorectic formulation of a training problem in speech recognition and a comparison of training by unconditional versus conditional maximum likelihood , 1983 .

[10]  L. R. Rabiner,et al.  An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition , 1983, The Bell System Technical Journal.

[11]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[12]  Biing-Hwang Juang,et al.  Minimum classification error rate methods for speech recognition , 1997, IEEE Trans. Speech Audio Process..