Towards a Maximum Entropy Method for Estimating HMM Parameters

Training a Hidden Markov Model (HMM) to maximise the probability of a given sequence can result in over-fitting: the model represents the training sequence well but fails to generalise. In this paper, we present a possible solution to this problem, namely maximising a linear combination of the likelihood of the training data and the entropy of the model. We derive the equations needed for gradient-based maximisation of this combined objective. The performance of the method is then evaluated against three other algorithms on a classification task using synthetic data. The results indicate that the method is potentially useful. Its main drawback is the computational intractability of the entropy calculation.
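The combined objective can be sketched as follows. This is a minimal illustration, not the paper's derivation: the entropy term is approximated by Monte Carlo sampling (since, as noted above, exact computation is intractable), the weighting parameter `lam` and all function names are hypothetical, and the gradient-based maximisation itself is omitted.

```python
import numpy as np

def forward_loglik(obs, pi, A, B):
    """Scaled forward algorithm: log P(obs | pi, A, B)."""
    alpha = pi * B[:, obs[0]]
    loglik = np.log(alpha.sum())
    alpha = alpha / alpha.sum()
    for o in obs[1:]:
        alpha = (alpha @ A) * B[:, o]
        c = alpha.sum()
        loglik += np.log(c)
        alpha /= c
    return loglik

def sample_sequence(T, pi, A, B, rng):
    """Draw one observation sequence of length T from the HMM."""
    s = rng.choice(len(pi), p=pi)
    obs = []
    for _ in range(T):
        obs.append(rng.choice(B.shape[1], p=B[s]))
        s = rng.choice(A.shape[1], p=A[s])
    return obs

def mc_entropy(T, pi, A, B, rng, n_samples=200):
    """Monte Carlo estimate of the model's sequence entropy
    H = -E[log P(O)] over length-T sequences (exact value intractable)."""
    return -np.mean([forward_loglik(sample_sequence(T, pi, A, B, rng), pi, A, B)
                     for _ in range(n_samples)])

def combined_objective(obs, pi, A, B, lam, rng):
    """Hypothetical combined criterion: training log-likelihood
    plus lam times the (estimated) model entropy."""
    return forward_loglik(obs, pi, A, B) + lam * mc_entropy(len(obs), pi, A, B, rng)

# Toy two-state, two-symbol model.
pi = np.array([0.6, 0.4])
A = np.array([[0.7, 0.3], [0.4, 0.6]])
B = np.array([[0.9, 0.1], [0.2, 0.8]])
obs = [0, 0, 1, 0, 1, 1]
rng = np.random.default_rng(0)
print(combined_objective(obs, pi, A, B, lam=0.1, rng=rng))
```

With `lam = 0` this reduces to ordinary maximum-likelihood training; increasing `lam` trades training-data fit for a higher-entropy (less over-fitted) model.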