论文信息 - Markov Processes on Curves for Automatic Speech Recognition

Markov Processes on Curves for Automatic Speech Recognition

We investigate a probabilistic framework for automatic speech recognition based on the intrinsic geometric properties of curves. In particular, we analyze the setting in which two variables-one continuous (x), one discrete (s)-evolve jointly in time. We suppose that the vector x traces out a smooth multidimensional curve and that the variable s evolves stochastically as a function of the arc length traversed along this curve. Since arc length does not depend on the rate at which a curve is traversed, this gives rise to a family of Markov processes whose predictions, Pr[s|x], are invariant to nonlinear warpings of time. We describe the use of such models, known as Markov processes on curves (MPCs), for automatic speech recognition, where x are acoustic feature trajectories and s are phonetic transcriptions. On two tasks--recognizing New Jersey town names and connected alpha-digits--we find that MPCs yield lower word error rates than comparably trained hidden Markov models.

Lawrence K. Saul | Mazin G. Rahim | L. Saul | M. Rahim

[1] Lawrence K. Saul,et al. Automatic Segmentation of Continuous Trajectories with Invariance to Nonlinear Warpings of Time , 1998, ICML.

[2] Biing-Hwang Juang,et al. Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[3] Mari Ostendorf,et al. From HMM's to segment models: a unified view of stochastic modeling for speech recognition , 1996, IEEE Trans. Speech Audio Process..

[4] H. Piaggio. Differential Geometry of Curves and Surfaces , 1952, Nature.

[5] Richard M. Stern,et al. On the effects of speech rate in large vocabulary speech recognition systems , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[6] Naftali Tishby,et al. A dynamical systems approach to speech processing , 1990, International Conference on Acoustics, Speech, and Signal Processing.