论文信息 - Parametric trajectory models for speech recognition

Parametric trajectory models for speech recognition

The basic motivation for employing trajectory models for speech recognition is that sequences of speech features are statistically dependent and that the effective and efficient modeling of the speech process will incorporate this dependency. In our previous work we presented an approach to modeling the speech process with trajectories. In this paper we continue our development of parametric trajectory models for speech recognition. We extend our models to include time-varying covariances and describe our approach for defining a metric between speech segments based on trajectory models; it is important in developing mixture models of trajectories.

Herbert Gish | Kenney Ng | H. Gish | Kenney Ng

[1] James R. Glass,et al. Statistical trajectory models for phonetic recognition , 1994, ICSLP.

[2] Victor Zue,et al. Signal Representation Attribute Extraction and the Use Distinctive Features for Phonetic Classification , 1991, HLT.

[3] Herbert Gish,et al. Segregation of speakers for speech recognition and speaker identification , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[4] Herbert Gish,et al. A segmental speech model with applications to word spotting , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5] Mari Ostendorf,et al. A stochastic segment model for phoneme-based continuous speech recognition , 1989, IEEE Trans. Acoust. Speech Signal Process..