Advanced acoustic techniques in automatic speech understanding
暂无分享,去创建一个
The BBN Speech Understanding System (HWIM) has two ways by which it may access the acoustic data of an unknown utterance. The first is an Acoustic-Phonetic Recognition component which derives a bottom-up phonetic transcription of the utterance. New techniques in probabilistic modeling of multi-dimensional density distributions have allowed many acoustic features to be fully utilized. HWIM (for Hear What I Mean) also contains a Parametric Word Verification component which does a top-down parametric word match to determine if the acoustic evidence is consistent with the presence of a hypothesized word. The component uses a synthesis-by-rule program to generate synthetic templates and a dynamic programming algorithm to do time normalization.
[1] Craig Cook,et al. Word verification in a speech understanding system , 1976, ICASSP.
[2] R. Viswanathan,et al. The application of a functional perceptual model of speech to variable-rate LPC systems , 1977 .
[3] F. Itakura,et al. Minimum prediction residual principle applied to speech recognition , 1975 .