Keyword-spotting in noisy continuous speech using word pattern vector subabstraction and noise immunity learning

Noise immunity learning, previously proposed by the authors (1991) for isolated word recognition in noisy environments, is extended to keyword spotting in noisy continuous speech. The powerful features of the noise immunity keyword-spotting method are keyword spotting based on the multiple similarity (MS) method for reliable keyword detection, noise immunity learning for greater robustness in recognition of spontaneous or noisy speech, and word pattern vector subabstraction to represent noisy keyword patterns from different viewpoints. Integrating the spotting results obtained by different kinds of subabstracted word pattern vectors significantly improved the performance of the keyword spotting. A system to spot 30 keywords currently runs in real-time on a workstation with two accelerators. The spotted keywords are fed into a keyword sequence LR parser for spontaneous speech understanding.<<ETX>>

[1]  Yoichi Takebayashi,et al.  A robust speech recognition system using word-spotting with noise immunity learning , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[2]  T. Ukita,et al.  A similarity value transformation method for probabilistic scoring , 1988, [1988 Proceedings] 9th International Conference on Pattern Recognition.

[3]  Yoichi Takebayashi,et al.  An accelerator for high-speed spoken word-spotting and noise immunity learning system , 1990, ICSLP.

[4]  Yoichi Takebayashi,et al.  A real-time task-oriented speech understanding system using keyword-spotting , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  L. G. Miller,et al.  Improvements and applications for key word recognition using hidden Markov modeling techniques , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Wayne H. Ward Understanding spontaneous speech: the Phoenix system , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[7]  Richard P. Lippmann,et al.  Techniques for information retrieval from voice messages , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[8]  Alexander I. Rudnicky,et al.  Spoken language recognition in an office management domain , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.