A New Framework For Large Vocabulary Keyword Spotting Using Two-Pass Confidence Measure

In this paper, a new framework for large vocabulary keyword spotting is proposed, which involves three phases. In the first phase, N-best sub-word lattice is generated by hidden Markov model (HMM). Keyword candidates are hypothesized by dynamic keyword matching during the second phase. In the last phase, two-pass confidence measure, which provides complementary information, is used for keyword verification. Experimental results show that, with the use of these improvements, the keyword spotting system proves to be more accurate and robust without much computation cost.

[1]  Victor Zue,et al.  A segment-based wordspotter using phonetic filler models , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2]  Steve J. Young,et al.  A fast lattice-based approach to vocabulary independent wordspotting , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[3]  Chiu-yu Tseng,et al.  A multi-phase approach for fast spotting of large vocabulary Chinese keywords from Mandarin speech using prosodic information , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Thomas Fang Zheng,et al.  A two-step keyword spotting method based on context-dependent a posteriori probability , 2004, 2004 International Symposium on Chinese Spoken Language Processing.

[5]  Gérard Chollet,et al.  Confidence measures for keyword spotting using support vector machines , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..