Optimizing recognition and rejection performance in wordspotting systems

Compares the performance which can be achieved by different hidden Markov model (HMM) based wordspotting techniques when their parameters are tuned to optimize recognition and rejection rates. An alternative approach which does not attempt to explicitly model extraneous speech or non-speech noise is also proposed. After optimization of each of these approaches, it appears that the proposed version performs at least as well as the other methods with the advantage of simplicity and possibility to be used in hybrid models using HMMs with a multilayer perceptron (MLP). Test results are reported on a speaker independent telephone database containing 10 keywords as well as on the speaker independent ARPA resource management database in which between 10 and 250 keywords were defined.<<ETX>>

[1]  L. G. Miller,et al.  Improvements and applications for key word recognition using hidden Markov modeling techniques , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[2]  Richard C. Rose Definition of subword acoustic units for wordspotting , 1993, EUROSPEECH.

[3]  E. M. Hofstetter,et al.  Techniques for task independent word spotting in continuous speech messages , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Hervé Bourlard,et al.  Connectionist probability estimators in HMM speech recognition , 1994, IEEE Trans. Speech Audio Process..

[5]  Richard Rose,et al.  A hidden Markov model based keyword recognition system , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[6]  Chin-Hui Lee,et al.  Automatic recognition of keywords in unconstrained speech using hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[7]  Hervé Bourlard,et al.  Connectionist Speech Recognition: A Hybrid Approach , 1993 .

[8]  R. Wohlford,et al.  Keyword recognition using template concatenation , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.