In the usual pattern-matching approach to automatic speech recognition, a measure of pattern similarity is used to obtain a tentative word identification. This paper presents techniques for using additional information to test that hypothesis further. The information takes the form of other types of spectral match statistics, energy ratio statistics, and energy derivative statistics. A stepwise linear discriminant analysis procedure selects, for each template, a subset of the available statistics and derives a linear combination of them. The resulting score is the log odds in favor of the tentative identification given the observed values of these additional statistics.
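As a rough illustration of the scoring step only, the sketch below evaluates a linear combination of a template's selected statistics and reads the result as log odds. The statistic names, weights, and bias are hypothetical values chosen for the example, not figures from the paper, and the stepwise selection that would produce them is not shown.

```python
import math

def log_odds(stats, selected, weights, bias):
    """Return bias plus the weighted sum of the selected statistics.

    stats:    dict mapping statistic name -> observed value
    selected: names of the statistics kept for this template
    weights:  one coefficient per selected statistic
    bias:     constant term of the linear combination
    """
    return bias + sum(w * stats[name] for name, w in zip(selected, weights))

def probability(score):
    """Convert log odds to a probability with the logistic function."""
    return 1.0 / (1.0 + math.exp(-score))

# Hypothetical observed statistics for one tentative identification.
stats = {"spectral_match": 0.8, "energy_ratio": 1.5, "energy_derivative": -0.2}

# Hypothetical subset and coefficients for one template (assumed, not from the paper).
selected = ["spectral_match", "energy_ratio"]
weights = [2.0, -1.0]
bias = 0.5

score = log_odds(stats, selected, weights, bias)
accept = score > 0.0  # positive log odds favor the tentative identification
```

A score of zero corresponds to even odds, so thresholding at zero accepts a hypothesis only when the additional statistics favor it; a different threshold could be chosen to trade false acceptances against false rejections.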