Speaker independent recognition of isolated words using clustering techniques

A speaker independent, isolated word recognition system is proposed which is based on the use of multiple templates for each word in the vocabulary. The word templates are obtained from a statistical clustering analysis of a large data base consisting of 100 replications of each word (i.e. once by each of 100 talkers). The recognition system, which uses telephone recordings, is based on an LPC analysis of the unknown word, dynamic time warping of each reference template to the unknown word (using the Itakura LPC distance measure), and the application of a K-nearest neighbor (KNN) decision rule to lower the probability of error. Results are presented on two test sets of data which show error rates that are comparable to, or better than, those obtained with speaker trained, isolated word recognition systems.

[1]  S. S. Wilks Determination of Sample Sizes for Setting Tolerance Limits , 1941 .

[2]  C. Quesenberry,et al.  A nonparametric estimate of a multivariate density function , 1965 .

[3]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[4]  J. Shearme,et al.  Some experiments with a simple word recognition system , 1968 .

[5]  Hiroaki Sakoe,et al.  A Dynamic Programming Approach to Continuous Speech Recognition , 1971 .

[6]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[7]  Phillips B. Scott VICI - A speaker independent word recognition system , 1976, ICASSP.

[8]  A. E. Rosenberg,et al.  Evaluation of an automatic word recognition system over dialed‐up telephone lines , 1976 .

[9]  Aaron E. Rosenberg,et al.  Evaluation of a word recognition system using syntax analysis , 1977 .

[10]  C. E. Schmidt,et al.  Recognition of spoken spelled names applied to directory assistance , 1977 .

[11]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[12]  Lawrence R. Rabiner,et al.  On creating reference templates for speaker independent recognition of isolated words , 1978 .

[13]  Aaron E. Rosenberg,et al.  Considerations in dynamic time warping algorithms for discrete word recognition , 1978 .

[14]  S. Levinson,et al.  Considerations in dynamic time warping algorithms for discrete word recognition , 1978 .

[15]  J. Gowdy,et al.  A speaker-independent speech-recognition system based on linear prediction , 1978 .