Speaker independent connected word recognition using a syntax-directed dynamic programming procedure

A method for speaker independent connected word recognition is described. Speaker independence is achieved by clustering isolated word utterances of a 100 speaker population. Connected word recognition is based on a syntax-directed dynamic programming algorithm which matches the isolated word templates to sentence length utterances. The method has been tested on an artificial task-oriented language based on a 127 word vocabulary. Four subjects, two men and two women, spoke a total of 209 sentences comprising 1750 words. At an average speaking rate of 171 words/min over dialed-up telephone lines, a correct word recognition rate of 97 percent was observed.

[1]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[2]  John W. Klovstad Probabilistic lexical retrieval component with embedded phonological word boundary rules , 1976, ICASSP.

[3]  Lalit R. Bahl,et al.  Design of a linguistic statistical decoder for the recognition of continuous speech , 1975, IEEE Trans. Inf. Theory.

[4]  Aaron E. Rosenberg,et al.  Considerations in dynamic time warping algorithms for discrete word recognition , 1978 .

[5]  Lalit R. Bahl,et al.  Decoding for channels with insertions, deletions, and substitutions with applications to speech recognition , 1975, IEEE Trans. Inf. Theory.

[6]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[7]  Hiroaki Sakoe,et al.  A Dynamic Programming Approach to Continuous Speech Recognition , 1971 .

[8]  H. Sakoe,et al.  Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition , 1979 .

[9]  Aaron E. Rosenberg,et al.  Some experiments with a syntax directed speech recognition system , 1978, ICASSP.

[10]  A.V. Oppenheim,et al.  Analysis of linear digital networks , 1975, Proceedings of the IEEE.

[11]  Lawrence R. Rabiner,et al.  Application of dynamic time warping to connected digit recognition , 1980 .

[12]  Aaron E. Rosenberg,et al.  A preliminary study on the use of demisyllables in automatic speech recognition , 1981, ICASSP.

[13]  Jay G. Wilpon,et al.  Considerations in applying clustering techniques to speaker-independent word recognition. , 1979 .

[14]  N. G. Zagoruyko,et al.  Automatic recognition of 200 words , 1970 .

[15]  F. Jelinek Fast sequential decoding algorithm using a stack , 1969 .

[16]  Aaron E. Rosenberg,et al.  A new system for continuous speech recognition - preliminary results , 1979, ICASSP.

[17]  Bruce T. Lowerre,et al.  The HARPY speech recognition system , 1976 .

[18]  S. E. Levinson,et al.  The effects of syntactic analysis on word recognition accuracy , 1978, The Bell System Technical Journal.

[19]  C. Myers,et al.  A level building dynamic time warping algorithm for connected word recognition , 1981 .

[20]  Stephen E. Levinson,et al.  Computing relative redundancy to measure grammatical constraint in speech recognition tasks , 1978, ICASSP.