A new system for continuous speech recognition - preliminary results

A speaker dependent system for recognizing carefully articulated continuous speech is described. The system accepts English sentences composed from a 127 word vocabulary appropriate to an airline information reservation task. The system is controlled by a finite state parser which generates word candidates and established their temporal locations in hypothetical sentences. The word candidates are evaluated by an LPC distance measure and a dynamic programming algorithm which nonlinearly time aligns isolated word reference templates with the input speech stream. The input is recognized as the hypothetical sentence having the lowest distance according to a well-defined criterion. In a preliminary test based on 100 sentences spoken over dialed up telephone lines by two male talkers, 90% word accuracy, resulting in 75% sentence recognition, was achieved.

[1]  Hiroaki Sakoe,et al.  A Dynamic Programming Approach to Continuous Speech Recognition , 1971 .

[2]  W. Woods,et al.  Motivation and overview of SPEECHLIS: An experimental prototype for speech understanding research , 1975 .

[3]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[4]  J. Baker,et al.  The DRAGON system--An overview , 1975 .

[5]  F. Jelinek,et al.  Continuous speech recognition by statistical methods , 1976, Proceedings of the IEEE.

[6]  Lawrence R. Rabiner,et al.  Some preliminary experiments in the recognition of connected digits , 1975 .

[7]  P. Quinton Utilisation d’un analyseur syntaxique pour la reconnaissance de la parole continue , 1977 .

[8]  L. Erman A functional description of the Hearsay-II speech understanding system , 1977 .

[9]  R. Christiansen,et al.  Detecting and locating key words in continuous speech using linear predictive coding , 1977 .

[10]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[11]  S. E. Levinson,et al.  The effects of syntactic analysis on word recognition accuracy , 1978, The Bell System Technical Journal.

[12]  Yasuhisa Niimi,et al.  A voice-input programming system using basic-like language , 1978, ICASSP.

[13]  A. E. Rosenberg,et al.  Evaluation of a word recognition system using syntax analysis , 1978, The Bell System Technical Journal.

[14]  Aaron E. Rosenberg,et al.  Considerations in dynamic time warping algorithms for discrete word recognition , 1978 .