Speech recognition: From the laboratory to the real world

The art and science of speech recognition research have advanced to the point where it is now possible to communicate reliably with a machine via telephone to carry out simple tasks. However, the creation of robust algorithms is only part of the overall process of making speech recognition technology a commercial reality. In this paper, we present a brief overview of speech recognition technology and discuss the implementation of the algorithms with digital signal processing chips. In addition, we will show how theory and practice come together in real-world conditions by describing a telephone network trial of automatic credit card verification using speech recognition technology.

[1]  Perry K. White,et al.  Converging Technologies: Automotive, Cellular, and Voice Input/Output , 1986 .

[2]  Allen Louis Gorin,et al.  Incorporating syntax into the level-building algorithm on a tree-structured parallel computer , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3]  James L. Flanagan,et al.  HuMaNet: An experimental human-machine communications network based on ISDN wideband audio , 1990 .

[4]  David R. Fischell,et al.  Interactive voice technology applications , 1990, AT&T Technical Journal.

[5]  Chin-Hui Lee,et al.  Automatic recognition of keywords in unconstrained speech using hidden Markov models , 1990, IEEE Trans. Acoust. Speech Signal Process..

[6]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[7]  Sadaoki Furui Recent advances in speech recognition , 1991, EUROSPEECH.

[8]  S. P. Pekarich,et al.  The DSP32C: AT&Ts second generation floating point digital signal processor , 1988, IEEE Micro.

[9]  Chin-Hui Lee,et al.  Acoustic modeling of subword units for speech recognition , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[10]  J. G. Wilpon,et al.  A study on the ability to automatically recognize telephone-quality speech from large customer populations , 1985, AT&T Technical Journal.

[11]  J.G. Wilpon,et al.  Isolated word recognition over the DDD telephone network. Results of two extensive field studies , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[12]  J. Mariani,et al.  Recent advances in speech processing , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[13]  A. L. Gorin,et al.  Parallel level-building on a tree machine (speech recognition) , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[14]  Frank K. Soong,et al.  High performance connected digit recognition using hidden Markov models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[15]  L. Rabiner,et al.  Isolated and Connected Word Recognition - Theory and Selected Applications , 1981, IEEE Transactions on Communications.

[16]  Robert J. Perdue,et al.  AT&T voice processing system architectures , 1990, AT&T Technical Journal.