Spoken Language Communication with Machines: The Long and Winding Road from Research to Business

This paper traces the history of spoken language communication with computers, from the first attempts in the 1950s, through the establishment of the theoretical foundations in the 1980s, to the phase of incremental improvement in the 1990s and 2000s. It then offers a perspective on the current conversational technology market and industry, analyzing its business value and commercial models.
