History and Development of Speech Recognition

Speech is the primary means of communication between humans. For reasons ranging from technological curiosity about the mechanisms for mechanical realization of human speech capabilities to the desire to automate simple tasks which necessitate human–machine interactions, research in automatic speech recognition by machines has attracted a great deal of attention for five decades.

[1]  Wu Chou,et al.  Pattern Recognition in Speech and Language Processing , 2002 .

[2]  Hy Murveit,et al.  Linguistic constraints in hidden Markov model based speech recognition , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3]  Wu Chou Minimum Classification Error (MCE) Approach in Pattern Recognition , 2003 .

[4]  Harry F. Olson,et al.  Phonetic typewriter , 1957 .

[5]  John Makhoul,et al.  BYBLOS: The BBN continuous speech recognition system , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[7]  Biing-Hwang Juang,et al.  Flexible speech understanding based on combined key-phrase detection and verification , 1998, IEEE Trans. Speech Audio Process..

[8]  T. B. Martin,et al.  SPEECH RECOGNITION BY FEATURE-ABSTRACTION TECHNIQUES. , 1964 .

[9]  Roger K. Moore,et al.  Hidden Markov model decomposition of speech and noise , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[10]  D. B. Fry,et al.  Theoretical aspects of mechanical speech recognition , 1959 .

[11]  Chin-Hui Lee,et al.  A frame-synchronous network search algorithm for connected word recognition , 1989, IEEE Trans. Acoust. Speech Signal Process..

[12]  Sadaoki Furui Recent Progress in Corpus-Based Spontaneous Speech Recognition , 2005, IEICE Trans. Inf. Syst..

[13]  K. M. Ponting,et al.  Computational Models of Speech Pattern Processing , 1999, NATO ASI Series.

[14]  Bruce Lowerre,et al.  The Harpy speech understanding system , 1990 .

[15]  Sadaoki Furui,et al.  Speaker-independent isolated word recognition using dynamic features of speech spectrum , 1986, IEEE Trans. Acoust. Speech Signal Process..

[16]  Sadaoki Furui,et al.  Fifty years of progress in speech and speaker recognition , 2004 .

[17]  Shigeru Katagiri Speech Pattern Recognition using Neural Networks , 2003 .

[18]  C. Myers,et al.  A level building dynamic time warping algorithm for connected word recognition , 1981 .

[19]  F. Itakura,et al.  Minimum prediction residual principle applied to speech recognition , 1975 .

[20]  Robert C. Moore Using Natural-Language Knowledge Sources in Speech Recognition , 1999 .

[21]  D. B. Paul,et al.  The Lincoln robust continuous speech recognizer , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[22]  Biing-Hwang Juang,et al.  Key-phrase detection and verification for flexible speech understanding , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[23]  Lalit R. Bahl,et al.  Design of a linguistic statistical decoder for the recognition of continuous speech , 1975, IEEE Trans. Inf. Theory.

[24]  Myoung-Wan Koo,et al.  Speech recognition and utterance verification based on a generalized confidence score , 2001, IEEE Trans. Speech Audio Process..

[25]  Geoffrey Zweig,et al.  The IBM 2004 conversational telephony system for rich transcription , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[26]  B. Beek,et al.  An assessment of the technology of automatic speech recognition for military applications , 1977 .

[27]  Sadaoki Furui,et al.  Speech-to-text and speech-to-speech summarization of spontaneous speech , 2004, IEEE Transactions on Speech and Audio Processing.

[28]  Richard Lippmann,et al.  Speech recognition by machines and humans , 1997, Speech Commun..

[29]  J. Forgie,et al.  Results Obtained from a Vowel Recognition Computer Program , 1959 .

[30]  Chin-Hui Lee,et al.  A structural Bayes approach to speaker adaptation , 2001, IEEE Trans. Speech Audio Process..

[31]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[32]  Shuji Doshita,et al.  The Phonetic Typewriter , 1962, IFIP Congress.

[33]  Jean-Claude Junqua,et al.  Robustness in Automatic Speech Recognition , 1996 .

[34]  Victor Zue,et al.  The MIT SUMMIT Speech Recognition System: A Progress Report , 1989, HLT.

[35]  N. G. Zagoruyko,et al.  Automatic recognition of 200 words , 1970 .

[36]  H. Sakoe,et al.  Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition , 1979 .

[37]  Chin-Hui Lee,et al.  Acoustic modeling for large vocabulary speech recognition , 1990 .

[38]  P. Denes,et al.  The design and operation of the mechanical speech recognizer at University College London , 1959 .

[39]  T. K. Vintsyuk Speech discrimination by dynamic programming , 1968 .

[40]  Philip C. Woodland,et al.  Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models , 1995, Comput. Speech Lang..

[41]  Andreas Stolcke,et al.  Structural metadata research in the EARS program , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[42]  Frederick Jelinek,et al.  The development of an experimental discrete dictation recognizer , 1985 .

[43]  S. Furui,et al.  Automatic recognition and understanding of spoken language - a first step toward natural human-machine communication , 2000, Proceedings of the IEEE.

[44]  K. Nagata Spoken digit recognizer for Japanese language. , 1963 .

[45]  Dennis H. Klatt,et al.  Review of the ARPA speech understanding project , 1990 .

[46]  Geoffrey E. Hinton,et al.  Phoneme recognition using time-delay neural networks , 1989, IEEE Trans. Acoust. Speech Signal Process..

[47]  H Bung Automatic speech recognition and understanding : A first step toward natural human-machine communication , 2000 .

[48]  D. R. Reddy An approach to computer speech recognition by direct analysis of the speech wave , 1966 .

[49]  Steve Young,et al.  Parallel model combination for speech recognition in noise , 1993 .

[50]  Geoffrey Zweig,et al.  Speech Recognition with Dynamic Bayesian Networks , 1998, AAAI/IAAI.

[51]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[52]  Aaron E. Rosenberg,et al.  Speaker-independent recognition of isolated words using clustering techniques , 1979 .

[53]  Patrick L. Combettes,et al.  Methods for digital restoration of signals degraded by a stochastic impulse response , 1989, IEEE Trans. Acoust. Speech Signal Process..

[54]  K. Davis,et al.  Automatic Recognition of Spoken Digits , 1952 .

[55]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[56]  Hsiao-Wuen Hon,et al.  An overview of the SPHINX speech recognition system , 1990, IEEE Trans. Acoust. Speech Signal Process..

[57]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .