A hybrid computational model for spoken language understanding

This paper shows that the integration of statistical and connectionist methods can greatly enhance human-computer interaction through speech. The research approach is inspired by recent advances in high performance automatic speech recognition (ASR) systems and neurocognitive researches of natural language understanding (NLU). And a modest hybrid computational model is proposed and implemented to achieve intelligent spoken language understanding (SLU) in an information retrieval system.

[1]  L. R. Rabiner,et al.  Speech recognition: Statistical methods , 2006 .

[2]  Guo-Zheng Sun,et al.  A neural network approach to speech recognition , 1988, Neural Networks.

[3]  Dilek Z. Hakkani-Tür,et al.  Spoken language understanding , 2008, IEEE Signal Processing Magazine.

[4]  V.W. Zue,et al.  The use of speech knowledge in automatic speech recognition , 1985, Proceedings of the IEEE.

[5]  A. Friederici The Developmental Cognitive Neuroscience of Language: A New Research Domain , 2000, Brain and Language.

[6]  D.R. Reddy,et al.  Speech recognition by machine: A review , 1976, Proceedings of the IEEE.

[7]  R. Cole,et al.  Survey of the State of the Art in Human Language Technology , 2010 .

[8]  Kai-Fu Lee,et al.  On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[9]  Karin Humphreys,et al.  The psychology of language : from data to theory , 2001 .

[10]  Alfred Mertins,et al.  Automatic speech recognition and speech variability: A review , 2007, Speech Commun..

[11]  S. Furui,et al.  Automatic recognition and understanding of spoken language - a first step toward natural human-machine communication , 2000, Proceedings of the IEEE.

[12]  Richard P. Lippmann,et al.  Review of Neural Networks for Speech Recognition , 1989, Neural Computation.

[13]  A. Friederici Towards a neural basis of auditory sentence processing , 2002, Trends in Cognitive Sciences.

[14]  Alex Waibel,et al.  Readings in speech recognition , 1990 .

[15]  Mark Newson,et al.  乔姆斯基的普遍语法教程 = Chomsky's universal grammar : an introduction : , 1988 .

[16]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[17]  Chin-Hui Lee,et al.  Acoustic modeling for large vocabulary speech recognition , 1990 .

[18]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[19]  Dennis H. Klatt,et al.  Review of the ARPA speech understanding project , 1990 .

[20]  Hsiao-Wuen Hon,et al.  An overview of the SPHINX speech recognition system , 1990, IEEE Trans. Acoust. Speech Signal Process..

[21]  Michael Picheny,et al.  A method for the construction of acoustic Markov models for words , 1993, IEEE Trans. Speech Audio Process..

[22]  Sheryl R. YOUNG,et al.  Use of dialogue, pragmatics and sematics to enhance speech recognition , 1990, Speech Commun..

[23]  L. R. Rabiner,et al.  Recognition of isolated digits using hidden Markov models with continuous mixture densities , 1985, AT&T Technical Journal.

[24]  L. Rabiner,et al.  An introduction to hidden Markov models , 1986, IEEE ASSP Magazine.

[25]  Marco Gori,et al.  A survey of hybrid ANN/HMM models for automatic speech recognition , 2001, Neurocomputing.

[26]  Philippe Olivier Alexandre Navaux,et al.  A Connectionist Approach to Speech Understanding , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[27]  Yosef Grodzinsky,et al.  Neuroimaging of syntax and syntactic processing , 2006, Current Opinion in Neurobiology.

[28]  Richard Lippmann,et al.  Speech recognition by machines and humans , 1997, Speech Commun..