Massively Parallel Parsing in ΦDmDialog: Integrated Architecture for Parsing Speech Inputs

This paper describes the parsing scheme in the 𝛷DmDialog speech-to-speech dialog translation system, with special emphasis on the integration of speech and natural language processing. We propose an integrated architecture for parsing speech inputs based on a parallel marker-passing scheme and attaining dynamic participation of knowledge from the phonological-level to the discourse-level. At the phonological level, we employ a stochastic model using a transition matrix and a confusion matrix and markers which carry a probability measure. At a higher level, syntactic/semantic and discourse processing, we integrate a case-based and constraint-based scheme in a consistent manner so that a priori probability and constraints, which reflect linguistic and discourse factors, are provided to the phonological level of processing. A probability/cost-based scheme in our model enables ambiguity resolution at various levels using one uniform principle.

[1]  James F. Allen,et al.  A Plan Recognition Model for Subdialogues in Conversations , 1987, Cogn. Sci..

[2]  Yen-Lu Chow Salim RoJor SPEECH UNDERSTANDING USING A UNIFICATION GRAMMAR , 1989 .

[3]  Hiroaki Kitano,et al.  Multilingual Information Retrieval Mechanism Using VLSI. Requirements and Approaches for Information Retrieval Systems in the Computer-Aided Software Engineering and Document Processing Environment , 1988, RIAO.

[4]  James F. Allen,et al.  A Plan Recognition Model for Subdialogues in Conversations , 1987, Cogn. Sci..

[5]  Eduard Hovy,et al.  Generating Natural Language Under Pragmatic Constraints , 1988 .

[6]  Masaru Tomita,et al.  Parsing noisy sentences , 1988, COLING.

[7]  Kenji Kita,et al.  HMM continuous speech recognition using predictive LR parsing , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[8]  Hideto Tomabechi,et al.  Direct Memory Access Translation for Speech Input , 1988, FGCS.

[9]  Satoru Fujii,et al.  Large vocabulary speaker-independent Japanese speech recognition system , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Wayne H. Ward,et al.  Layering Predictions: Flexible Use of Dialog Expectation in Speech Recognition , 1989, IJCAI.

[11]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[12]  Hiroaki Kitano,et al.  A massively parallel model of speech-to-speech dialog translation: a step toward interpreting telephony , 1989, EUROSPEECH.

[13]  C SchankRoger,et al.  Dynamic Memory: A Theory of Reminding and Learning in Computers and People , 1983 .

[14]  Bonnie Webber,et al.  So what can we talk about now , 1986 .