论文信息 - Massively Parallel Parsing in ΦDmDialog: Integrated Architecture for Parsing Speech Inputs

Massively Parallel Parsing in ΦDmDialog: Integrated Architecture for Parsing Speech Inputs

This paper describes the parsing scheme in the 𝛷DmDialog speech-to-speech dialog translation system, with special emphasis on the integration of speech and natural language processing. We propose an integrated architecture for parsing speech inputs based on a parallel marker-passing scheme and attaining dynamic participation of knowledge from the phonological-level to the discourse-level. At the phonological level, we employ a stochastic model using a transition matrix and a confusion matrix and markers which carry a probability measure. At a higher level, syntactic/semantic and discourse processing, we integrate a case-based and constraint-based scheme in a consistent manner so that a priori probability and constraints, which reflect linguistic and discourse factors, are provided to the phonological level of processing. A probability/cost-based scheme in our model enables ambiguity resolution at various levels using one uniform principle.

Hiroaki Kitano | Teruko Mitamura | Masaru Tomita

[1] James F. Allen,et al. A Plan Recognition Model for Subdialogues in Conversations , 1987, Cogn. Sci..

[2] Yen-Lu Chow Salim RoJor. SPEECH UNDERSTANDING USING A UNIFICATION GRAMMAR , 1989 .

[3] Hiroaki Kitano,et al. Multilingual Information Retrieval Mechanism Using VLSI. Requirements and Approaches for Information Retrieval Systems in the Computer-Aided Software Engineering and Document Processing Environment , 1988, RIAO.

[4] James F. Allen,et al. A Plan Recognition Model for Subdialogues in Conversations , 1987, Cogn. Sci..

[5] Eduard Hovy,et al. Generating Natural Language Under Pragmatic Constraints , 1988 .

[6] Masaru Tomita,et al. Parsing noisy sentences , 1988, COLING.

[7] Kenji Kita,et al. HMM continuous speech recognition using predictive LR parsing , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[8] Hideto Tomabechi,et al. Direct Memory Access Translation for Speech Input , 1988, FGCS.

[9] Satoru Fujii,et al. Large vocabulary speaker-independent Japanese speech recognition system , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10] Wayne H. Ward,et al. Layering Predictions: Flexible Use of Dialog Expectation in Speech Recognition , 1989, IJCAI.

[11] Andrew J. Viterbi,et al. Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[12] Hiroaki Kitano,et al. A massively parallel model of speech-to-speech dialog translation: a step toward interpreting telephony , 1989, EUROSPEECH.

[13] C SchankRoger,et al. Dynamic Memory: A Theory of Reminding and Learning in Computers and People , 1983 .

[14] Bonnie Webber,et al. So what can we talk about now , 1986 .