Paper information - Construction of an advanced in-car spoken dialogue corpus and its characteristic analysis

This paper describes an advanced spoken language corpus constructed by enhancing an in-car speech database. The corpus has the following characteristic features: (1) Advanced tags: in addition to tags for linguistic phenomena, advanced discourse tags such as sentential structures and utterance intentions have been provided for the transcribed texts. (2) Large scale: sentential structures and intentions are currently provided for 45,053 phrases and 35,421 utterance units, respectively. (3) Multi-layer: the corpus consists of different levels of spoken language data, such as speech signals, transcribed texts, sentential structures, intention markers, and dialogue structures, and these layers are linked to one another. The multi-layered corpus thus supports a wide variety of analyses of spontaneous spoken dialogue. This paper also reports the results of an investigation of the corpus, focusing in particular on the relation between the syntactic style and the intentional style of spoken utterances.
