Paper information - Construction and Analysis of a Multi-Layered In-car Spoken Dialogue Corpus

In this chapter, we discuss the construction of a multi-layered in-car spoken dialogue corpus and the preliminary results of its analysis. We have developed a recording system, built into a Data Collection Vehicle (DCV), that supports synchronous recording of multi-channel audio data from 16 microphones that can be placed in flexible positions, multi-channel video data from 3 cameras, and vehicle-related data. Multimedia data have been collected from 800 subjects, each of whom took part in three sessions of spoken dialogue with different types of navigator during a drive of about 60 minutes. We have defined the Layered Intention Tag to analyze the dialogue structure at the level of each speech unit, and have applied the tag to all of the dialogues, covering over 35,000 speech units. Using a dialogue sequence viewer that we have developed, we can analyze the basic dialogue strategy of the human navigator. We also report a preliminary analysis of the relation between intentions and linguistic phenomena.
