Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research

The Center for Integrated Acoustic Information Research (CIAIR) at Nagoya University has been collecting various kinds of speech corpora for both acoustic modeling and speech modeling. The corpora include multimedia data collected in a moving-car environment, children's voices recorded during video game play, room acoustics measured at multiple points, head-related transfer functions of multiple subjects, and simultaneous interpretation of speech between English and Japanese. This paper introduces these multi-dimensional data acquisition activities at CIAIR and gives basic information on the collected databases.
