论文信息 - CENSREC-3: Data Collection for In-Car Speech Recognition and Its Common Evaluation Framework

CENSREC-3: Data Collection for In-Car Speech Recognition and Its Common Evaluation Framework

This paper introduces a common database, an evaluation framework, and its baseline recognition results for in-car speech recognition, CENSREC-3, as an outcome of IPSJ-SIG SLP Noisy Speech Recognition Evaluation Working Group. CENSREC-3 which is a sequel of AURORA-2J is designed as the evaluation framework of isolated word recognition in real driving car environments. Speech data was collected using 2 microphones, a close-talking microphone and a hands-free microphone, under carefully controlled 16 different driving conditions, i.e., combinations of 3 car speeds and 5 car conditions. CENSREC-3 provides 6 evaluation environments which are designed using speech data collected in these car conditions.

[1] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.

[2] Kazuya Takeda,et al. Construction and Evaluation of a Large In-Car Speech Corpus , 2005, IEICE Trans. Inf. Syst..

[3] David Pearce. Developing the ETSI Aurora advanced distributed speech recognition front-end and what next? , 2001, IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..

[4] Kazuya Takeda,et al. CIAIR In-Car Speech Corpus - Influence of Driving Status , 2005, IEICE Trans. Inf. Syst..

[5] Satoshi Nakamura,et al. AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition , 2005, IEICE Trans. Inf. Syst..

[6] R. G. Leonard,et al. A database for speaker-independent digit recognition , 1984, ICASSP.