Paper Information - An In-car Chinese Noise Corpus for Speech Recognition

An In-car Chinese Noise Corpus for Speech Recognition

In this paper, we present an in-car Chinese noise corpus that can be used to simulate complex in-car environments for robust speech recognition research and experiments. The corpus was collected in mainland China in 2009 and 2010. It covers a diverse set of driving conditions, including different car speeds, open and closed windows, weather conditions, and surrounding environments. In particular, rumble strips are also included because of the characteristic noise generated when a car passes over them. To support efficient use of the corpus, we performed acoustic signal analyses on the noise data, focusing mainly on stationarity and on the energy distribution in the frequency domain. We also performed ASR experiments with selected noise data from the corpus, adding the noise to clean speech to simulate the in-car environment. The corpus is the first in-car Chinese noise corpus of its kind, providing abundant and diverse samples for speech recognition tasks in car noise.

Yi Liu | Chao Zhang | Jue Hou | Shilei Huang
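
The abstract mentions adding noise data to clean speech to simulate the in-car environment. The abstract does not give the mixing procedure, so the following is only a minimal sketch of one common approach: scaling the noise so that the mixture reaches a target signal-to-noise ratio (SNR). The function names `mix_at_snr` and `measured_snr_db`, the use of plain Python lists of samples, and the choice of SNR as the control parameter are all assumptions made for illustration, not details taken from the paper.

```python
import math
import random


def mix_at_snr(speech, noise, snr_db):
    """Add noise to speech, scaled so the mixture has the requested SNR in dB.

    Hypothetical helper: the paper's actual mixing procedure is not described
    in the abstract.
    """
    if len(noise) < len(speech):
        raise ValueError("noise must be at least as long as speech")
    noise = noise[:len(speech)]  # use only as much noise as there is speech

    speech_power = sum(s * s for s in speech) / len(speech)
    noise_power = sum(n * n for n in noise) / len(noise)
    if speech_power == 0 or noise_power == 0:
        raise ValueError("speech and noise must both have non-zero power")

    # SNR_dB = 10 * log10(Ps / (g^2 * Pn))  =>  g = sqrt(Ps / (Pn * 10^(SNR_dB / 10)))
    gain = math.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return [s + gain * n for s, n in zip(speech, noise)]


def measured_snr_db(speech, mixture):
    """Recover the SNR of a mixture, given the clean speech it was built from."""
    residual = [m - s for s, m in zip(speech, mixture)]
    speech_power = sum(s * s for s in speech) / len(speech)
    residual_power = sum(r * r for r in residual) / len(residual)
    return 10 * math.log10(speech_power / residual_power)


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-ins for real recordings: a 440 Hz tone as "speech", Gaussian noise as "car noise".
    speech = [math.sin(2 * math.pi * 440 * t / 16000) for t in range(1600)]
    noise = [rng.gauss(0.0, 1.0) for _ in range(2000)]
    noisy = mix_at_snr(speech, noise, snr_db=10.0)
    print(round(measured_snr_db(speech, noisy), 3))
```

In practice the synthetic tone and Gaussian noise would be replaced by clean utterances and noise segments read from audio files, and the same noise segment could be mixed at several SNR levels to build test sets of graded difficulty.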
