Analysis of drivers' speech in a car environment
To accelerate the adoption of speech recognition systems by the public, understanding the characteristics of speech in real environments is one of the most important issues. This paper reports variations in speech characteristics in a car environment. Analyzing speech characteristics in a specific environment requires a corpus recorded carefully, so that the utterances and recording conditions are the same for the whole set of speakers. We created a new corpus named “Drivers’ Japanese Speech Corpus in a Car Environment (DJS-C)”, composed of utterances of words useful for operating in-vehicle information appliances. Analysis of the DJS-C corpus shows that speech characteristics differ among drivers and change with driving conditions. Quantitative analysis and speech recognition experiments show that recognition performance degrades due to Distance between Phonemes, Uniqueness of Speaker’s Voice, and SNNR.