The simulation of realistic acoustic input scenarios for speech recognition systems
暂无分享,去创建一个
A tool for simulating the acoustic conditions during the speech input to a recognition system and the transmission in telephone networks is presented in this paper. The simulation covers the hands-free speech input in rooms and the existence of noise in the background. Furthermore the presence of telephone frequency characteristics can be simulated. Finally the transmission in a cellular telephone system like GSM or UMTS is covered including the encoding and decoding of speech and the transmission over the erroneous radio channel. The tool has been realized by integrating functions from the ITU software library for implementing telephone frequency characteristics and the estimation of the speech level as well as software modules from ETSI and 3GPP for the AMR encoding and decoding of speech. A Web interface has been designed to experience the simulation tool with acoustic examples.
[1] David Pearce,et al. The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions , 2000, INTERSPEECH.
[2] Pedro Novo,et al. IKA-SIM: A System to Generate Auditory Virtual Environments , 2004 .
[3] Jean-Marc Jot,et al. An analysis/synthesis approach to real-time artificial reverberation , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.
[4] James A. Moorer,et al. About This Reverberation Business , 1978 .