Paper information - Speech interface VLSI for car applications

Speech interface VLSI for car applications

A user-friendly speech interface for car applications is much needed for safety reasons. This paper describes a speech interface VLSI designed for car environments that provides speech recognition and speech compression/decompression. The chip has a heterogeneous architecture composed of an ADC/DAC, a DSP, a RISC processor, hard-wired logic and peripheral circuits. The DSP performs acoustic analysis and the output probability calculation of HMMs for speech recognition, and also handles speech compression/decompression. The RISC processor serves as the CPU for the whole chip and, with the aid of the hard-wired logic, as the Viterbi decoder. An algorithm is proposed that seamlessly recognizes a mixed vocabulary of speaker-independent fixed words and speaker-dependent user-defined words. It is based on acoustic event HMMs, which enable a template to be created from a single sample utterance. The proposed algorithm, as embedded in the chip, is evaluated, and promising results for multiple languages are shown.
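The abstract splits recognition into two stages: per-frame HMM output probabilities, followed by Viterbi decoding over them. The following is a minimal Python sketch of the second stage only, for illustration. It is not the chip's implementation; the function name `viterbi_log`, its arguments and the table layout are hypothetical, and the output probabilities are assumed to arrive already computed as log values.

```python
import math

def viterbi_log(log_init, log_trans, log_emit):
    """Return (best_log_score, best_state_path) for a single HMM.

    Hypothetical layout, not taken from the paper:
      log_init[s]     : log P(start in state s)
      log_trans[p][s] : log P(state s | previous state p)
      log_emit[t][s]  : log output probability of frame t in state s
    """
    n_states = len(log_init)
    # Scores after the first frame.
    score = [log_init[s] + log_emit[0][s] for s in range(n_states)]
    backptr = []  # backptr[t-1][s] = best predecessor of state s at frame t
    for t in range(1, len(log_emit)):
        new_score, ptr = [], []
        for s in range(n_states):
            # Pick the predecessor that maximizes the path score into s.
            best_prev = max(range(n_states),
                            key=lambda p: score[p] + log_trans[p][s])
            ptr.append(best_prev)
            new_score.append(score[best_prev] + log_trans[best_prev][s]
                             + log_emit[t][s])
        score = new_score
        backptr.append(ptr)
    # Trace back from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    best = score[state]
    path = [state]
    for ptr in reversed(backptr):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return best, path
```

Working in the log domain turns products of probabilities into sums, which avoids underflow on long utterances; forbidden transitions can be represented as `float("-inf")`.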

Makoto Shozakai
