Speech interface VLSI for car applications

A user-friendly speech interface for car applications is highly needed for safety reasons. This paper describes a speech interface VLSI designed for car environments, with speech recognition and speech compression/decompression functions. The chip has a heterogeneous architecture composed of ADC/DAC, DSP, RISC, hard-wired logic and peripheral circuits. The DSP not only executes acoustic analysis and output probability calculation of HMMs for speech recognition, but also does speech compression/decompression. On the other hand, the RISC works as a CPU of the whole chip and Viterbi decoder with an aid of hard-wired logic. An algorithm to recognize a mixed vocabulary of speaker-independent fixed words and speaker-dependent user-defined words in a seamless way is proposed. It is based on acoustic event HMMs which enable a template creation from one sample utterance. The proposed algorithm embedded in the chip is evaluated. Promising results of the algorithm for multiple languages are shown.

[1]  Satoshi Nakamura,et al.  A non-iterative model-adaptive e-CMN/PMC approach for speech recognition in car environments , 1997, EUROSPEECH.

[2]  Thomas P. Barnwell,et al.  The multimodal multipulse excitation vocoder , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Kiyohiro Shikano,et al.  A speech enhancement approach E-CMN/CSS for speech recognition in car environments , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[4]  Emiel Krahmer,et al.  Robust spoken dialogue management for driver information systems , 1997, EUROSPEECH.

[5]  Bhuvana Ramabhadran,et al.  Acoustics-only based automatic phonetic baseform generation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[6]  Olli Viikki,et al.  A recursive feature vector normalization approach for robust speech recognition in noise , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[7]  Satoshi Nakamura,et al.  Robust speech recognition in car environments , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).