New Concept Service for the Mobile Era Using Speech Technologies

In this paper, we describe new concept services based on speech processing technologies for the new digital/mobile era called a ubiquitous society. First, we propose a compact and noise robust embedded speech recognition middleware implemented on microprocessors aiming for sophisticated HMIs (Human Machine Interfaces) of car information systems. The compactness is essential for embedded systems because there are strict restrictions of CPU (Central Processing Unit) power and available memory capacities. Second, we report a promising multi-language interpreter named “Mobilingual TM ” based on speech recognition. Our research activities aim to realize sophisticated and human-centered intelligent HMIs that will be absolutely necessary in the ubiquitous social environment. For the embedded speech middleware, we propose first, a noise robust and compact Spectral Subtraction(SS) method after exhausting evaluation stages using real speech data recorded at car running environments. Next, we propose very novel memory assignment of acoustic models based on the product codes or sub-vector quantization technique resulting on 1 fourth memory reduction for the 2000-word vocabulary.

[1]  Yasunari Obuchi,et al.  Development of robust speech recognition middleware on microprocessor , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[2]  Brian Kan-Wing Mak,et al.  Subspace distribution clustering for continuous observation density hidden Markov models , 1997, EUROSPEECH.

[3]  Yasunari Obuchi,et al.  Multilingual Speech Interpretation for Cellular Phones , 2001 .

[4]  Vassilios Digalakis,et al.  Efficient speech recognition using subvector quantization and discrete-mixture HMMS , 2000, Comput. Speech Lang..

[5]  Yifan Gong,et al.  Implementing a high accuracy speaker-independent continuous speech recognizer on a fixed-point DSP , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[6]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[7]  Kiyohiro Shikano,et al.  Recognition of noisy speech by composition of hidden Markov models , 1993, EUROSPEECH.

[8]  Yasunari Obuchi,et al.  Compact and robust speech recognition for embedded use on microprocessors , 2002, 2002 IEEE Workshop on Multimedia Signal Processing..