We describe new consumer services based on speech processing technologies to support a new digital/mobile era of ubiquitous communication. First, we propose, a compact and noise robust embedded speech recognition middleware implemented on microprocessors focused on sophisticated HMIs (human machine interfaces) for car information systems (i.e. car telematics). Second, we report on a novel and sophisticated dialog management/manager (DM) system, based on VoiceXML (voice extensible markup language), called CAMMIA (conversational agent for multimedia mobile information access). The proposed DM handles two important issues: an automatic generation scheme for lexicons and grammars, and an effective combination/merger between automatic speech recognition (ASR) and natural language processing (NLP). The new DM scheme has been evaluated for an application of the car telematics service task after integration with ASR and a VoiceXML interpreter (VXI).
[1]
Teruko Mitamura,et al.
The KANTOO Machine Translation Environment
,
2000,
AMTA.
[2]
Teruko Mitamura,et al.
DialogXML: extending VoiceXML for dynamic dialog management
,
2002
.
[3]
Yasunari Obuchi,et al.
Compact and robust speech recognition for embedded use on microprocessors
,
2002,
2002 IEEE Workshop on Multimedia Signal Processing..
[4]
Bob Carpenter,et al.
A portable, server-side dialog framework for voiceXML
,
2002,
INTERSPEECH.
[5]
Yasunari Obuchi,et al.
Robust Dialog Management Architecture Using VoiceXML for Car Telematics Systems
,
2005
.
[6]
Yasunari Obuchi,et al.
Development of robust speech recognition middleware on microprocessor
,
1998,
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).