Automobiles MULTIMODAL CONVERSATIONAL SYSTEMS FOR C urrently available in-vehicle speech recognition systems are designed around a single-utterance-command paradigm [2], with as many as 200 commands 1 that must be learned or referenced in a manual—an unpractical option while driving. The combination of a flexible dialogue-based speech system with a visual and haptic touch screen, while still an area of active research [1, 3], provides the opportunity for an intuitive and effective multimodal interface for vehicles. SpeechWorks 2 and Ford designed and realized a prototype targeted at relaxing the limitations of the current systems by adopting a conversational speech interface coupled with a touch-screen display. The system, which controls vehicle functions such as climate, telephone, navigation, MP3 player, and personalization, was installed in the Ford Model U concept vehicle and first shown at the 2003 North Ameri-can International Auto Show in Detroit. Figure 1 shows an image of the actual car interior where the touch-screen interface is visible on the dashboard. Figure 2 shows a depiction of the GUI in one of its various configurations. The multimodality of the system allows users to adapt to their environment , for example, interacting through the GUI when the car is stopped at a light versus when the car is moving. In addition, we designed the two modalities to complement
[1]
Paul Heisterkamp.
Linguatronic: Product-Level Speech System for Mercedes-Benz Car
,
2001,
HLT.
[2]
Bob Carpenter,et al.
ETUDE, a recursive dialog manager with embedded user interface patterns
,
2001,
IEEE Workshop on Automatic Speech Recognition and Understanding, 2001. ASRU '01..
[3]
Eric A. Woudenberg,et al.
Multi-modal Spoken Dialog with Wireless Devices
,
2002
.
[4]
Roberto Pieraccini,et al.
A multimodal conversational interface for a concept vehicle
,
2004,
INTERSPEECH.
[5]
Niels Ole Bernsen,et al.
A Multimodal Virtual Co-driver's Problems with the Driver
,
2002
.