Evaluation of speech-based HMI concepts for information exchange tasks: A driving simulator study

HighlightsWe compare different speech-based in-car HMI concepts in a driving simulator study.The HMI concepts are evaluated in terms of usability and driver distraction.The comparison of speech dialog strategies revealed only differences in usability.The use of a GUI impaired the driving performance and raised gaze-based distraction.An avatar does not additionally raise driver distraction but is not accepted by users. Due to the mobile Internet revolution, people tend to browse the Web while driving their car which puts the driver's safety at risk. Therefore, an intuitive and non-distractive in-car speech interface to the Web needs to be developed. Before developing a new speech dialog system (SDS) in a new domain developers have to examine the user's preferred interaction style and its influence on driving safety.This paper reports a driving simulation study, which was conducted to compare different speech-based in-car human-machine interface concepts concerning usability and driver distraction. The applied SDS prototypes were developed to perform an online hotel booking by speech while driving. The speech dialog prototypes were based on different speech dialog strategies: a command-based and a conversational dialog. Different graphical user interface (GUI) concepts (one including a human-like avatar) were designed in order to support the respective dialog strategy the most and to evaluate the effect of the GUI on usability and driver distraction.The results show that only few differences concerning speech dialog quality were found when comparing the speech dialog strategies. The command-based dialog was slightly better accepted than the conversational dialog, which seems to be due to the high concept error rate of the conversational dialog. A SDS without GUI also seems to be feasible for the driving environment and was accepted by the users. The comparison of speech dialog strategies did not reveal differences in driver distraction. However, the use of a GUI impaired the driving performance and increased gaze-based distraction. The presence of an avatar was not appreciated by participants and did not affect the dialog performance. Concerning driver distraction, the virtual agent did neither negatively affect the driving performance nor increase visual distraction.The results implicate that in-car SDS developers should take both speaking styles into consideration when designing an SDS for information exchange tasks. Furthermore, developers have to consider reducing the content presented on the screen in order to reduce driver distraction. A human-like avatar was not appreciated by users while driving. Research should further investigate if other kinds of avatars might achieve different results.

[1]  Jurek Kirakowski,et al.  The Software Usability Measurement Inventory: Background and Usage , 1996 .

[2]  Hansjörg Hofmann,et al.  Speech Interaction with the Internet -- A User Study , 2012, 2012 Eighth International Conference on Intelligent Environments.

[3]  Ute Ehrlich Task hierarchies representing sub-dialogs in speech dialog systems , 1999, EUROSPEECH.

[4]  Michael F. McTear,et al.  Book Review , 2005, Computational Linguistics.

[5]  David L. Thomson,et al.  User Confusion in Natural Language Services , 2000 .

[6]  Masahiro Araki,et al.  Spoken, Multilingual and Multimodal Dialogue Systems: Development and Assessment , 2005 .

[7]  Michael Burmester,et al.  AttrakDiff: Ein Fragebogen zur Messung wahrgenommener hedonischer und pragmatischer Qualität , 2003, MuC.

[8]  Paul Green,et al.  Safety and Usability of Speech Interfaces for In-Vehicle Tasks while Driving: A Brief Literature Review , 2006 .

[9]  Robert Graham,et al.  Subjective assessment of speech-system interface usability , 2001, INTERSPEECH.

[10]  Marilyn A. Walker,et al.  Evaluating competing agent strategies for a voice email agent , 1997, EUROSPEECH.

[11]  Hélène Bonneau-Maynard,et al.  Evaluation of dialog strategies for a tourist information retrieval system , 1998, ICSLP.

[12]  Volker Steinbiss,et al.  The Philips automatic train timetable information system , 1995, Speech Commun..

[13]  Hansjörg Hofmann,et al.  Development of a Conversational Speech Interface Using Linguistic Grammars , 2013 .

[14]  Lou Boves,et al.  Issues in Spoken Dialogue Systems: Experiences with the Dutch Arise System , 2000 .

[15]  A. Pauzie,et al.  A method to assess the driver mental workload: The driving activity load index (DALI) , 2008 .

[16]  Yu Zhang,et al.  The Effects of Visual and Cognitive Distraction on Driver Situation Awareness , 2011, HCI.

[17]  Michael F. McTear,et al.  Spoken Dialogue Technology , 2004, Springer London.

[18]  Markus M. Berg Natürlichsprachlichkeit in Dialogsystemen , 2012, Informatik-Spektrum.

[19]  Nemanja Memarovic,et al.  Glancing at personal navigation devices can affect driving: experimental results and design implications , 2009, AutomotiveUI.

[20]  Julia Niemann,et al.  Designing Speech Output for In-car Infotainment Applications Based on a Cognitive Model of Attention Allocation , 2013 .

[21]  William J Horrey,et al.  Modeling drivers' visual attention allocation while interacting with in-vehicle technologies. , 2006, Journal of experimental psychology. Applied.

[22]  Hansjörg Hofmann,et al.  Comparison of speech-based in-car HMI concepts in a driving simulation study , 2014, IUI.