Multimodal system evaluation using modality efficiency and synergy metrics

In this paper, we propose two new objective metrics, relative modality efficiency and multimodal synergy, that can provide valuable information and identify usability problems during the evaluation of multimodal systems. Relative modality efficiency (when compared with modality usage) can identify suboptimal use of modalities due to poor interface design or information asymmetries. Multimodal synergy measures the added value from efficiently combining multiple input modalities, and can be used as a single measure of the quality of modality fusion and fission in a multimodal system. The proposed metrics are used to evaluate two multimodal systems that combine pen/speech and mouse/keyboard modalities respectively. The results provide much insight into multimodal interface usability issues, and demonstrate how multimodal systems should adapt to maximize modalities synergy resulting in efficient, natural, and intelligent multimodal interfaces.

[1]  P. John Statistical Design and Analysis of Experiments , 1971 .

[2]  Emiel Krahmer,et al.  Preferred modalities in dialogue systems , 2000, INTERSPEECH.

[3]  Sharon L. Oviatt,et al.  The efficiency of multimodal interaction: a case study , 1998, ICSLP.

[4]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[5]  James A. Larson,et al.  Guidelines for multimodal user interface design , 2004, CACM.

[6]  Jerome L. Myers,et al.  Research Design and Statistical Analysis , 1991 .

[7]  Alexandros Potamianos,et al.  A Study in Efficiency and Modality Usage in Multimodal Form Filling Systems , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[8]  Lars Bo Larsen,et al.  Issues in the evaluation of spoken dialogue systems using objective and subjective measures , 2003, 2003 IEEE Workshop on Automatic Speech Recognition and Understanding (IEEE Cat. No.03EX721).

[9]  Shimei Pan,et al.  Designing and Evaluating an Adaptive Spoken Dialogue System , 2002, User Modeling and User-Adapted Interaction.

[10]  K. Á. T.,et al.  Towards a tool for the Subjective Assessment of Speech System Interfaces (SASSI) , 2000, Natural Language Engineering.

[11]  David S. Ebert,et al.  The integrality of speech in multimodal interfaces , 1998, TCHI.

[13]  Nicole Yankelovich,et al.  Conversational speech interfaces , 2002 .

[14]  Marilyn A. Walker,et al.  Towards developing general models of usability with PARADISE , 2000, Natural Language Engineering.