CoWME: a general framework to evaluate cognitive workload during multimodal interaction

Evaluating human machine interaction in the case of multimodal systems is often a difficult task involving the monitoring of multiple sources, data fusion and results interpretation. While subtasks are highly dependent on the specific goal of the application and on the available interaction modalities, it is possible to formalize this workflow into a standard process and to consider a generic measure to estimate the ease of use of a specific application. In this work, we present CoWME, a modular software architecture describing multimodal human machine interaction evaluation, from data collection to final evaluation, in a formal way, in terms of cognitive workload. Communication protocols between modules are described in XML while data fusion is delegated to a configurable rule engine. An interface module is introduced between the monitoring modules and the rule engine to collect and summarize data streams for cognitive workload evaluation. We present a deployment example showing how this architecture is deployed by monitoring an interactive session with an Android application taking into account stressed speech detection, mydriasis and touch analysis.

[1]  Otto Jespersen,et al.  Lehrbuch der Phonetik , 1904 .

[2]  Charles L. Forgy,et al.  Rete: a fast algorithm for the many pattern/many object pattern match problem , 1991 .

[3]  W. Levelt,et al.  Pupillary dilation as a measure of attention: a quantitative system analysis , 1993 .

[4]  S. Rixon English Phonetics and Phonology , 2003 .

[5]  S. Hart,et al.  Development of NASA-TLX (Task Load Index): Results of Empirical and Theoretical Research , 1988 .

[6]  D Kahneman,et al.  Pupil Diameter and Load on Memory , 1966, Science.

[7]  Björn W. Schuller,et al.  Recognising realistic emotions and affect in speech: State of the art and lessons learnt from the first challenge , 2011, Speech Commun..

[8]  Denis Lalanne,et al.  Benchmarking fusion engines of multimodal interactive systems , 2009, ICMI-MLMI '09.

[9]  Sebastian Möller,et al.  Evaluating multimodal systems: a comparison of established questionnaires and interaction parameters , 2010, NordiCHI.

[10]  Deborah J. Mayhew Keystroke Level Modeling as a Cost Justification Tool , 2005 .

[11]  Yang Wang,et al.  Multimodal behavior and interaction as indicators of cognitive load , 2012, TIIS.

[12]  David House Differential perception of tonal contours through the syllable , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[13]  Francesco Cutugno,et al.  A syllable segmentation algorithm for English and italian , 2003, INTERSPEECH.

[14]  Antonio Origlia,et al.  Prosomarker: a prosodic analysis tool based on optimal pitch stylization and automatic syllabi fication , 2012, LREC.

[15]  J. Beatty,et al.  Pupillometric signs of brain activation vary with level of cognitive processing. , 1978, Science.

[16]  J. Pospí The Human Iris Structure and Its Usages , 2000 .

[17]  Antonio Origlia,et al.  A dynamic tonal perception model for optimal pitch stylization , 2013, Comput. Speech Lang..

[18]  S. Klein,et al.  Pupil dilation during visual target detection. , 2010, Journal of vision.

[19]  Christophe d'Alessandro,et al.  Automatic pitch contour stylization using a model of tonal perception , 1995, Comput. Speech Lang..

[20]  Xiaolu Dong,et al.  Mental workload measurement for emergency operating procedures in digital nuclear power plants , 2013, Ergonomics.

[21]  Brian P. Bailey,et al.  Categories & Subject Descriptors: H.5.2 [Information , 2022 .

[22]  D. House Tonal perception in speech , 1990 .

[23]  P. Chandler,et al.  Cognitive load as a factor in the structuring of technical material. , 1990 .

[24]  Peter Roach English Phonetics and Phonology:A Practical Course , 1983 .

[25]  Jan Stelovsky,et al.  Measuring cognitive load with EventStream software framework , 2003, 36th Annual Hawaii International Conference on System Sciences, 2003. Proceedings of the.

[26]  Sharon L. Oviatt,et al.  Human-centered design meets cognitive load theory: designing interfaces that help people think , 2006, MM '06.

[27]  David House Perception of prepausal tonal contours: implications for automatic stylization of intonation , 1995, EUROSPEECH.