论文信息 - Preferred modalities in dialogue systems

Preferred modalities in dialogue systems

This research describes which modalities are preferred in particular contexts when interacting with a multi-modal dialogue system. The trade-off between three factors is investigated: (i) speech recognition performance, (ii) efficiency of input modality and (iii) the system’s output modality. Four versions were developed of a multimodal examinator to be used in elementary school. The versions differed in recognition performance (‘perfect’ vs. realistic) and output modality (speech or text). In all systems, subjects could provide input via speaking or typing. Answer length in characters was used as a measure of efficiency. Results show that both speech recognition performance and efficiency have a strong impact on preferred modalities. No effect was found of the system’s output modality.

[1] M. F. Weegels. Users' (mis)conceptions of a voice-operated train travel information service , 1998 .

[2] Christopher Baber,et al. Speech technology in control room systems : a human factors perspective , 1991 .

[3] Emiel Krahmer,et al. The dual of denial: Two uses of disconfirmations in dialogue and their prosodic correlates , 2002, Speech Commun..

[4] Julia Hirschberg,et al. Corrections in spoken dialogue systems , 2000, INTERSPEECH.

[5] Robert E. Kraut,et al. Expressive richness: a comparison of speech and text as media for revision , 1991, CHI.

[6] B. Shneiderman. Designing the User Interface (3rd Ed.) , 1998 .

[7] Wendy Wood,et al. Inferred sex differences in status as a determinant of gender stereotypes about social influence. , 1982 .

[8] Clifford Nass,et al. The media equation - how people treat computers, television, and new media like real people and places , 1996 .

[9] Sharon L. Oviatt,et al. Integration themes in multimodal human-computer interaction , 1994, ICSLP.

[10] Philip R. Cohen,et al. MULTIMODAL INTERFACES THAT PROCESS WHAT COMES NATURALLY , 2000 .

[11] S Oviatt,et al. Linguistic Adaptations During Spoken and Multimodal Error Resolution , 1998, Language and speech.

[12] Ben Shneiderman,et al. Designing The User Interface , 2013 .

[13] George Kingsley Zipf,et al. Human behavior and the principle of least effort , 1949 .