A Study of the Feasibility and Effectiveness of Dual-Modal Information Presentations

Multimodal interfaces with both visual and auditory output are becoming important, especially for applications using small-screen displays and for user access under mobile conditions. The research presented here investigated the feasibility of simultaneously presenting distinct textual information through both visual and auditory channels by examining two multimodal interfaces with irrelevant or relevant auditory information. These interfaces were intended to study two problems: (a) Can users attend to and process additional information delivered through the auditory channel during a typical Web-browsing process, and (b) what are the effects of information overlap between the visual and auditory channels? Controlled experiments were conducted to evaluate these two questions. The findings suggest that users can attend to auditory information while visually browsing textual information and that information overlap may reduce distraction. These findings have implications for the design of multimodal interfaces for small-screen mobile applications.

[1]  Mark S. Sanders,et al.  Human Factors in Engineering and Design , 2016 .

[2]  J. Deutsch Perception and Communication , 1958, Nature.

[3]  Robert H. McKim,et al.  Human engineering guide to equipment design , 1963 .

[4]  A. Treisman VERBAL CUES, LANGUAGE, AND MEANING IN SELECTIVE ATTENTION. , 1964, The American journal of psychology.

[5]  D. Kahneman,et al.  Attention and Effort , 1973 .

[6]  Daniel Gopher,et al.  On the Economy of the Human Processing System: A Model of Multiple Capacity. , 1977 .

[7]  Christopher D. Wickens,et al.  The Structure of Attentional Resources , 1980 .

[8]  Gwen Nugent,et al.  Pictures, audio, and print: symbolic representation and effect on learning , 1982 .

[9]  A. Ehrenfeucht,et al.  Encoding and retaining information in the visuals and verbals of an educational movie , 1983 .

[10]  Lynn A. Streeter,et al.  Chapter 15 – Applying Speech Synthesis to User Interfaces , 1988 .

[11]  Fred D. Davis Perceived Usefulness, Perceived Ease of Use, and User Acceptance of Information Technology , 1989, MIS Q..

[12]  Robert E. Kraut,et al.  Expressive richness: a comparison of speech and text as media for revision , 1991, CHI.

[13]  Philip R. Cohen The role of natural language in a multimodal interface , 1992, UIST '92.

[14]  William A. Wallace,et al.  The Effects on Decision Task Performance of Computer Synthetic Voice Output , 1992, Int. J. Man Mach. Stud..

[15]  Janice C. Sipior,et al.  Merging expert systems with multimedia technology , 1992, DATB.

[16]  Robert W. Proctor,et al.  Human factors in simple and complex systems , 1993 .

[17]  R. Schlosser,et al.  The effects of speech output technology in the learning of graphic symbols. , 1995, Journal of applied behavior analysis.

[18]  J. Sweller,et al.  Reducing cognitive load by mixing auditory and visual presentation modes , 1995 .

[19]  Yufei Yuan,et al.  Investigation of Voice and Text Output Modes with Abstraction in a Computer Interface , 1996, Interact. Comput..

[20]  Murray F Spiegel,et al.  Applying Speech Synthesis to User Interfaces , 1997 .

[21]  Shuang Xu,et al.  Usability for Mobile Commerce Across Multiple Form Factors , 2002, J. Electron. Commer. Res..

[22]  M. D’Esposito Working memory. , 2008, Handbook of clinical neurology.