An Empirical Study on Users' Acceptance of Speech Recognition Errors in Text-Messaging

Although speech recognition technology and voice synthesis systems have become readily available, recognition accuracy remain a serious problem in the design and implementation of voice-based user interfaces. Error correction becomes particularly difficult on mobile devices due to the limited system resources and constrained input methods. This research is aimed to investigate users' acceptance of speech recognition errors in mobile text messaging. Our results show that even though the audio presentation of the text messages does help users understand the speech recognition errors, users indicate low satisfaction when sending or receiving text messages with errors. Specifically, senders show significantly lower acceptance than the receivers due to the concerns of follow-up clarifications and the reflection of the sender's personality. We also find that different types of recognition errors greatly affect users' overall acceptance of the received message.

[1]  Sharon L. Oviatt,et al.  Error resolution during multimodal human-computer interaction , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[2]  N. Bell,et al.  Gestalt imagery: A critical factor in language comprehension , 1991, Annals of dyslexia.

[3]  G. V. van Orden A ROWS is a ROSE: spelling, sound, and reading. , 1987, Memory & cognition.

[4]  Henry Lieberman,et al.  How to wreck a nice beach you sing calm incense , 2005, IUI.

[5]  Sabine Deligne,et al.  Pervasive Speech Recognition , 2004, IEEE Pervasive Comput..

[6]  Kevin Larson,et al.  Speech Error Correction: The Story of the Alternates List , 2003, Int. J. Speech Technol..

[7]  Alexander H. Waibel,et al.  Multimodal error correction for speech user interfaces , 2001, TCHI.

[8]  I. Scott MacKenzie,et al.  Text Entry for Mobile Computing: Models and Methods,Theory and Practice , 2002, Hum. Comput. Interact..

[9]  R Frost,et al.  Phonetic recoding of phonologically ambiguous printed words. , 1993, Journal of experimental psychology. Learning, memory, and cognition.

[10]  G. C. Orden A ROWS is a ROSE: Spelling, sound, and reading , 1987 .