Tlk or txt? Using voice input for SMS composition

This paper reports a series of investigations, which aim to test the appropriateness of voice recognition as an interaction method for mobile phone use. First, a KLM model was used in order to compare the speed of using voice recognition against using multi-tap and predictive text (the two most common methods of text entry) to interact with the phone menus and compose a text message. The results showed that speech is faster than the other two methods and that a combination of input methods provides the quickest task completion times. The first experiment used a controlled message creation task to validate the KLM predictions. This experiment also confirmed that the result was not due to a speed/accuracy trade off and that participants preferred to use the combination of input methods rather than a single method for menu interaction and text composition. The second experiment investigated the effect of limited visual feedback (when walking down the road or driving a car for example) on interaction, providing further evidence in support of speech as a useful input method. These experiments not only indicate the usefulness of voice in SMS input but also that users could also be satisfied with voice input in hands-busy, eyes-busy situations.

[1]  Ron Van Buskirk,et al.  A comparison of speech and mouse/keyboard GUI navigation , 1995, CHI '95.

[2]  Alexander I. Rudnicky Mode preference in a simple data-retrieval task , 1993, HLT.

[3]  I. Scott MacKenzie,et al.  LetterWise: prefix-based disambiguation for mobile text input , 2001, UIST '01.

[4]  Stephen A. Brewster,et al.  Multimodal 'eyes-free' interaction techniques for wearable devices , 2003, CHI '03.

[5]  Philip R. Cohen The role of natural language in a multimodal interface , 1992, UIST '92.

[6]  Daniel B. Horn,et al.  Patterns of entry and correction in large vocabulary continuous speech recognition systems , 1999, CHI '99.

[7]  Sharon L. Oviatt,et al.  Multimodal interfaces for dynamic interactive maps , 1996, CHI.

[8]  Mark D. Dunlop,et al.  Predictive text entry methods for mobile phones , 2000, Personal Technologies.

[9]  Nambu Hirotaka,et al.  Reassessing current cell phone designs: using thumb input effectively , 2003, CHI Extended Abstracts.

[10]  Steinar Kristoffersen,et al.  “Making place” to make IT work: empirical explorations of HCI for mobile CSCW , 1999, GROUP.

[11]  Miika Silfverberg Using Mobile Keypads with Limited Visual Feedback: Implications to Handheld and Wearable Devices , 2003, Mobile HCI.

[12]  Virpi Roto,et al.  Interaction in 4-second bursts: the fragmented nature of attentional resources in mobile HCI , 2005, CHI.

[13]  Allen Newell,et al.  The psychology of human-computer interaction , 1983 .

[14]  Alexander H. Waibel,et al.  Multimodal error correction for speech user interfaces , 2001, TCHI.

[15]  Robert Rosenthal,et al.  Repeated-measures contrasts for "multiple-pattern" hypotheses. , 2003, Psychological methods.

[16]  I. Scott MacKenzie,et al.  Predicting text entry speed on mobile phones , 2000, CHI.

[17]  Alan F. Blackwell,et al.  Dasher—a data entry interface using continuous gestures and language models , 2000, UIST '00.

[18]  Clare-Marie Karat,et al.  Hands-Free, Speech-Based Navigation During Dictation: Difficulties, Consequences, and Solutions , 2003, Hum. Comput. Interact..

[19]  Mark D. Dunlop,et al.  Mobile Human-Computer Interaction - MobileHCI 2004 , 2004, Lecture Notes in Computer Science.

[20]  Rebecca E. Grinter,et al.  Y Do Tngrs Luv 2 Txt Msg? , 2001, ECSCW.

[21]  Allen Newell,et al.  Towards real-time GOMS: a model of expert behaviour in a highly interactive task , 1994, Behav. Inf. Technol..

[22]  Andriy Pavlovych,et al.  Model for non-expert text entry speed on 12-button phone keypads , 2004, CHI '04.

[23]  Sharon Oviatt,et al.  Integration and synchronization of input modes during multimodal human-computer interaction , 1997 .

[24]  Sharon L. Oviatt,et al.  The efficiency of multimodal interaction: a case study , 1998, ICSLP.

[25]  Alan F. Blackwell,et al.  Dasher: A Gesture-Driven Data Entry Interface for Mobile Computing , 2002, Hum. Comput. Interact..

[26]  Alan F. Blackwell,et al.  Dasher: A Gesture-Driven Data Entry Interface for Mobile Computing , 2002 .

[27]  Andy Cockburn,et al.  An Evaluation of Mobile Phone Text Input Methods , 2002, AUIC.

[28]  Rebecca E. Grinter,et al.  Wan2tlk?: everyday text messaging , 2003, CHI '03.

[29]  Michael E. Atwood,et al.  Project Ernestine: Validating a GOMS Analysis for Predicting and Explaining Real-World Task Performance , 1993, Hum. Comput. Interact..

[30]  J. D. Gould How experts dictate. , 1978 .

[31]  Antonella De Angeli,et al.  Integration and synchronization of input modes during multimodal human-computer interaction , 1997, CHI.

[32]  David S. Ebert,et al.  The integrality of speech in multimodal interfaces , 1998, TCHI.

[33]  Fabio Crestani,et al.  Second International Workshop on Mobile and Ubiquitous Information Access , 2004 .

[34]  Robert D. Rodman,et al.  Computer Speech Technology , 1999 .

[35]  Mark D. Dunlop,et al.  Mobile Human-computer interaction - MobileHCI 2004 : 6th International Symposium, MobileHCI 2004, Glasgow, UK, September 13-16, 2004 : proceedings , 2004 .

[36]  Gareth J. F. Jones,et al.  From Multimedia Retrieval to Knowledge Management , 2002, Computer.

[37]  Ben Shneiderman,et al.  A comparison of voice controlled and mouse controlled web browsing , 2000, Assets '00.

[38]  Allen Newell,et al.  Cumulating the science of HCI: from s-R compatibility to transcription typing , 1989, CHI '89.

[39]  Christina L. James,et al.  Text input for mobile devices: comparing model prediction to actual performance , 2001, CHI.

[40]  David E. Kieras,et al.  Towards a Practical GOMS Model Methodology for User Interface Design , 1988 .

[41]  Kent Lyons,et al.  The impacts of limited visual feedback on mobile text entry for the Twiddler and mini-QWERTY keyboards , 2005, Ninth IEEE International Symposium on Wearable Computers (ISWC'05).

[42]  I. Scott MacKenzie,et al.  Text Entry for Mobile Computing: Models and Methods,Theory and Practice , 2002, Hum. Comput. Interact..

[43]  Jennifer Lai,et al.  Facilitating mobile communication with multimodal access to email messages on a cell phone , 2004, CHI EA '04.

[44]  Clare-Marie Karat,et al.  Overcoming unusability: developing efficient strategies in speech recognition systems , 2000, CHI Extended Abstracts.