Ivr Usability Engineering Using Guidelines And Analyses Of End-to-End Calls

While speech offers unique advantages and opportunities as an interface modality, the known limitations of speech recognition technology and cognitive limitations of spoken interaction amplify the importance of usability in the development of speech applications. The competitive business environment, on the other hand, requires sound business justification for any investment in speech technology and proof of its usability and effectiveness. This chapter presents design principles and usability engineering methods that empower practitioners to optimize both usability and ROI of telephone speech applications, frequently also referred to as telephone Voice User Interface (VUI) or Interactive Voice Response (IVR) systems. The first section discusses limitations of speech user interfaces and their repercussions on design. From a survey of research and industry know-how a short list of guidelines for IVR design is derived. Examples illustrate how to apply these guidelines during the design phase of a telephone speech application. The second section presents a data-driven methodology for optimizing usability and effectiveness of IVRs. The methodology is grounded in the analysis of live, end-to-end calls - the ultimate field data for telephone speech applications. We will describe how to capture end-to-end call data from deployed systems and how to mine this data to measure usability and identify problems. Leveraging end-to-end call data empowers practitioners to build solid business cases, optimize ROI, and justify the cost of IVR usability engineering. Case studies from the consulting practice at BBN Technologies illustrate how these methods were applied in some of the largest US deployments of automated telephone applications.

[1]  David G. Novick,et al.  Limiting Factors of Automated Telephone Dialogues , 1999 .

[2]  Bernhard Suhm,et al.  Towards best practices for speech user interface design , 2003, INTERSPEECH.

[3]  Ben Shneiderman,et al.  The limits of speech recognition , 2000, CACM.

[4]  Daryle Gardner-Bonneau,et al.  Human Factors and Voice Interactive Systems , 1999 .

[5]  Pat Peterson,et al.  Evaluating commercial touch-tone and speech-enabled telephone voice user interfaces using a single measure , 2001, CHI Extended Abstracts.

[6]  Teresa L. Roberts,et al.  The effects of device technology on the usability of advanced telephone functions , 1989, CHI '89.

[7]  Richard Halstead-Nussloch,et al.  The design of phone-based interfaces for consumers , 1989, CHI '89.

[8]  Antonella De Angeli,et al.  Integration and synchronization of input modes during multimodal human-computer interaction , 1997, CHI.

[9]  Bruce Balentine,et al.  How to Build a Speech Recognition Application , 1999 .

[10]  Clare-Marie Karat,et al.  Overcoming unusability: developing efficient strategies in speech recognition systems , 2000, CHI Extended Abstracts.

[11]  Alexander H. Waibel,et al.  Model-based and empirical evaluation of multimodal interactive error correction , 1999, CHI '99.

[12]  Keith Edwards,et al.  Evaluating commercial speech recognition and DTMF technology for automated telephone banking services , 1997 .

[13]  Philip Hodgson Talk to your Computer: Speech Recognition Made Easy by Dan Newman 2000, 191 pages, $14.95 Berkeley, CA: Waveside Publishing ISBN 0–9670389–3–6 , 2001 .

[14]  E. Schegloff,et al.  A simplest systematics for the organization of turn-taking for conversation , 1974 .

[15]  Karen Holtzblatt,et al.  Contextual design , 1997, INTR.

[16]  C. Delogu,et al.  A comparison between DTMF and ASR IVR services through objective and subjective evaluation , 1998, Proceedings 1998 IEEE 4th Workshop Interactive Voice Technology for Telecommunications Applications. IVTTA '98 (Cat. No.98TH8376).

[17]  Paul Resnick,et al.  Relief from the audio interface blues: expanding the spectrum of menu, list, and form styles , 1995, TCHI.

[18]  David L. Parnas,et al.  On the use of transition diagrams in the design of a user interface for an interactive computer system , 1969, ACM '69.

[19]  Matthias Peissner,et al.  Voice User Interface Design , 2004, UP.

[20]  Lori Lamel,et al.  Dialog in the RAILTEL telephone-based system , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[21]  Hagen Soltau,et al.  Specialized acoustic models for hyperarticulated speech , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[22]  A. L. Gorin,et al.  How may I help you , 1996 .

[23]  Jakob Nielsen,et al.  Usability engineering , 1997, The Computer Science and Engineering Handbook.

[24]  Daniel B. Horn,et al.  Patterns of entry and correction in large vocabulary continuous speech recognition systems , 1999, CHI '99.