Error Resolution Strategies for Interactive Television Speech Interfaces

The transition to digital TV is turning the television set into a device that supplies information as well as entertainment and that supports two-way communication with the viewer. However, the present remote control is not suited to navigating the huge amount of services and information that future digital TV will provide, presumably including access to the Internet. One way to cope with the complex information navigation required of TV viewers is to augment the interaction tools currently available for TV. This thesis investigates two approaches to such augmentation: linking paper-based TV guides to the digital TV, and enhancing the remote control unit with speech interaction.

Augmenting paper-based TV guides is a futuristic research approach based on integrating the guides with computing technology. The result is an interactive paper-based TV guide that also functions as a remote control for the TV. A prototype system is developed and explorative studies are conducted to investigate this approach. These studies indicate the benefits of integrating paper-based TV guides with the TV set. They also illuminate the potential for innovative solutions for home information systems. Integrating familiar physical artefacts, such as paper and pen, into TV technology may provide easy access to information services usually provided by PCs and the Internet. Thus, the same augmentation needed for TV as an entertainment device also opens up new communication channels for providing societal information to citizens who do not feel comfortable with conventional computers.

The thesis also reports on studies of speech interfaces for TV information navigation. Traditional speech interfaces share several common problems, including user acceptance and misinterpretation of user input. These problems are investigated in empirical and explorative studies with implementations of mock-ups and running research systems. We found that augmenting remote control devices with speech is a pragmatic solution that eases information navigation and search.
