论文信息 - Putting people first: specifying proper names in speech interfaces

Putting people first: specifying proper names in speech interfaces

Communication is about people, not machines. But as firms and families alike spread out geographically, we rely increasingly on telecommunications tools to keep us “connected”. The challenge of such systems is to enable conversation between individuals without computational infrastructure getting in the way. This paper compares two speech-based communication systems, Phoneshell and Chatter, in how they deal with the keys to communication: proper names. Chatter, a conversational system using speech-recognition, improves upon the hierarchical nature of the touch-tone based Phoneshell by maintaining context and enabling use of anaphora. Proper names can present particular problems for speech recognizers, so an interface algorithm for reliable name specification by spelling is offered. Since individual letter recognition is non-robust, Chatter implicitly disambiguates strings of letters based on context. We hypothesize that the right interface can make faulty speech recognition as usable as TouchTones—even more so.

Chris Schmandt | Matthew Marx | C. Schmandt | Matthew Marx

[1] Candace L. Sidner,et al. Attention, Intentions, and the Structure of Discourse , 1986, CL.

[2] Emmon Bach,et al. Informal Lectures on Formal Semantics , 1989 .

[3] James Raymond Davis. Let Your Fingers do the Spelling: Implicit disambiguation of words spelled with the telephone keypad , 1991 .

[4] Robert E. Kraut,et al. Expressive richness: a comparison of speech and text as media for revision , 1991, CHI.

[5] Tony Vitale,et al. An Algorithm for High Accuracy Name Pronunciation by Parametric Speech Synthesizer , 1991, Comput. Linguistics.

[6] Chris Schmandt,et al. Phoneshell: the telephone as computer terminal , 1993, MULTIMEDIA '93.

[7] Chris Schmandt. Chatter: A Conversational Learning Speech Interface , 1994 .