Feedback Requirements for Automatic Speech Recognition in the Process Control Room

Abstract Automatic Speech Recognition (ASR) has great potential for use in control room systems; to date, there has been little research into the human factors issues this raises. For example, careful consideration needs to be given to the provision of adequate feedback to the user. We concentrate on the two main types of visual feedback: textual and symbolic. Two studies reported here show that little difference exists between them in user performance on a task requiring spoken control of a process. However, the results demonstrate a significant reduction in learning time when textual and symbolic feedback are combined. We defined the correction of device misrecognitions as a verbal decision task, for which Study 1 shows that textual feedback is most appropriate. However, Study 2 shows that textual feedback is more likely to be misunderstood than either symbols or a combination of text and symbols. A combination of both text and symbols is proposed as the most efficient form of feedback for the use of ASR in control room systems.

[1]  Christopher Baber,et al.  Speech technology in control room systems : a human factors perspective , 1991 .

[2]  R. E. Walker,et al.  Comparative accuracy of recognizing American and international road signs. , 1965, The Journal of applied psychology.

[3]  J. G. Hollands,et al.  Engineering Psychology and Human Performance , 1984 .

[4]  J. R. Simon,et al.  Same-different reaction time to stimuli presented simultaneously to separate cerebral hemispheres. , 1988, Ergonomics.

[5]  Bruce A. Steiner,et al.  Situation Awareness: Icons vs. Alphanumerics , 1989 .

[6]  Ray Eberts,et al.  Comparison of Speech and Pictorial Displays in a Cockpit Environment , 1987 .

[7]  R. Ratcliff Theoretical interpretations of the speed and accuracy of positive and negative responses. , 1985, Psychological review.

[8]  Robert E. Dewar,et al.  Rapid Comprehension of Verbal and Symbolic Traffic Sign Messages , 1979 .

[9]  Donald A. Norman,et al.  Explorations in Cognition , 1975 .

[10]  Alphonse Chapanis,et al.  Interactive human communication , 1975 .

[11]  R Remington,et al.  On the Selection and Evaluation of Visual Display Symbology: Factors Influencing Search and Identification Times , 1986, Human factors.

[12]  Richard D. Gilson,et al.  Short Term Memory Demands in Processing Synthetic Speech , 1989 .

[13]  J. Morton,et al.  The effects of priming on picture recognition. , 1982, British journal of psychology.

[14]  Stephen M. Kosslyn,et al.  Naming pictures , 1990, J. Vis. Lang. Comput..

[15]  James L. McClelland,et al.  A distributed, developmental model of word recognition and naming. , 1989, Psychological review.

[16]  G. Corso,et al.  The Effects of Extended Practice on the Evaluation of Visual Display Codes , 1983 .

[17]  Roberta L. Klatzky,et al.  Commonalities and differences in semantic decisions about pictures and words , 1980 .

[18]  Steven W. Keele,et al.  Attention and human performance , 1973 .

[19]  R. Linggard Beyond speech recognition: language processing , 1990 .

[20]  M. Potter,et al.  Time to understand pictures and words , 1975, Nature.

[21]  H. H. Clark,et al.  Psychology and language : an introduction to psycholinguistics , 1979 .

[22]  R. Dewar,et al.  Reaction Time as an Index of Traffic Sign Perception , 1976, Human factors.

[23]  A W Young,et al.  Naming and Categorizing Faces and Written Names , 1986, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[24]  Beverly H. Williges,et al.  User feedback requirements with automatic speech recognition , 1985 .