Effect of Error Correction Strategy on Speech Dictation Throughput
暂无分享,去创建一个
The eight participants in this experiment used two different commercially available speech recognition dictation systems to complete a variety of reading transcription tasks. Participants enrolled fully in both systems. They received training in two correction strategies for both systems: multimodal correction (voice plus mouse plus keyboard) and hands-free correction (voice-only), and used both strategies during the experiment. The key findings were: • Both dictation systems were equally accurate. • Throughput (corrected words per minute) was significantly (63%) faster using multimodal correction. • Speaking rates were the same for both systems and correction strategies, averaging around 105-110 utterances (words and commands) per minute. • Correction speeds for the multimodal correction strategy (13.2 seconds per correction) were significantly faster than (a little more than twice as fast as) those for hands-free correction (29.1 seconds per correction). • At the end of the experiment, participants indicated they significantly preferred the multimodal correction strategy.
[1] D. Massaro. Preperceptual images, processing time, and perceptual units in auditory perception. , 1972, Psychological review.
[2] James R. Lewis,et al. IBM computer usability satisfaction questionnaires: Psychometric evaluation and instructions for use , 1995, Int. J. Hum. Comput. Interact..
[3] Allen Newell,et al. The psychology of human-computer interaction , 1983 .
[4] Dominic W. Massaro,et al. 4 – Preperceptual Images, Processing Time, and Perceptual Units in Speech Perception , 1975 .