Qualitative investigation of the display of speech recognition results for communication with deaf people

Speech technologies provide ways of helping people with hearing loss by improving their autonomy. This study focuses on an application in French language which is developed in the collaborative project RAPSODIE in order to improve communication between a hearing person and a deaf or hard-of-hearing person. Our goal is to investigate different ways of displaying the speech recognition results which takes also into account the reliability of the recognized items. In this qualitative study, 10 persons have been interviewed to find the best way of displaying the speech transcription results. All the participants are deaf with different levels of hearing loss and various modes of communication.

[1]  Olivier Galibert,et al.  The ETAPE corpus for the evaluation of speech-based TV content processing in the French language , 2012, LREC.

[2]  Dale L. Johnson The Psychology of Deafness , 1962 .

[3]  Alexander H. Waibel,et al.  Multimodal error correction for speech user interfaces , 2001, TCHI.

[4]  P. Worchel,et al.  The Psychology of Deafness , 1961 .

[5]  Denis Jouvet,et al.  Comparison of approaches for an efficient phonetic decoding , 2013, INTERSPEECH.

[6]  Alexander I. Rudnicky,et al.  Pocketsphinx: A Free, Real-Time Continuous Speech Recognition System for Hand-Held Devices , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[7]  Hui Jiang,et al.  Confidence measures for speech recognition: A survey , 2005, Speech Commun..

[8]  Brian Amento,et al.  Error correction of voicemail transcripts in SCANMail , 2006, CHI.

[9]  Per Ola Kristensson,et al.  On the benefits of confidence visualization in speech recognition , 2008, CHI.

[10]  Masood Masoodian,et al.  Interface design strategies for computer-assisted speech transcription , 2008, OZCHI '08.

[11]  Jean Paul Haton,et al.  Frame-Synchronous and Local Confidence Measures for Automatic Speech Recognition , 2011, Int. J. Pattern Recognit. Artif. Intell..

[12]  Walter Bender,et al.  Improving speech playback using time-compression and speech recognition , 2004, CHI.

[13]  Denis Jouvet,et al.  Hybrid language models for speech transcription , 2014, INTERSPEECH.