Paper information - Session-independent EMG-based Speech Recognition

Session-independent EMG-based Speech Recognition

This paper reports on our recent research in speech recognition by surface electromyography (EMG), which is the technology of recording the electric activation potentials of the human articulatory muscles by surface electrodes in order to recognize speech. This method can be used to create Silent Speech Interfaces, since the EMG signal is available even when no audible signal is transmitted or captured. Several past studies have shown that EMG signals may vary greatly between different recording sessions, even for one and the same speaker. This paper shows that session-independent training methods may be used to obtain robust EMG-based speech recognizers which cope well with unseen recording sessions as well as with speaking mode variations. Our best session-independent recognition system, trained on 280 utterances from 7 different sessions, achieves an average Word Error Rate (WER) of 21.93% on a testing vocabulary of 108 words. The overall best session-adaptive recognition system, based on a session-independent system and adapted towards the test session with 40 adaptation sentences, achieves an average WER of 15.66%, which is a relative improvement of 21% compared to the baseline average WER of 19.96% of a session-dependent recognition system trained only on a single session of 40 sentences.
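The relative improvement quoted in the abstract can be checked with a short calculation. The sketch below assumes the usual definition of relative WER reduction, (baseline - improved) / baseline; the function name is hypothetical and is not taken from the paper.

```python
def relative_wer_reduction(baseline_wer: float, improved_wer: float) -> float:
    """Return the relative WER reduction in percent.

    Assumes the common definition (baseline - improved) / baseline;
    the paper does not spell out its formula.
    """
    if baseline_wer <= 0:
        raise ValueError("baseline WER must be positive")
    return (baseline_wer - improved_wer) / baseline_wer * 100.0


# WER figures from the abstract: session-dependent baseline vs. session-adaptive system.
reduction = relative_wer_reduction(19.96, 15.66)
print(f"Relative WER reduction: {reduction:.1f}%")
```

Under this definition the result is about 21.5%, in line with the 21% stated in the abstract.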

Tanja Schultz | Michael Wand
