Facial SEMG for Speech Recognition: Inter-Subject Variation

The aim of this project is to identify speech from facial muscle activity, without audio signals. The paper presents an effective technique that measures the relative activity of the articulatory muscles, and tests the performance of the system under inter-subject variation. Three English vowels were used as the recognition targets. The moving root mean square (RMS) of the surface electromyogram (SEMG) of four facial muscles is used to segment the signal and identify the start and end of each utterance. The RMS of the signal between the start and end markers is integrated and normalised to give the relative muscle activity, and the relative activities of the four muscles are classified using a back-propagation neural network to identify the speech. The results show that the technique gives a high recognition rate when the network is trained and tested on the same subject. Accuracy drops when a network trained on one subject is tested on another, which suggests a large inter-subject variation in speaking style for similar sounds. The experiments also show that the system is easy to train for a new user. It is suggested that such a system is suitable for simple commands in a human-computer interface when it is trained for the user.
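The feature extraction described above (moving RMS, segmentation, integration and normalisation) can be illustrated with a minimal sketch. This is not the authors' implementation: the window length, the threshold rule, the use of a summed envelope to place a single pair of markers, and normalisation by the total across muscles are all assumptions made here for illustration, and the function names are hypothetical.

```python
import numpy as np

def moving_rms(x, window):
    """Moving RMS of a 1-D signal over a sliding window of `window` samples."""
    x = np.asarray(x, dtype=float)
    kernel = np.ones(window) / window
    # Mean of the squared signal over the window, then square root.
    return np.sqrt(np.convolve(x ** 2, kernel, mode="same"))

def find_utterance(rms, threshold):
    """Return (start, end) indices spanning the samples where rms > threshold.

    `end` is exclusive. Returns None if the threshold is never exceeded.
    """
    active = np.flatnonzero(rms > threshold)
    if active.size == 0:
        return None
    return int(active[0]), int(active[-1]) + 1

def relative_activity(channels, window=50, threshold_ratio=0.2):
    """Relative muscle activity for an array of shape (n_muscles, n_samples).

    Assumptions (not taken from the paper): one pair of start/end markers is
    derived from the summed RMS envelope, the threshold is a fixed fraction
    of the envelope peak, and each muscle's integrated RMS is divided by the
    total across muscles.
    """
    rms = np.array([moving_rms(ch, window) for ch in channels])
    envelope = rms.sum(axis=0)
    bounds = find_utterance(envelope, threshold_ratio * envelope.max())
    if bounds is None:
        raise ValueError("no utterance detected")
    start, end = bounds
    # Integrate the RMS between the markers (sum over samples).
    integrated = rms[:, start:end].sum(axis=1)
    return integrated / integrated.sum()
```

The resulting vector has one entry per muscle and sums to 1, so it could serve as the input to a classifier such as the back-propagation network the abstract mentions.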
