Phoneme Classification for Speech Synthesiser using Differential EMG Signals between Muscles

This paper proposes the use of differential electromyography (EMG) signals between muscles for phoneme classification, with which a Japanese speech synthesiser system can be constructed using fewer electrodes. In distinction from traditional methods using differential EMG signals between bipolar electrodes on the same muscle, an EMG signal is derived as differential between monopolar signals on two different muscles in the proposed method. Then, frequency-based feature patterns are extracted with filter banks, and classification of phonemes is realized by using a probabilistic neural network, which combines feature reduction and pattern classification processes in a single network structure. Experimental results show that the proposed method can achieve considerably high classification performance with fewer electrodes

[1]  Biing-Hwang Juang,et al.  Discriminative learning for minimum error classification [pattern recognition] , 1992, IEEE Trans. Signal Process..

[2]  Toshio Tsuji,et al.  Multivariate Pattern Classification based on Local Discriminant Component Analysis , 2004, 2004 IEEE International Conference on Robotics and Biomimetics.

[3]  Toshio Tsuji,et al.  Development of A Five-finger Prosthetic Hand Using Ultrasonic Motors Controlled by Two EMG Signals , 2002, J. Robotics Mechatronics.

[4]  Osamu FUKUDA,et al.  A Substitute Vocalization System Based on EMG Signals , 2004 .

[5]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[6]  R. Goode,et al.  Artificial laryngeal devices in post‐laryngectomy rehabilitation , 1975, The Laryngoscope.

[7]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[8]  A. Siebens,et al.  New Unidirectional Airflow Ball Tracheostomy Speaking Valve , 2000, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[9]  J H Abbs,et al.  Strain gauge transduction of lip and jaw motion in the midsagittal plane: refinement of a prototype system. , 1979, The Journal of the Acoustical Society of America.

[10]  Toshio Tsuji,et al.  A log-linearized Gaussian mixture network and its application to EEG pattern classification , 1999, IEEE Trans. Syst. Man Cybern. Part C.