USE OF HARMONIC PLUS NOISE MODEL FOR REDUCTION OF SELF LEAKAGE IN ELECTROALARYNGEAL SPEECH

Artificial larynx is an assistive device for providing excitation to vocal tract as a substitute to a dysfunctional or removed larynx. The speech generated by electrolarynx, an external vibrator held against the neck tissue, is not natural and most of the time is unintelligible because of the improper shape of the excitation pulses and presence of a background noise caused by sound leakage from the vibrator. The objective of this paper is to enhance the intelligibility of electrolaryngeal speech by reducing the background noise using harmonic plus noise model (HNM). The alaryngeal speech and the leakage signal are analyzed using HNM and average harmonic spectrum of the leakage noise is subtracted from the harmonic magnitude spectrum of the noisy speech in each frame. HNM synthesis is carried out retaining the original phase spectra. Investigations show that the output is more natural and intelligible as compared to input speech signal and the enhanced signal obtained from spectral subtraction without HNM analysis and synthesis.

[1]  D G Childers,et al.  Speech synthesis by glottal excited linear prediction. , 1994, The Journal of the Acoustical Society of America.

[2]  Caroline B. Huang,et al.  Enhancement of alaryngeal speech by adaptive filtering , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[3]  Alexander Fischer,et al.  Quantile based noise estimation for spectral subtraction and Wiener filtering , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[4]  Yannis Stylianou,et al.  Applying the harmonic plus noise model in concatenative speech synthesis , 2001, IEEE Trans. Speech Audio Process..

[5]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[6]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[7]  Parveen Lehana,et al.  Enhancement of alaryngeal speech using spectral subtraction , 2002, 2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628).

[8]  Richard M. Schwartz,et al.  Enhancement of speech corrupted by acoustic noise , 1979, ICASSP.

[9]  H. L. Barney,et al.  An experimental transistorized artificial larynx , 1959 .

[10]  Parveen Lehana,et al.  Reduction of Background Noise in Alaryngeal Speech using Spectral Subtraction with Quantile Based Noise Estimation , 2003 .

[11]  Prem C. Pandey,et al.  QUANTILE BASED NOISE ESTIMATION FOR SPECTRAL SUBTRACTION OF SELF LEAKAGE NOISE IN ELECTROLARYNGEAL SPEECH , 2004 .

[12]  Matti Karjalainen Speech communication, human and machine: by Douglas O'Shaughnessy, INRS-Telecommunication. Publisher: Addison-Wesley Publishing Company, Route 128, Reading, MA 01867, U.S.A., 1987, xviii+568 pp., ISBN 0-201-16520-1 , 1988 .

[13]  Peter Vary,et al.  Digital Speech Signal Processing , 2004 .

[14]  Prem C. Pandey,et al.  SPEECH SYNTHESIS IN INDIAN LANGUAGES , 2002 .