In speech analysis and synthesis based on linear prediction, it is commonly assumed that the predictor coefficients contain all the spectral and phase information necessary for accurate synthesis of the speech signal. However, even under the best circumstances, the synthetic speech sounds unnatural to the critical listener. Subjective tests reveal that spectral errors introduced by linear prediction analysis techniques are a major source of the unnatural sound quality of synthetic speech. This paper describes a modified analysis-synthesis procedure which, although it relies on the basic LPC technique for analysis and synthesis, avoids the spectral amplitude and phase distortions introduced by these techniques. In the new method, proper reproduction of the speech spectrum at the receiver is ensured by transmitting the short-time spectrum of the prediction residual to the receiver.
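The role of the prediction residual can be illustrated with a minimal sketch. It is not the procedure of the paper, whose details are not given here. It assumes the autocorrelation method of linear prediction, a synthetic test frame, and an uncoded residual; the function names `lpc_autocorr`, `analyze`, and `synthesize`, the frame length, and the predictor order are all hypothetical choices made for illustration. The sketch shows that the predictor coefficients together with the residual (or, equivalently, its short-time spectrum) are enough to recover the frame and its spectrum.

```python
import numpy as np

def lpc_autocorr(frame, order):
    """Return A(z) = 1 + a1*z^-1 + ... + ap*z^-p (Levinson-Durbin recursion)."""
    n = len(frame)
    r = np.array([np.dot(frame[:n - k], frame[k:]) for k in range(order + 1)])
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for stage i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a

def analyze(frame, order):
    """Inverse-filter the frame with A(z); returns (a, full-length residual)."""
    a = lpc_autocorr(frame, order)
    residual = np.convolve(frame, a)  # length len(frame) + order
    return a, residual

def synthesize(residual, a, n):
    """Drive the all-pole filter 1/A(z) with the residual for n samples."""
    p = len(a) - 1
    out = np.zeros(n)
    for i in range(n):
        acc = residual[i]
        for k in range(1, min(i, p) + 1):
            acc -= a[k] * out[i - k]
        out[i] = acc
    return out

# Synthetic stand-in for a speech frame (assumption: not real speech data).
rng = np.random.default_rng(0)
t = np.arange(240)
frame = (np.sin(2 * np.pi * 0.05 * t)
         + 0.5 * np.sin(2 * np.pi * 0.12 * t)
         + 0.05 * rng.standard_normal(t.size))

a, residual = analyze(frame, order=8)
recon = synthesize(residual, a, len(frame))

# Short-time spectra: the residual spectrum divided by A gives the frame spectrum.
nfft = 512
E = np.fft.rfft(residual, nfft)
A = np.fft.rfft(a, nfft)
S_rebuilt = E / A
S_direct = np.fft.rfft(frame, nfft)
```

In this idealized setting the reconstruction is exact up to rounding, because nothing is quantized. A practical system would have to code the residual spectrum at a limited bit rate, which this sketch does not attempt to model.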