Predictive coding of speech signals and subjective error criteria

Predictive coding methods attempt to minimize the r.m.s. error in the coded signal. However, the human ear does not judge signal distortion by r.m.s. error alone; the spectral shape of the error relative to the signal spectrum also matters. Specifically, for speech signals, the locations of the formant frequencies and their rates of change with time influence the audibility, and thus the subjective distortion, of any quantizing noise. In this paper, methods for reducing the subjective distortion in predictive coders for speech signals are described and evaluated. Improved speech quality is obtained (a) by efficient removal of the formant- and pitch-related redundant structure of speech before quantizing and (b) by effective masking of the quantizer noise by the speech signal.
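
As a rough illustration of the starting point described above, the following minimal sketch implements a first-order closed-loop predictive coder. It is not the coder evaluated in the paper: the fixed predictor coefficient `a`, the uniform quantizer, the step size, and the sine test signal are all assumptions chosen for brevity, and the function names are hypothetical. It shows only that, in this arrangement, the error in the coded signal equals the quantizer error sample by sample, so the r.m.s. error is set by the quantizer without regard to the signal spectrum.

```python
import math


def quantize(x, step):
    """Uniform quantizer (an assumed, illustrative choice)."""
    return step * round(x / step)


def dpcm_encode_decode(signal, a=0.9, step=0.1):
    """First-order closed-loop predictive coder (illustrative sketch).

    The predictor runs on the reconstructed signal, as a decoder would.
    Returns (reconstructed samples, quantizer errors).
    """
    reconstructed = []
    q_errors = []
    prev = 0.0  # previous reconstructed sample
    for s in signal:
        pred = a * prev              # prediction from past output
        residual = s - pred          # prediction residual to be quantized
        q = quantize(residual, step)
        q_errors.append(q - residual)
        prev = pred + q              # decoder-side reconstruction
        reconstructed.append(prev)
    return reconstructed, q_errors


def rms(values):
    """Root-mean-square value of a sequence."""
    return math.sqrt(sum(v * v for v in values) / len(values))


# Hypothetical test signal: a slowly varying sinusoid.
signal = [math.sin(2 * math.pi * 0.05 * n) for n in range(200)]
recon, q_err = dpcm_encode_decode(signal)
coding_err = [r - s for r, s in zip(recon, signal)]

print("r.m.s. coding error:   ", rms(coding_err))
print("r.m.s. quantizer error:", rms(q_err))
```

The two printed values agree up to floating-point rounding. Shaping the noise spectrum so that the speech signal masks it, as the abstract proposes, would require an additional noise-feedback filter that this sketch does not include.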
