Low rate speech coding incorporating simultaneously masked spectrally weighted linear prediction

Linear prediction (LP) is the cornerstone of most modern speech compression algorithms. It has previously been shown that incorporating a weighting function based on the simultaneous masking property of the ear into the calculation of the LP coefficients (SMWLPC) allows the filter to better model the unmasked sections of the input spectrum. This paper presents a detailed analysis of the implementation of SMWLPC in low-rate speech codecs. The analysis identifies the cause of inconsistencies in the technique and leads to proposed solutions. Experimental results show that, when combined with the proposed changes, the SMWLPC technique is suitable for implementation in any low-rate LP-based speech codec, and that the net result is an improvement in the perceptual quality of synthesised speech for all speakers.
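
As an illustration of the general idea of folding a spectral weighting function into the LP calculation, the following is a minimal sketch and not the paper's algorithm. It assumes the weight is applied to the frame's power spectrum before the autocorrelation is formed, and that the LP coefficients are then found with the Levinson-Durbin recursion. The function names (`levinson_durbin`, `weighted_lp`), the Hamming window, the FFT size, and the flat placeholder weight are all assumptions made for this example; the abstract does not specify how the masking-based weight is derived, so that derivation is left out.

```python
import numpy as np

def levinson_durbin(r, order):
    """Solve the autocorrelation normal equations.

    Returns (a, err) where a[0] == 1 and the prediction error filter is
    A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order.
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Correlation of the current predictor with the next lag.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def weighted_lp(frame, order, weight, nfft=512):
    """LP analysis on a spectrally weighted power spectrum (hypothetical helper).

    `weight` must be non-negative with length nfft // 2 + 1. In a
    masking-based scheme it would be derived from a masking threshold;
    here it is simply an input supplied by the caller.
    nfft should be at least 2 * len(frame) - 1 to avoid circular aliasing.
    """
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed, nfft)) ** 2
    # The inverse FFT of the (weighted) power spectrum gives the autocorrelation.
    r = np.fft.irfft(power * weight, nfft)[: order + 1]
    return levinson_durbin(r, order)

# Usage with a flat placeholder weight, which reduces to conventional LP.
rng = np.random.default_rng(0)
frame = rng.standard_normal(160)          # stand-in for a 20 ms frame at 8 kHz
flat_weight = np.ones(512 // 2 + 1)       # placeholder, not a masking model
coeffs, residual_energy = weighted_lp(frame, 10, flat_weight)
```

With a flat weight the sketch reproduces ordinary autocorrelation-method LP; a weight that de-emphasises some frequency regions would bias the fit toward the remaining regions of the spectrum.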
