Interpolation of the pitch-predictor parameters in analysis-by-synthesis speech coders

The pitch predictor contributes greatly to the efficiency of current analysis-by-synthesis speech coders by mapping the past reconstructed signal into the present. For good performance, however, its parameters must be updated often (once every 2.5-7.5 ms). A slower update rate of the pitch-predictor delay results in time misalignment between the original signal and the pitch-predictor contribution to the reconstructed signal. The authors introduce a new procedure that allows a slow update rate of the pitch-predictor parameters without this problem. In this method, the original signal is modified in a closed-loop fashion such that the parameter values obtained by interpolation of open-loop estimates form the optimal encoding of the modified signal. This new paradigm is a generalization of the familiar analysis-by-synthesis principle. The generalized analysis-by-synthesis principle can be used for interpolation of both the pitch-predictor delay and gain. The authors compare, by means of a subjective test, speech signals encoded with different versions of the code-excited linear predictor (CELP) coder. The comparison shows that a pitch predictor exploiting the present interpolation strategy, with an update rate of 50 Hz, provides a subjective speech quality similar to that of a conventional pitch predictor whose parameters are updated for every pitch cycle.
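The idea of interpolating the pitch-predictor delay and gain between slow updates can be illustrated with a minimal sketch. It is not the authors' implementation: the 8 kHz sampling rate, the choice of linear interpolation, the rounding of the delay to an integer number of samples, and the names `interpolate_parameter` and `pitch_predictor_contribution` are all assumptions made for illustration. The closed-loop modification of the original signal described in the abstract is not shown.

```python
SAMPLE_RATE_HZ = 8000   # assumed sampling rate, not stated in the abstract
UPDATE_RATE_HZ = 50     # update rate quoted in the abstract
FRAME_LEN = SAMPLE_RATE_HZ // UPDATE_RATE_HZ  # samples between parameter updates


def interpolate_parameter(prev_value, curr_value, frame_len):
    """Per-sample linear interpolation (an assumed interpolation rule).

    prev_value applies at the end of the previous frame, curr_value at the
    end of the current frame; the last returned sample equals curr_value.
    """
    step = (curr_value - prev_value) / frame_len
    return [prev_value + step * (n + 1) for n in range(frame_len)]


def pitch_predictor_contribution(past, delays, gains):
    """One-tap pitch predictor with a per-sample delay and gain.

    Maps the past reconstructed signal into the present:
    out[n] = gains[n] * signal[n - delays[n]].
    Simplification: each delay is rounded to a whole number of samples,
    and no other excitation is added.
    """
    buffer = list(past)  # must hold at least max(delays) samples
    out = []
    for delay, gain in zip(delays, gains):
        d = int(round(delay))
        sample = gain * buffer[-d]  # sample located d positions in the past
        out.append(sample)
        buffer.append(sample)
    return out


# Hypothetical open-loop estimates at two consecutive frame boundaries.
delay_contour = interpolate_parameter(40.0, 44.0, FRAME_LEN)
gain_contour = interpolate_parameter(0.8, 0.6, FRAME_LEN)
```

With these assumed values, the delay moves smoothly from 40 to 44 samples over one 20 ms frame instead of jumping at the frame boundary, which is the behaviour a slow update rate requires of the interpolated parameters.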
