A multi-stage perspective on CELP speech coding

The author addresses methods for high-performance innovation coding in the context of a code-excited linear prediction (CELP) speech coder. He argues that care should be taken to ensure that innovations do not extend outside the Voronoi regions of the long-term prediction (LTP) stage. With this cell-conditioned approach, it is shown that the innovation gain can be coded with only 1 bit (i.e. the sign alone). The method is then extended to the case of delayed decision (tree search methods). It is demonstrated that, in this case, the innovation vectors should be placed near the boundaries of the LTP Voronoi regions. This is in contrast to the strict one-at-a-time (OAT) procedure presented by P. Hedelin and A. Bergstrom (1991), where innovations should be placed well inside the regions. Simulation studies demonstrate that high-quality speech coding is indeed feasible using this 1-bit gain coding procedure.
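
As a rough illustration of what sign-only gain coding means, the following minimal sketch searches an innovation codebook when the gain magnitude is fixed and only the sign is transmitted. It is not the author's procedure: the function name `sign_only_search`, the toy codebook, and the fixed `gain_magnitude` are hypothetical, and the sketch omits the synthesis filter, perceptual weighting, and any cell-conditioning on the LTP stage. It relies only on the fact that, for a fixed magnitude, the squared error is minimized by giving the gain the sign of the correlation between the target and the code vector.

```python
def sign_only_search(target, codebook, gain_magnitude):
    """Pick the code vector and sign minimizing ||target - sign * gain * c||^2.

    Assumption: the gain magnitude is fixed and known to the decoder,
    so only the index and a single sign bit would need to be sent.
    """
    best = None
    for index, code in enumerate(codebook):
        # For a fixed magnitude, the best sign matches the correlation sign.
        corr = sum(t * c for t, c in zip(target, code))
        sign = 1 if corr >= 0 else -1
        err = sum((t - sign * gain_magnitude * c) ** 2
                  for t, c in zip(target, code))
        if best is None or err < best[2]:
            best = (index, sign, err)
    return best


# Toy usage: a 2-dimensional target and a two-entry codebook (illustrative only).
index, sign, err = sign_only_search([1.0, -2.0], [[1.0, 0.0], [0.0, 1.0]], 1.5)
```

In this toy case the second code vector is selected with a negative sign, since the target correlates negatively with it.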

[1] Manfred R. Schroeder, et al. Code-excited linear prediction (CELP): High-quality speech at very low bit rates, 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[2] David L. Neuhoff, et al. Cell-conditioned multistage vector quantization, 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[3] I. A. Gerson, et al. Vector sum excited linear prediction (VSELP) speech coding at 8 kbps, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[4] R. Gray, et al. Vector quantization, 1984, IEEE ASSP Magazine.

[5] Takehiro Moriya, et al. 4.8 kbit/s delayed decision CELP coder using tree coding, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[6] Peter Kroon, et al. Pitch predictors with high temporal resolution, 1990, International Conference on Acoustics, Speech, and Signal Processing.

[7] Per Hedelin, et al. Amplitude quantization for CELP excitation signals, 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.