High quality mid-rate speech coding

The authors present the results of a feasibility study comparing two basic classes of coders in the mid-rate range (6-8 kbps): CELP and sinusoid-based coders. It is concluded that CELP coding is a mature technique able to yield high-quality synthetic speech, with some background noise, in the mid-rate range. It has tractable computational complexity and good robustness, making it the best present candidate. Postprocessing can enhance its output quality, though further research is needed in this field. Sinusoidal coding, together with the use of narrowband basis functions, can produce synthetic speech of higher quality than that produced by CELP. Its robustness, however, must be increased. This work suggests that a significant research effort into this technique is fully justified.
