Recent advances in speech coding

After a short summary of some basic properties of speech signals and of speech signal models the effect of linear prediction and vector quantization for data compression in speech coding is outlined. Some well-known coding schemes are reviewed. The recently developed RELP-S schemes based on speech analysis by synthesis are discussed in more detail. In particular a scheme using stochastic excitation sequences is expected to guarantee high speech quality at data rates far below 8 kb/s.

[1]  Allen Gersho,et al.  On the structure of vector quantizers , 1982, IEEE Trans. Inf. Theory.

[2]  W Baier Zurich seminar on digital communications , 1980 .

[3]  Lawrence R. Rabiner,et al.  Speech research directions , 1986, AT&T Technical Journal.

[4]  J. Makhoul,et al.  Vector quantization in speech coding , 1985, Proceedings of the IEEE.

[5]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[6]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[7]  Robert M. Gray,et al.  Multiple local optima in vector quantizers , 1982, IEEE Trans. Inf. Theory.

[8]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[9]  Peter Kabal,et al.  Stability and performance analysis of pitch filters in speech coders , 1987, IEEE Trans. Acoust. Speech Signal Process..

[10]  Allen Gersho,et al.  Fast search algorithms for vector quantization and pattern matching , 1984, ICASSP.

[11]  R. Crochiere,et al.  Speech Coding , 1979, IEEE Transactions on Communications.

[12]  L. Rabiner,et al.  The acoustics, speech, and signal processing society - A historical perspective , 1984, IEEE ASSP Magazine.

[13]  H. Brehm,et al.  Description and generation of spherically invariant speech-model signals , 1987 .

[14]  Manfred R. Schroeder,et al.  Code-excited linear prediction(CELP): High-quality speech at very low bit rates , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[16]  Bishnu S. Atal,et al.  Predictive Coding of Speech at Low Bit Rates , 1982, IEEE Trans. Commun..

[17]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[18]  Ed F. Deprettere,et al.  Regular-pulse excitation-A novel approach to effective and efficient multipulse coding of speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[19]  N. S. Jayant Coding speech at low bit rates: Advanced algorithms and hardware for voice telecommunications are paring hit rates by at least a factor of four, without losing intelligibility , 1986, IEEE Spectrum.

[20]  Bishnu S. Atal,et al.  A new model of LPC excitation for producing natural-sounding speech at low bit rates , 1982, ICASSP.

[21]  B. Atal,et al.  Quantization procedures for the excitation in CELP coders , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[22]  B. Atal,et al.  Role of multi-pulse excitation in synthesis of natural-sounding voiced speech , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[23]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[24]  John E. Markel,et al.  Linear Prediction of Speech , 1976, Communication and Cybernetics.