Fractional rate multitree speech coding

The authors present both forward and backward adaptive speech coders that operate at 9.6, 12, and 16 kb/s using integer and fractional rate trees, weighted squared error distortion measures, the (M,L) tree search algorithm, and incremental path map symbol release. They introduce the concept of multitree source codes and illustrate how the multitree structure allows scalar quantizer-based codes and scalar adaptation rules to be used for fractional rate tree coding. With a frequency-weighted distortion measure, the forward and backward adaptive multitree coders produce near-toll-quality speech at 16 kb/s, while the backward adaptive 9.6 kb/s multitree coder substantially outperforms adaptive predictive coding and has an encoding delay of less than 2 ms. Performance results are presented in terms of unweighted and weighted signal-to-noise ratio and segmental signal-to-noise ratio, sound spectrograms, and subjective listening tests.
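The two unweighted objective measures named above can be illustrated with a minimal sketch. It uses the common textbook definitions of SNR and segmental SNR, which may differ in detail from those used in the paper. The function names, the frame length of 128 samples, and the choice to skip frames with zero signal or zero noise energy are all assumptions made for illustration.

```python
import math


def snr_db(reference, decoded):
    """Overall SNR in dB: total signal energy over total error energy."""
    signal = sum(x * x for x in reference)
    noise = sum((x - y) ** 2 for x, y in zip(reference, decoded))
    if noise == 0:
        return math.inf  # perfect reconstruction
    return 10.0 * math.log10(signal / noise)


def segmental_snr_db(reference, decoded, frame_len=128):
    """Segmental SNR in dB: the mean of per-frame SNR values.

    frame_len is a hypothetical default. Frames with zero signal or
    zero error energy are skipped (an assumption of this sketch) so
    that the logarithm is always defined.
    """
    frame_values = []
    for start in range(0, len(reference) - frame_len + 1, frame_len):
        ref = reference[start:start + frame_len]
        dec = decoded[start:start + frame_len]
        signal = sum(x * x for x in ref)
        noise = sum((x - y) ** 2 for x, y in zip(ref, dec))
        if signal > 0 and noise > 0:
            frame_values.append(10.0 * math.log10(signal / noise))
    if not frame_values:
        raise ValueError("no usable frames")
    return sum(frame_values) / len(frame_values)


# Toy signal: a quiet frame followed by a loud frame, same error in both.
reference = [1.0] * 4 + [10.0] * 4
decoded = [x + 0.1 for x in reference]
print(snr_db(reference, decoded))
print(segmental_snr_db(reference, decoded, frame_len=4))
```

Because segmental SNR averages per-frame values in dB, a low-energy frame counts as much as a high-energy one, whereas the overall SNR is dominated by the loud frame. This is one reason the two measures are commonly reported together for speech.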
