Waveform approximating residual audio coding with perceptual pre- and post-filtering

We investigate waveform approximating residual coding for a sinusoidal parametric audio coder at low bit rates. The residual coding is based on the well-known pre- and post-filtering method with lossless coding which features perceptual weighting for short time segments. We compare the incurred perceptual distortion from joint quantization of the residual and the sinusoids for different bit rates. In addition to that, we develop a transform coding scheme for the coefficients in the pre- and post-filters which must be send as side information between the encoder and decoder. Our investigations show that the combination of the sinusoidal subcoder and the pre- and post-filtering entails an overall lower perceptual distortion for low as well as high bit rates. Also, the developed transform coding scheme enables efficient coding of the side information at a low bit rate.

[1]  K. H. Barratt Digital Coding of Waveforms , 1985 .

[2]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[3]  Martin Vetterli,et al.  Optimal time segmentation for signal modeling and compression , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Mads Græsbøll Christensen,et al.  Efficient parametric coding of transients , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Heiko Purnhagen,et al.  HILN-the MPEG-4 parametric audio coding tools , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[6]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[7]  Bernd Edler,et al.  Audio coding using a psychoacoustic pre- and post-filter , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[8]  Bernd Edler,et al.  Parametric audio coding , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.

[9]  Bin Yu,et al.  Perceptual audio coding using adaptive pre- and post-filters and lossless compression , 2002, IEEE Trans. Speech Audio Process..

[10]  Biing-Hwang Juang,et al.  Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.

[11]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[12]  Rajiv Laroia,et al.  Efficient encoding of speech LSP parameters using the discrete cosine transformation , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[13]  William M. Hartmann,et al.  Psychoacoustics: Facts and Models , 2001 .

[14]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[15]  F. Itakura Line spectrum representation of linear predictor coefficients of speech signals , 1975 .

[16]  H. Strube Linear prediction on a warped frequency scale , 1980 .

[17]  Yair Shoham Vector predictive quantization of the spectral parameters for low rate speech coding , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  A. Gray,et al.  Distance measures for speech processing , 1976 .

[19]  Søren Holdt Jensen,et al.  Variable Dimension Trellis-Coded Quantization of Sinusoidal Parameters , 2008, IEEE Signal Processing Letters.

[20]  Bernd Edler,et al.  Parametric audio coding , 2000, WCC 2000 - ICCT 2000. 2000 International Conference on Communication Technology Proceedings (Cat. No.00EX420).

[21]  Richard Heusdens,et al.  A new psychoacoustical masking model for audio coding applications , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.