Convolutional effects in transform coding with TDAC: an optimal window

Perceptual coders have proven to be highly efficient in the context of audio or video applications involving bit rate reduction. However, this efficiency is strongly limited in very low bit rate coding conditions. This paper studies the multiplicative effects of quantization in the frequency domain, when an overlapped filter bank (TDAC, time domain aliasing cancellation) is used to shape the quantization noise in a perceptually optimal way. The associated circular convolution operation generates aliased components in the time domain that are examined and subjected to minimization. A closed form expression is suggested to approximate an optimal transform window offering a desired tradeoff between the reduction of the time artifacts produced by a coarse quantization and the reduction of the stop-band leakage, relative to other transform windows commonly used.

[1]  Karlheinz Brandenburg,et al.  The iso/mpeg-audio codec: A generic standard for coding of high quality digital audio , 1992 .

[2]  Ronald W. Schafer,et al.  Digital Processing of Speech Signals , 1978 .

[3]  John Princen,et al.  Subband/Transform coding using filter bank designs based on time domain aliasing cancellation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[4]  Lawrence R. Rabiner,et al.  On the design of optimum FIR low-pass filters with even impulse response duration , 1973 .

[5]  Ernst F Schroeder,et al.  Aspec-Adaptive Spectral Entropy Coding of High Quality Music Signals , 1991 .

[6]  John Princen,et al.  Analysis/Synthesis filter bank design based on time domain aliasing cancellation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[7]  R. Hellman Asymmetry of masking between noise and tone , 1972 .

[8]  L. Rabiner Linear program design of finite impulse response (FIR) digital filters , 1972 .

[9]  J. D. Johnston,et al.  Sum-difference stereo transform coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Robert J. Safranek,et al.  Signal compression based on models of human perception , 1993, Proc. IEEE.

[11]  E. Zwicker,et al.  Subdivision of the audible frequency range into critical bands , 1961 .

[12]  John W. Adams A new optimal window [signal processing] , 1991, IEEE Trans. Signal Process..

[13]  Aníbal Ferreira Optimizing high quality audio coding: advantages of full system observability , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[14]  J. McClellan,et al.  A unified approach to the design of optimum FIR linear-phase digital filters , 1973 .

[15]  P. Urcun,et al.  A MUSICAM source codec for digital audio broadcasting and storage , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[16]  Aníbal Ferreira Tonality Detection in Perceptual Coding of Audio , 1995 .

[17]  L. Rabiner,et al.  A computer program for designing optimum FIR linear phase digital filters , 1973 .

[18]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[19]  Kenzo Akagiri,et al.  ATRAC: Adaptive Transform Acoustic Coding for MiniDisc , 1992 .

[20]  T. W. Parks,et al.  Digital Filter Design , 1987 .