Low Bit Rate Transparent Audio Compression using

This paper descrihes a novel wavelet based audio synthesis and coding method. The method uses optimal adap- tive wavelet selection and wavelet coefficients quantization pro- cedures together with a dynamic dictionary approach. The adaptive wavelet transform selection and transform coefficient bit allocation procedures are designed to take advantage of the masking effect in human hearing. They minimize the number of hits required to represent each frame of audio material at a fixed distortion level. The dynamic dictionary greatly reduces statistical redundancies in the audio source. Experiments in- dicate that the proposed adaptive wavelet selection procedure by itself can achieve almost transparent coding of monophonic compact disk (CD) quality signals (sampled at 44.1 kHz) at bit rates of 64-70 kilobits per second (kb/s). The combined adap- tive wavelet selection and dynamic dictionary coding proce- dures achieve almost transparent coding of monophonic CD quality signals at bit rates of 48-66 kh/s.

[1]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[2]  Michael A. Krasner The critical band coder-Digital encoding of speech signals based on the perceptual requirements of the auditory system , 1980, ICASSP.

[3]  S. Singhal High quality audio coding using multipulse LPC , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[4]  L. Humes,et al.  Models of the additivity of masking. , 1989, The Journal of the Acoustical Society of America.

[5]  Allen Gersho,et al.  Adaptive vector quantization by progressive codevector replacement , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  R. Coifman,et al.  Fast wavelet transforms and numerical algorithms I , 1991 .

[7]  R. Gray,et al.  Vector quantization , 1984, IEEE ASSP Magazine.

[8]  I. Daubechies Orthonormal bases of compactly supported wavelets , 1988 .

[9]  R. Gray,et al.  Speech coding based upon vector quantization , 1980, ICASSP.

[10]  Raymond N. J. Veldhuis,et al.  Bit Rates in Audio Source Coding , 1992, IEEE J. Sel. Areas Commun..

[11]  Deepen Sinha,et al.  On the optimal choice of a wavelet for signal representation , 1992, IEEE Trans. Inf. Theory.

[12]  C. S. Burrus,et al.  Wavelet transforms and filter banks , 1993 .

[13]  Stéphane Mallat,et al.  Multifrequency channel decompositions of images and wavelet models , 1989, IEEE Trans. Acoust. Speech Signal Process..

[14]  G. Stoll,et al.  High quality audio bit-rate reduction system family for different applications , 1990, IEEE International Conference on Communications, Including Supercomm Technical Sessions.

[15]  Allen Gersho,et al.  High fidelity audio transform coding with vector quantization , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[16]  Karlheinz Brandenburg,et al.  The iso/mpeg-audio codec: A generic standard for coding of high quality digital audio , 1992 .

[17]  S. Chiba,et al.  Dynamic programming algorithm optimization for spoken word recognition , 1978 .

[18]  Marcel Breeuwer,et al.  Subband coding of digital audio signals , 1989 .

[19]  Peter No,et al.  Digital Coding of Waveforms , 1986 .

[20]  A. Grossmann,et al.  DECOMPOSITION OF HARDY FUNCTIONS INTO SQUARE INTEGRABLE WAVELETS OF CONSTANT SHAPE , 1984 .

[21]  Bishnu S. Atal,et al.  Predictive Coding of Speech at Low Bit Rates , 1982, IEEE Trans. Commun..

[22]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..

[23]  Gilbert Strang,et al.  Wavelets and Dilation Equations: A Brief Introduction , 1989, SIAM Rev..