ifferential Perceptual Audio Coding ethod with Reduce Bitrate Requirements

A new audio transform coding technique is proposed that reduces the bitrate requirements of the perceptual transform audio coders by utilizing the stationarity characteristics of the audio signals. The method detects the frames that have significant audible content and codes them in a way similar to conventional perceptual transform coders. However, when successive data frames are found to be similar to those sections, then their audible differences only are coded. An error analysis for the proposed method is presented and results from tests on diffemnt types of audio material are listed, indicating that an average of 30% in compression gain (over the conventional perceptual audio coders bitrate) can be achieved, with a small deterioration in the audio quality of the coded signal. The proposed method has the advantage of easy adaptation within the perceptual transform coders architecture and add only small computational overhead to these systems.

[1]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[2]  B. Paillard,et al.  PERCEVAL: Perceptual Evaluation of the Quality of Audio Signals , 1992 .

[3]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[4]  J. D. Johnston,et al.  Estimation of perceptual entropy using noise masking criteria , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[5]  James M. Kates,et al.  A time-domain digital cochlear model , 1991, IEEE Trans. Signal Process..

[6]  Ernst Eberlein,et al.  Advanced Audio Measurement System Using Psychoacoustic Properties , 1992 .

[7]  B. Atal,et al.  Optimizing digital speech coders by exploiting masking properties of the human ear , 1978 .

[8]  Henrique S. Malvar Lapped transforms for efficient transform/subband coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[9]  Pierre Duhamel,et al.  A fast algorithm for the implementation of filter banks based on 'time domain aliasing cancellation' , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[10]  Akihiko Sugiyama,et al.  A 128 kb/s Hi-Fi Audio CODEC Based on Adaptive Transform Coding with Adaptive Block Size MDCT , 1992, IEEE J. Sel. Areas Commun..

[11]  B. Paillard,et al.  A Study of Strategies for the Perceptual Coding of Audio Signals , 1991 .

[12]  Nikil Jayant,et al.  Signal Compression: Technology Targets and Research Directions , 1992, IEEE J. Sel. Areas Commun..

[13]  Gerhard Stoll,et al.  Bitrate Reduction of High Quality Audio Signals by Modeling the Ears Masking Thresholds , 1990 .

[14]  Sarto Morissette,et al.  Transparent Coding of a Monophonic Audio Signal at 100 Kb/s , 1992 .

[15]  Ernst F Schroeder,et al.  Aspec-Adaptive Spectral Entropy Coding of High Quality Music Signals , 1991 .

[16]  Dieter Seitzer,et al.  Low Bit-Rate Coding of High Quality Digital Audio: Algorithms and Evaluation of Quality , 1989 .

[17]  N. Jayant,et al.  Digital Coding of Waveforms: Principles and Applications to Speech and Video , 1990 .

[18]  Kenzo Akagiri,et al.  ATRAC: Adaptive Transform Acoustic Coding for MiniDisc , 1992 .

[19]  Ernst Eberlein,et al.  Evaluation of Concealment Techniques for Compressed Digital Audio , 1993 .

[20]  John Mourjopoulos,et al.  Audio Coding Based on Subjective Differences , 1993 .

[21]  Richard F. Lyon,et al.  An analog electronic cochlea , 1988, IEEE Trans. Acoust. Speech Signal Process..

[22]  Y. Mahieux,et al.  Transform coding of audio signals using correlation between successive transform blocks , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[23]  Joseph Rothweiler,et al.  Polyphase quadrature filters-A new subband coding technique , 1983, ICASSP.

[24]  R. Hellman Asymmetry of masking between noise and tone , 1972 .

[25]  John G. Beerends,et al.  A Perceptual Audio Quality Measure Based on a Psychoacoustic Sound Representation , 1992 .