Cascaded trellis-based rate-distortion control algorithm for MPEG-4 advanced audio coding

In this paper, a few low-complexity and high-performance rate-distortion control algorithms for MPEG-4 Advanced Audio Coding (AAC) are proposed. One key element in producing good quality compressed audio particularly at medium and low rates is a high performance rate-distortion controller in the audio encoder. Although the trellis-based rate-distortion control algorithms previously proposed can achieve a praiseworthy performance, their computational complexity is extremely high. Therefore, for practical applications, it is very desirable to achieve a similar performance at a much lower complexity. Two types of techniques are proposed in this paper to reduce the computational burden of the trellis-based algorithms. One is splitting a very heavy calculation stage into two sequential steps with much less computation. The other is reducing the candidates in the trellis for parameter search. Together, when applicable, our approach achieves a similar coding performance (audio quality) but requires less than 1/1000 complexity in computation.

[1]  Methods for the subjective assessment of small impairments in audio systems , 2015 .

[2]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[3]  S.L. Regunathan,et al.  Trellis-based optimization of MPEG-4 advanced audio coding , 2000, 2000 IEEE Workshop on Speech Coding. Proceedings. Meeting the Challenges of the New Millennium (Cat. No.00EX421).

[4]  RECOMMENDATION ITU-R BS.1387-1 - Method for objective measurements of perceived audio quality , 2002 .

[5]  J. Herre,et al.  Overview of MPEG-4 audio and its applications in mobile communications , 2000, WCC 2000 - ICSP 2000. 2000 5th International Conference on Signal Processing Proceedings. 16th World Computer Congress 2000.

[6]  Information technology — Coding of audio-visual objects — Part 3 : Audio Technologies de l ' information — Codage des objets audiovisuels — Partie , 1999 .

[7]  Louis Dunn Fielder,et al.  ISO/IEC MPEG-2 Advanced Audio Coding , 1997 .

[8]  Khalid Sayood,et al.  Introduction to data compression (2nd ed.) , 2000 .

[9]  G. Blelloch Introduction to Data Compression * , 2022 .

[10]  S. L. Regunathan,et al.  Near-optimal selection of encoding parameters for audio coding , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[11]  Heiko Purnhagen,et al.  An Overview of MPEG-4 Audio Version 2 , 1999 .

[12]  Peter Kabal,et al.  Perceptual bit allocation for low rate coding of narrowband audio , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[13]  S. Geneva,et al.  Sound Quality Assessment Material: Recordings for Subjective Tests , 1988 .

[14]  S. Golomb Run-length encodings. , 1966 .

[15]  Solomon W. Golomb,et al.  Run-length encodings (Corresp.) , 1966, IEEE Trans. Inf. Theory.