MPEG Audio Compression Basics

MPEG Audio was the first international standard for high quality audio coding and it opened the doors to a variety of applications in the world of digital music. In this chapter we review the basic ideas and features behind the general purpose, perceptual audio coders specified in the MPEG-1 and MPEG-2 audio standards which include the MP3 and AAC formats. The widely successful MP3 and AAC coders represent some of the most remarkable achievements of the MPEG committee that highly influenced not only the technology but also largely enabled different ways of digital media consumption.

[1]  Henrique S. Malvar Lapped transforms for efficient transform/subband coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[2]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[3]  Bernd Edler Codierung von Audiosignalen mit überlappender Transformation und adaptiven Fensterfunktionen , 1989 .

[4]  Marina Bosi,et al.  Introduction to Digital Audio Coding and Standards , 2004, J. Electronic Imaging.

[5]  R. Hellman Asymmetry of masking between noise and tone , 1972 .

[6]  J. D. Johnston,et al.  Sum-difference stereo transform coding , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[7]  W. R. Th. ten Kate,et al.  Matrixing of bit rate reduced audio signals , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Ernst F Schroeder,et al.  Aspec-Adaptive Spectral Entropy Coding of High Quality Music Signals , 1991 .

[9]  Thomas Sporer,et al.  PEAQ - The ITU Standard for Objective Measurement of Perceived Audio Quality , 2000 .

[10]  Louis Dunn Fielder,et al.  AC-2 and AC-3: Low-Complexity Transform-Based Audio Coding , 1996 .

[11]  Joseph Rothweiler,et al.  Polyphase quadrature filters-A new subband coding technique , 1983, ICASSP.

[12]  Deepen Sinha,et al.  The Perceptual Audio Coder , 2009 .

[13]  Jürgen Herre,et al.  Extending the MPEG-4 AAC Codec by Perceptual Noise Substitution , 1998 .

[14]  Kristofer Kjörling,et al.  Spectral Band Replication, a Novel Approach in Audio Coding , 2002 .

[15]  Michel C. Lavoie,et al.  Subjective evaluation of state-of-the-art two-channel audio codecs , 1998 .

[16]  Gerhard Stoll ISO-MPEG-2 Audio: A Generic Standard for the Coding of Two-Channel and Multichannel Sound , 1996 .

[17]  John Princen,et al.  Subband/Transform coding using filter bank designs based on time domain aliasing cancellation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[18]  B. Atal,et al.  Optimizing digital speech coders by exploiting masking properties of the human ear , 1978 .

[19]  James David Johnston,et al.  Enhancing the Performance of Perceptual Audio Coders by Using Temporal Noise Shaping (TNS) , 1996 .

[20]  Raymond N. J. Veldhuis,et al.  Subband coding of stereophonic digital audio signals , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[21]  Louis Dunn Fielder,et al.  ISO/IEC MPEG-2 Advanced Audio Coding , 1997 .

[22]  B. Edler Aliasing reduction in sub-bands of cascaded filter banks with decimation , 1992 .

[23]  J. D. Johnston Estimation of perceptual entropy using noise masking criteria , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.