Analysis of the MPEG-1 Layer III (MP3) Algorithm Using MATLAB

The MPEG-1 Layer III (MP3) algorithm is one of the most successful audio formats for consumer audio storage and for transfer and playback of music on digital audio players. The MP3 compression standard along with the AAC (Advanced Audio Coding) algorithm are associated with the most successful music players of the last decade. This book describes the fundamentals and the MATLAB implementation details of the MP3 algorithm. Several of the tedious processes in MP3 are supported by demonstrations using MATLAB software. The book presents the theoretical concepts and algorithms used in the MP3 standard. The implementation details and simulations with MATLAB complement the theoretical principles. The extensive list of references enables the reader to perform a more detailed study on specific aspects of the algorithm and gain exposure to advancements in perceptual coding. Table of Contents: Introduction / Analysis Subband Filter Bank / Psychoacoustic Model II / MDCT / Bit Allocation, Quantiza ion and Coding / Decoder

[1]  B. Atal,et al.  Optimizing digital speech coders by exploiting masking properties of the human ear , 1978 .

[2]  W. Voessing,et al.  High Quality Digital Audio Encoding with 3.0 Bits/Sample Using Adaptive Transform Coding , 1986 .

[3]  Allen Gersho,et al.  High fidelity audio transform coding with vector quantization , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[4]  Mohamed A. Deriche,et al.  High quality audio coding using multipulse LPC and wavelet decomposition , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[5]  Deepen Sinha,et al.  Low bit rate transparent audio compression using adapted wavelets , 1993, IEEE Trans. Signal Process..

[6]  Ahmed H. Tewfik,et al.  Low bit rate high quality audio coding with combined harmonic and wavelet representations , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[7]  A. Spanias,et al.  Perceptual coding of digital audio , 2000, Proceedings of the IEEE.

[8]  Deepen Sinha,et al.  Audio compression at low bit rates using a signal adaptive switched filterbank , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.

[9]  Louis Dunn Fielder,et al.  AC-2: A Family of Low Complexity Transform-Based Music Coders , 1991 .

[10]  Jürgen Herre,et al.  IntMDCT - A link between perceptual and lossless audio coding , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[11]  G.C.P. Lokhoff Precision adaptive subband coding (PASC) for the digital compact cassette (DCC) , 1992 .

[12]  Eric Allamanche,et al.  MPEG-4 Low Delay Audio Coding Based on the AAC Codec , 1999 .

[13]  J. D. Johnston Perceptual transform coding of wideband stereo signals , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[14]  Schuyler Quackenbush,et al.  MPEG Audio Coding , 2002 .

[15]  Marina Bosi High Quality Multichannel Audio Coding: Trends and Challenges , 1999 .

[16]  P. P. Vaidyanathan,et al.  Cosine-modulated FIR filter banks satisfying perfect reconstruction , 1992, IEEE Trans. Signal Process..

[17]  Michael A. Gerzon,et al.  Lossless Coding for Audio Discs , 1996 .

[18]  Günther Theile,et al.  Low-Bit Rate Coding of High Quality Audio Signals , 1987 .

[19]  Marina Bosi,et al.  Aspects of current standardization activities for high-quality, low-rate multi-channel audio coding , 1993, Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[20]  Brian C. J. Moore Masking in the Human Auditory System , 1996 .

[21]  James David Johnston,et al.  Enhancing the Performance of Perceptual Audio Coders by Using Temporal Noise Shaping (TNS) , 1996 .

[22]  Davis Pan,et al.  A Tutorial on MPEG/Audio Compression , 1995, IEEE Multim..

[23]  Liang-Gee Chen,et al.  An MPEG audio decoder chip , 1995 .

[24]  Heiko Purnhagen,et al.  HILN-the MPEG-4 parametric audio coding tools , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[25]  Ernst Eberlein,et al.  Comparison of filterbanks for high quality audio coding , 1992, [Proceedings] 1992 IEEE International Symposium on Circuits and Systems.

[26]  J. Robert Stuart,et al.  The MLP Lossless Compression System , 1999 .

[27]  Henrique S. Malvar,et al.  Signal processing with lapped transforms , 1992 .

[28]  Joseph Rothweiler,et al.  Polyphase quadrature filters-A new subband coding technique , 1983, ICASSP.

[29]  Erwin Janssen,et al.  Super-audio CD: an introduction , 2003 .

[30]  Ernst Terhardt,et al.  Calculating virtual pitch , 1979, Hearing Research.

[31]  John Princen,et al.  Analysis/Synthesis filter bank design based on time domain aliasing cancellation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[32]  P. Chu Quadrature mirror filter design for an arbitrary number of equal bandwidth channels , 1985, IEEE Trans. Acoust. Speech Signal Process..

[33]  Mark J. T. Smith,et al.  Subband and Wavelet Transforms , 1996 .

[34]  Deepen Sinha,et al.  AT&T Perceptual Audio Coding (PAC) , 1996 .

[35]  Gerald Schuller,et al.  Ultra Low Delay Audio Coding with Constant Bit Rate , 2004 .

[36]  Gerhard Stoll,et al.  Bitrate Reduction of High Quality Audio Signals by Modeling the Ears Masking Thresholds , 1990 .

[37]  Ted Painter,et al.  Audio Signal Processing and Coding , 2007 .

[38]  Kenzo Akagiri,et al.  ATRAC: Adaptive Transform Acoustic Coding for MiniDisc , 1992 .

[39]  R. Hellman Asymmetry of masking between noise and tone , 1972 .

[40]  Abraham Lempel,et al.  A universal algorithm for sequential data compression , 1977, IEEE Trans. Inf. Theory.

[41]  James L. Flanagan,et al.  Digital coding of speech in sub-bands , 1976, The Bell System Technical Journal.

[42]  J. D. Johnston,et al.  Estimation of perceptual entropy using noise masking criteria , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[43]  Martin Link An Attack Processing of Audio Signals for Optimizing the Temporal Characteristics of a Low Bit-Rate Audio Coding System , 1993 .

[44]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[45]  James D. Johnston,et al.  Transform coding of audio signals using perceptual noise criteria , 1988, IEEE J. Sel. Areas Commun..

[46]  Joseph L. Hall,et al.  Auditory Psychophysics for Coding Applications , 1999 .

[47]  Karlheinz Brandenburg OCF--A new coding algorithm for high quality sound signals , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[48]  Gerald Schuller,et al.  Reduced Bit Rate Ultra Low Delay Audio Coding , 2006 .

[49]  Takehiro Moriya,et al.  High-quality audio-coding at less than 64 kbit/s by using transform-domain weighted interleave vector quantization (TwinVQ) , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[50]  L. Rabiner,et al.  Multirate digital signal processing: Prentice-Hall, Inc. Englewood Cliffs, New Jersey 07362, 1983, 411 pp., ISBN 0-13-605162-6 , 1983 .

[51]  P. P. Vaidyanathan,et al.  New results on cosine-modulated FIR filter banks satisfying perfect reconstruction , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[52]  Jürgen Herre,et al.  Perceptual Coding of High-Quality Digital Audio , 2013, Proceedings of the IEEE.

[53]  P. Vaidyanathan Multirate Systems And Filter Banks , 1992 .

[54]  Ian H. Witten,et al.  Arithmetic coding for data compression , 1987, CACM.

[55]  P. P. Vaidyanathan,et al.  Multirate digital filters, filter banks, polyphase networks, and applications: a tutorial , 1990, Proc. IEEE.

[56]  Davis Yen Pan Digital Audio Compression , 1993, Digit. Tech. J..

[57]  P. Urcun,et al.  A MUSICAM source codec for digital audio broadcasting and storage , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[58]  Michael A. Krasner Digital encoding of speech and audio signals based on the perceptual requirements of the auditory system , 1979 .

[59]  R. W. Schafer,et al.  Lossless compression of digital audio , 2001, IEEE Signal Process. Mag..

[60]  Mark Davis The AC-3 Multichannel Coder , 1993 .

[61]  A.R.D. Thornton,et al.  Foundations of Modern Auditory Theory , 1970 .

[62]  Louis Dunn Fielder,et al.  AC-2 and AC-3: Low-Complexity Transform-Based Audio Coding , 1996 .

[63]  T. Yoshida,et al.  The rewritable MiniDisc system , 1994, Proc. IEEE.

[64]  Ronald W. Schafer,et al.  AudioPaK-an integer arithmetic lossless audio codec , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).

[65]  Richard V. Cox,et al.  The design of uniformly and nonuniformly spaced pseudoquadrature mirror filters , 1986, IEEE Trans. Acoust. Speech Signal Process..

[66]  M. Kovac,et al.  A fast MPEG audio layer III software decoder , 1998, Proceedings IEEE Southeastcon '98 'Engineering for a New Era'.

[67]  S. Shlien,et al.  Guide to MPEG-1 audio standard , 1994, IEEE Trans. Broadcast..

[68]  Alan B. Bradley,et al.  Filter bank design based on time domain aliasing cancellation with non-identical windows , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[69]  John Princen,et al.  Audio coding with signal adaptive filterbanks , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[70]  Tomlinson Holman New Factors in Sound for Cinema and Television , 1991 .

[71]  B. Moore,et al.  Suggested formulae for calculating auditory-filter bandwidths and excitation patterns. , 1983, The Journal of the Acoustical Society of America.

[72]  Henrique S. Malvar Lapped transforms for efficient transform/subband coding , 1990, IEEE Trans. Acoust. Speech Signal Process..

[73]  Z. Picel,et al.  Flexible design of computationaly efficient nearly perfect QMF filter banks , 1985, ICASSP '85. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[74]  P. Noll,et al.  Wideband speech and audio coding , 1993, IEEE Communications Magazine.

[75]  P. Vaidyanathan Quadrature mirror filter banks, M-band extensions and perfect-reconstruction techniques , 1987, IEEE ASSP Magazine.

[76]  T.S. Perry,et al.  Consumer electronics , 1990, IEEE Spectrum.

[77]  K. Konstantinides,et al.  Fast subband filtering in MPEG audio coding , 1994, IEEE Signal Processing Letters.

[78]  Robert J. Safranek,et al.  Signal compression based on models of human perception , 1993, Proc. IEEE.

[79]  T. Ramstad,et al.  Cosine-modulated analysis-synthesis filterbank with critical sampling and perfect reconstruction , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[80]  Ralf Geiger,et al.  Audio Coding based on Integer Transforms , 2001 .

[81]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[82]  Marcus Purat,et al.  Lossless Transform Coding of Audio Signals , 1997 .

[83]  G. Maturi Single chip MPEG audio decoder , 1992 .

[84]  Fred Wylie apt-X100: Low-Delay, Low-Bit-Rate Subband ADPCM Digital : Audio Coding , 1996 .

[85]  B. Moore An Introduction to the Psychology of Hearing , 1977 .