MPEG-4 natural audio coding

MPEG-4 audio represents a new kind of audio coding standard. Unlike its predecessors, MPEG-1 and MPEG-2 high-quality audio coding, and unlike the speech coding standards which have been completed by the ITU-T, it describes not a single or small set of highly efficient compression schemes but a complete toolbox to do everything from low bit-rate speech coding to high-quality audio coding or music synthesis. The natural coding part within MPEG-4 audio describes traditional type speech and high-quality audio coding algorithms and their combination to enable new functionalities like scalability (hierarchical coding) across the boundaries of coding algorithms. This paper gives an overview of the basic algorithms and how they can be combined.

[1]  Andreas Johannes Gerrits,et al.  On scalability in CELP coding systems , 1997, 1997 IEEE Workshop on Speech Coding for Telecommunications Proceedings. Back to Basics: Attacking Fundamental Problems in Speech Coding.

[2]  Kazunori Ozawa,et al.  A bitrate and bandwidth scalable CELP coder , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[3]  Joseph P. Campbell,et al.  The Dod 4.8 Kbps Standard (Proposed Federal Standard 1016) , 1991 .

[4]  Kazunori Ozawa,et al.  M-LCELP Speech Coding at 4 kb/s with Multi-Mode and Multi-Codebook (Special Issue on Mobile Multimedia Communications) , 1994 .

[5]  Allen Gersho,et al.  Advances in speech and audio compression , 1994, Proc. IEEE.

[6]  Jürgen Herre,et al.  Extending the MPEG-4 AAC Codec by Perceptual Noise Substitution , 1998 .

[7]  N.S. Jayant High-quality coding of telephone speech and wideband audio , 1990, IEEE Communications Magazine.

[8]  P. Mabilleau,et al.  16 kbps wideband speech coding technique based on algebraic CELP , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[9]  Takehiro Moriya,et al.  AUDIO CODING USING TRANSFORM-DOMAIN WEIGHTED INTERLEAVE VECTOR QUANTIZATION (TWIN VQ) , 1998 .

[10]  S. Aign,et al.  Overview of the MPEG-4 Standard and Error Resilience Investigations , 1998 .

[11]  James David Johnston,et al.  Enhancing the Performance of Perceptual Audio Coders by Using Temporal Noise Shaping (TNS) , 1996 .

[12]  P. Mabilleau,et al.  Fast CELP coding based on algebraic codes , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[13]  K. Yoshida,et al.  A multi-mode variable rate speech coder for CDMA cellular systems , 1996, Proceedings of Vehicular Technology Conference - VTC.

[14]  Ed F. Deprettere,et al.  Regular-pulse excitation-A novel approach to effective and efficient multipulse coding of speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[15]  Louis Dunn Fielder,et al.  ISO/IEC MPEG-2 Advanced Audio Coding , 1997 .

[16]  Daniel Schulz Improving audio codecs by noise substitution , 1996 .