Perceptual Audio Coding of Speech Signals

Traditionally algorithms for speech coding exploit the features of speech signals by employing algorithmic models of the human vocal tract. More recently, the use of generic audio coders for coding of speech signals has gained increasing importance. Based on the properties of human hearing, such perceptual audio coders offer attractive properties including full-bandwidth audio output, increased naturalness, and good handling of any type of non-speech material. The chapter discusses the principles of perceptual audio coding, some relevant standards, and a number of perceptual audio coders that find application in speech and audio transmission and storage.

[1]  Jürgen Herre,et al.  Extending the MPEG-4 AAC Codec by Perceptual Noise Substitution , 1998 .

[2]  B. Moore An Introduction to the Psychology of Hearing , 1977 .

[3]  Joseph Rothweiler,et al.  Polyphase quadrature filters-A new subband coding technique , 1983, ICASSP.

[4]  Unto K. Laine,et al.  Backward adaptive warped lattice for wideband stereo coding , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[5]  Bernd Edler Codierung von Audiosignalen mit überlappender Transformation und adaptiven Fensterfunktionen , 1989 .

[6]  Kristofer Kjörling,et al.  Spectral Band Replication, a Novel Approach in Audio Coding , 2002 .

[7]  Roch Lefebvre,et al.  Universal speech/audio coding using hybrid ACELP/TCX techniques , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[8]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[9]  S.L.J.D.E. van de Par,et al.  Rate-distortion optimized hybrid sound coding , 2005 .

[10]  Hugo Fastl,et al.  Psychoacoustics: Facts and Models , 1990 .

[11]  W. Bastiaan Kleijn,et al.  Rate-distortion optimized quantization in multistage audio coding , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Ernst Eberlein,et al.  Comparison of filterbanks for high quality audio coding , 1992, [Proceedings] 1992 IEEE International Symposium on Circuits and Systems.

[13]  Jürgen Herre,et al.  Temporal Noise Shaping, Qualtization and Coding Methods in Perceptual Audio Coding: A Tutorial Introduction , 1999 .

[14]  Eric Allamanche,et al.  MPEG-4 Low Delay Audio Coding Based on the AAC Codec , 1999 .

[15]  Unto K. Laine,et al.  On the utilization of overshoot effects in low-delay audio coding , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[16]  Marina Bosi,et al.  Filter Banks in Perceptual Audio Coding , 1999 .

[17]  Roch Lefebvre,et al.  Extended AMR-WB for high-quality audio on mobile devices , 2006, IEEE Communications Magazine.

[18]  G. Schuller,et al.  Packet loss concealment in predictive audio coding , 2005, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005..

[19]  R. Hellman Asymmetry of masking between noise and tone , 1972 .

[20]  Bernd Edler,et al.  Audio coding using a psychoacoustic pre- and post-filter , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[21]  Yen-Chun Lin,et al.  A Low-Delay CELP Coder for the CCITT 16 kb/s Speech Coding Standard , 1992, IEEE J. Sel. Areas Commun..

[22]  Bin Yu,et al.  Perceptual audio coding using adaptive pre- and post-filters and lossless compression , 2002, IEEE Trans. Speech Audio Process..

[23]  Jeroen Breebaart,et al.  ADVANCES IN PARAMETRIC CODING FOR HIGH-QUALITY AUDIO , 2003 .

[24]  John Princen,et al.  Subband/Transform coding using filter bank designs based on time domain aliasing cancellation , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.