High quality audio coding for mobile multimedia communications

Publisher Summary: This chapter highlights the growing importance of audio research to the fast-moving field of mobile multimedia. The main body presents results from two projects that examine high-quality, low bit-rate coding of audio, covering both music and speech. One approach uses wavelets, which offer advantages over polyphase filter banks and the discrete cosine transform (DCT) in MPEG music coding at low bit rates. The other approach uses the long-established linear predictive coding (LPC) technique in a modified guise, with orders significantly greater than 10, and applies least mean squares techniques to fit lines in time-frequency space to the line spectral pair (LSP) representation of the LPC coefficients. The first section introduces audio coding and processing in the broadest sense, as applicable to mobile multimedia. The chapter emphasizes that the designer of a complete mobile multimedia (MMM) system needs to take account of more than just the most effective coding technique. Following this, a new approach to speech coding is described and some experimental results are presented, with attention paid to the means by which high-quality, high-order LPC speech may be resynthesized. Finally, the principles of using wavelets in audio coding are covered, and these too are supported by results.
