Multi-sinusoid excitation model for audio coding

Most LPC-based audio coders model the signal spectrum as the product of an excitation spectrum and a system spectrum, and then quantize the excitation using either a stochastic codebook or multiple pulses. However, the near-white spectra of these excitations cannot precisely describe the harmonic characteristics of the excitation, especially for instrumental music. This paper explores the benefits of a sinusoidal representation of the excitation in the design of analysis-by-synthesis predictive coders. In addition, an efficient parameter extraction algorithm is developed to identify the parameters of the sinusoidal components. Simulation results indicate that the proposed multi-sinusoid excitation model allows the implementation of an LPC-based audio coder that delivers near toll quality at a rate of 92.61 kbps.
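The general idea of replacing a pulse or codebook excitation with a small set of sinusoids can be illustrated with a minimal sketch. The abstract does not specify the paper's parameter extraction algorithm, so everything below is an assumption made for illustration: the function names are hypothetical, the LPC coefficients come from the standard autocorrelation method with the Levinson-Durbin recursion, and the sinusoids are chosen by greedy DFT peak picking on the residual, restricted to bin-centred frequencies. A real analysis-by-synthesis coder would instead minimize a perceptually weighted error measured after the synthesis filter.

```python
import cmath
import math


def lpc(frame, order):
    """Autocorrelation method + Levinson-Durbin. Returns a[0..order], a[0] = 1."""
    n = len(frame)
    r = [sum(frame[i] * frame[i + k] for i in range(n - k)) for k in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0] + 1e-12  # small guard against an all-zero frame
    for m in range(1, order + 1):
        k = -sum(a[j] * r[m - j] for j in range(m)) / err  # reflection coefficient
        prev = a[:]
        for j in range(1, m + 1):
            a[j] = prev[j] + k * prev[m - j]
        err *= 1.0 - k * k
    return a


def inverse_filter(x, a):
    """Excitation (residual) e[n] = sum_j a[j] * x[n - j]."""
    return [sum(a[j] * x[n - j] for j in range(len(a)) if n - j >= 0)
            for n in range(len(x))]


def synthesis_filter(e, a):
    """All-pole system 1 / A(z) driven by the excitation e."""
    y = []
    for n in range(len(e)):
        y.append(e[n] - sum(a[j] * y[n - j] for j in range(1, len(a)) if n - j >= 0))
    return y


def dft_bin(x, k):
    n = len(x)
    return sum(x[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))


def extract_sinusoids(residual, count):
    """Greedy peak picking (assumed scheme): take the strongest DFT bin,
    subtract that sinusoid from the residual, and repeat."""
    n = len(residual)
    work = list(residual)
    params = []
    for _ in range(count):
        spectrum = {k: dft_bin(work, k) for k in range(1, n // 2)}
        k = max(spectrum, key=lambda b: abs(spectrum[b]))
        amp = 2.0 * abs(spectrum[k]) / n
        phase = cmath.phase(spectrum[k])
        params.append((k, amp, phase))
        for i in range(n):
            work[i] -= amp * math.cos(2.0 * math.pi * k * i / n + phase)
    return params


def sinusoidal_excitation(params, n):
    """Rebuild the excitation from (bin, amplitude, phase) triples."""
    return [sum(a * math.cos(2.0 * math.pi * k * i / n + p) for k, a, p in params)
            for i in range(n)]


if __name__ == "__main__":
    n = 64
    frame = [math.sin(0.3 * i) + 0.5 * math.sin(1.1 * i + 0.4)
             + 0.1 * math.sin(0.37 * i * i) for i in range(n)]
    a = lpc(frame, 4)
    residual = inverse_filter(frame, a)
    params = extract_sinusoids(residual, 4)
    decoded = synthesis_filter(sinusoidal_excitation(params, n), a)
    print(len(params), len(decoded))
```

In this sketch only the LPC coefficients and the (bin, amplitude, phase) triples would need to be quantized and transmitted; the decoder rebuilds the excitation with sinusoidal_excitation and passes it through synthesis_filter.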
