Perceptual linear predictive noise modelling for sinusoid-plus-noise audio coding

Sinusoidal coding of an audio signal subject to a bit-rate constraint generally leaves a noise-like residual signal. This residual is of high perceptual importance: reconstructing the audio from the sinusoidal representation alone typically sounds artificial. We present a new method, called perceptual linear predictive coding (PLPC), in which the residual is encoded by applying linear predictive coding (LPC) in the perceptual domain. The method minimizes a perceptual modelling error and therefore represents only those residual components that are perceptually relevant, while automatically discarding components masked by the sinusoidally coded part. Subjective listening tests show that PLPC performs significantly better than ordinary LPC as a sinusoidal residual coding technique. Furthermore, combining PLPC with a flexible segmentation and model-order allocation algorithm yields a significant gain in rate-distortion (R/D) performance for fragments with fast-changing characteristics.
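The idea of fitting a linear predictor in a perceptual domain can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes that "perceptual domain" can be approximated by dividing the residual power spectrum by a masking threshold, and that this threshold (the hypothetical argument `masking_threshold`, one value per FFT bin) is supplied by a psychoacoustic model of the sinusoidally coded part. The all-pole fit itself uses standard spectral linear prediction: the autocorrelation is obtained as the inverse FFT of the weighted power spectrum and the normal equations are solved with the Levinson-Durbin recursion. The function names are illustrative only.

```python
import numpy as np

def levinson_durbin(r, order):
    """Solve the autocorrelation normal equations.

    Returns (a, err): predictor polynomial a with a[0] == 1, so that
    e[n] = x[n] + sum_k a[k] * x[n - k], and the final prediction error power.
    """
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Correlation between the current prediction error and lag i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a, err

def perceptual_lpc(residual, masking_threshold, order):
    """Fit an all-pole model to the residual spectrum weighted by 1/threshold.

    ASSUMPTION: the perceptual weighting is a per-bin division by a masking
    threshold of length n_fft // 2 + 1; the paper may define it differently.
    """
    n_fft = 2 * (len(masking_threshold) - 1)
    power = np.abs(np.fft.rfft(residual, n_fft)) ** 2
    weighted = power / masking_threshold  # masked bins contribute little
    # Autocorrelation of the weighted spectrum (Wiener-Khinchin relation).
    r = np.fft.irfft(weighted, n_fft)[: order + 1]
    return levinson_durbin(r, order)

# Usage: with a flat threshold the weighting has no effect and the result
# reduces to ordinary autocorrelation-method LPC.
rng = np.random.default_rng(0)
residual = rng.standard_normal(512)
flat_threshold = np.ones(257)
a, err = perceptual_lpc(residual, flat_threshold, order=8)
```

Under this reading, bins where the threshold is high (strongly masked by the sinusoidal part) are de-emphasised before the fit, so the predictor spends its limited order on the audible parts of the residual. A decoder would also need the threshold, or an equivalent, to undo the weighting; that step is omitted here.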
