Volterra adaptive prediction of speech with application to waveform coding

Recent studies have shown that the airflow in the vocal tract is highly unstable and oscillates between its walls. Therefore linear prediction speech analysis, which is based on laminar airflow hypothesis, leads to approximate representations. This paper deals with nonlinear speech modeling and its exploitation to high quality medium-rate coding. We first give evidence that the nonlinearities in speech can be described by a second-order finite memory Volterra operator. An algorithm for performing adaptive nonlinear prediction is described. Application of the algorithm to speech coding is then reported and stability and computational issues are discussed. Performance evaluations and comparisons with linear predictive speech coding are reported and show that improvements in coding performances can be obtained.

[1]  B. Townshend,et al.  Nonlinear prediction of speech , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[2]  G. Sicuranza Quadratic filters for signal processing , 1992, Proc. IEEE.

[3]  Petros Maragos,et al.  Energy separation in signal modulations with application to speech analysis , 1993, IEEE Trans. Signal Process..

[4]  K. A. Prabhu,et al.  A Predictor Switching Scheme for DPCM Coding of Video Signals , 1985, IEEE Trans. Commun..

[5]  S. D. Hansen,et al.  Non-linear short-term prediction in speech coding , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[6]  H. Teager Some observations on oral air flow during phonation , 1980 .

[7]  Giovanni Ramponi,et al.  Adaptive nonlinear prediction of TV image sequences , 1989 .

[8]  V. J. Mathews Adaptive polynomial filters , 1991, IEEE Signal Processing Magazine.

[9]  Enzo Mumolo,et al.  Adaptive predictive coding of speech by means of volterra predictors , 1993, IEEE Winter Workshop on Nonlinear Digital Signal Processing.

[10]  Junghsi Lee,et al.  A fast recursive least squares adaptive second order Volterra filter and its performance analysis , 1993, IEEE Trans. Signal Process..

[11]  Lizhong Wu,et al.  Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding , 1994, IEEE Trans. Speech Audio Process..