Linear prediction using homomorphic deconvolution in the autocorrelation domain

The conventional model of the linear prediction analysis suffers from difficulties in estimating vocal tract characteristics of high-pitched speakers. This paper shows that for voiced speech the vocal tract characteristics can be estimated accurately by homomorphic deconvolution in the autocorrelation domain. The speech autocorrelation function used by linear prediction is actually an 'aliased' version of that of the vocal tract system impulse response. This aliasing occurs due to the periodic nature of voiced speech. By using cepstrum analysis, the effect of this periodicity is eliminated from the autocorrelation function which is also periodic with the same periodicity as speech itself. The formant frequencies estimated using the deconvolved autocorrelation sequences of the system impulse response are found to be accurate by more than an order of magnitude when compared with the conventional linear prediction. The accuracy of formant estimation is verified on synthetic vowels for a wide range of pitch periods. The validity of the proposed method is also examined by inspecting the estimated spectral envelopes of real speech spoken by a female child.

[1]  Thomas Quatieri,et al.  Discrete-Time Speech Signal Processing: Principles and Practice , 2001 .

[2]  Riichiro Mizoguchi,et al.  Analysis of speech signals of short pitch period by a sample-selective linear prediction , 1987, IEEE Trans. Acoust. Speech Signal Process..

[3]  J. Makhoul,et al.  Linear prediction: A tutorial review , 1975, Proceedings of the IEEE.

[4]  Alan V. Oppenheim,et al.  Predictive coding in a homomorphic vocoder , 1971 .

[5]  Chin-Hui Lee,et al.  On robust linear prediction of speech , 1988, IEEE Trans. Acoust. Speech Signal Process..

[6]  L. H. Anauer,et al.  Speech Analysis and Synthesis by Linear Prediction of the Speech Wave , 2000 .

[7]  Werner Verhelst,et al.  A new model for the short-time complex cepstrum of voiced speech , 1986, IEEE Trans. Acoust. Speech Signal Process..

[8]  D. Childers,et al.  Two-channel speech analysis , 1986, IEEE Trans. Acoust. Speech Signal Process..