A new model for the short-time complex cepstrum of voiced speech

Traditionally, a very simple model for short-time homomorphic analysis has been used. It is shown that there is no theoretical justification for applying this model to voiced speech and that the model is of limited value for improving cepstral deconvolution procedures. Consequently, a more elaborate model is introduced in which the influence of window length is approximated and the spectral sampling inherent in voiced speech is explicitly represented. As a result, this new model shows that the vocal tract contribution to the complex cepstrum is repeated at every multiple of the pitch quefrency (n p ) and is multiplied by a double sinclike distortion (D(n)). It is shown that in order to achieve deconvolution with a low-time gating system, a cepstral lifter of length n p /2 should be used (instead of the usual length "less than n p "). Furthermore, the lifter should compensate for the distortion D(n). Unfortunately, the accuracy of straightforward homomorphic deconvolution approximations is limited by aliasing distortion which results from the repeated nature of the vocal tract contribution. Nevertheless, reasonable deconvolution approximations are obtained.