Spectral representation of speech based on mel‐generalized cepstral coefficients and its properties

In the mel-generalized cepstral method of analysis, the spectrum model can be varied continuously from the all-pole type to the cepstral type. It is also possible to include human hearing characteristics. This article discusses the spectral representation by mel-generalized cepstral coefficients, aiming at application of the mel-generalized cepstral analysis to speech coding and analysis/synthesis. By the proposed spectral representation parameters, the stability condition for the synthesis filter can be clearly stated, and stability after quantization can easily be guaranteed. The distribution property and the spectral sensitivity of the proposed method are derived, and the quantization and interpolation performance is compared to that of LSP. As a result of subjective evaluation of the synthesized sound, it is shown that the quantization and interpolation performance of the proposed method is better than that of LSP. © 1999 Scripta Technica, Electron Comm Jpn Pt 3, 83(3): 50–59, 2000

[1]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[2]  H. Strube Linear prediction on a warped frequency scale , 1980 .

[3]  Yuval Bistritz,et al.  A discrete stability equation theorem and method of stable model reduction , 1982 .

[4]  H. Schussler,et al.  A stability theorem for discrete systems , 1976 .

[5]  Keiichi Tokuda,et al.  CELP coding based on mel-cepstral analysis , 1995, 1995 International Conference on Acoustics, Speech, and Signal Processing.

[6]  Peter Kabal,et al.  The computation of line spectral frequencies using Chebyshev polynomials , 1986, IEEE Trans. Acoust. Speech Signal Process..

[7]  Hans Werner Strube,et al.  Linear prediction on a warped frequency scale [speech processing] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[8]  Biing-Hwang Juang,et al.  Line spectrum pair (LSP) and speech data compression , 1984, ICASSP.