Estimation of the parameters of the quantitative intonation model with continuous wavelet analysis

Intonation generation in state-of-the-art speech synthesis requires the analysis of a large amount of data. Therefore reliable algorithms for the extraction of the parameters of an intonation model from a given F0 contour are required. This contribution proposes improvements concerning the extraction of the parameters of the quantitative intonation model developed by Fujisaki. The improvements are mainly based on the application of the continuous wavelet transform for the detection of accents and phrases in a F0 contour. A detailed explanation of the underlying idea of this approach is given and the implemented algorithm is described. Results prove that with the proposed method a significant improvement in the accuracy of the extracted parameters is achieved. Thereby the structure and the rules of the algorithm are kept relatively simple.

[1]  Sumio Ohno,et al.  A method for automatic extraction of parameters of the fundamental frequency contour , 2000, INTERSPEECH.

[2]  Hansjörg Mixdorff,et al.  A novel approach to the fully automatic extraction of Fujisaki model parameters , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).