Minimum Generation Error Based Optimization of HMM Model Clustering for Speech Synthesis

To improve decision tree clustering and avoid over-training or under-training of the clustered models, a method is introduced that optimizes the minimum description length (MDL) factor using a minimum generation error criterion and cross-validation (CV). The CV-based generation error is calculated to optimize the scale of the decision tree. Results of both subjective and objective tests show that speech synthesized by the proposed method outperforms speech synthesized by the baseline system in both quality and naturalness.
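
The selection step summarized above can be illustrated with a minimal sketch. It is not the authors' implementation. It only shows the general idea of choosing an MDL factor by K-fold cross-validation: for each candidate factor, a model is built on the training folds and scored on the held-out fold, and the factor with the lowest mean held-out error is kept. All names here (`select_mdl_factor`, `toy_build_tree`, `toy_generation_error`) are hypothetical, and the toy piecewise-constant model merely stands in for HMM decision tree clustering and parameter generation, which the abstract does not specify in detail.

```python
from statistics import mean


def select_mdl_factor(factors, data, k, build_tree, generation_error):
    """Pick the factor with the lowest mean held-out error over k folds."""
    folds = [data[i::k] for i in range(k)]  # simple interleaved K-fold split
    scores = {}
    for factor in factors:
        errors = []
        for i, held_out in enumerate(folds):
            # Train on every fold except the held-out one.
            train = [s for j, fold in enumerate(folds) if j != i for s in fold]
            model = build_tree(train, factor)
            errors.append(generation_error(model, held_out))
        scores[factor] = mean(errors)
    best = min(scores, key=scores.get)
    return best, scores


# Toy stand-ins (assumptions for illustration only): a larger factor gives
# fewer leaves, mimicking a stronger MDL penalty that stops splitting earlier.
def toy_build_tree(train, factor):
    n_leaves = max(1, int(8 / factor))
    overall = mean(y for _, y in train)
    bins = [[] for _ in range(n_leaves)]
    for x, y in train:
        bins[min(int(x * n_leaves), n_leaves - 1)].append(y)
    # Empty leaves fall back to the overall training mean.
    return [mean(b) if b else overall for b in bins]


def toy_generation_error(leaf_means, held_out):
    """Mean squared error between held-out targets and the leaf predictions."""
    n_leaves = len(leaf_means)
    return mean(
        (y - leaf_means[min(int(x * n_leaves), n_leaves - 1)]) ** 2
        for x, y in held_out
    )


# Synthetic step-shaped data on [0, 1); x is the feature, y the target.
data = [(i / 40, 1.0 if i >= 20 else 0.0) for i in range(40)]
best, scores = select_mdl_factor(
    [0.5, 1.0, 2.0, 4.0, 16.0], data, 4, toy_build_tree, toy_generation_error
)
print(best, scores)
```

In this toy setting the largest factor collapses the model to a single leaf and gives a clearly higher held-out error than the smaller factors. In a real system, `build_tree` would run MDL-based clustering with the given factor and `generation_error` would compare generated parameter trajectories with those of the held-out natural speech.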