Polyphone model training method, and speech synthesis method and device

The invention discloses a polyphone model training method for speech synthesis, and a speech synthesis method and device. The method comprises the following steps of processing a voice data set and a text set so as to generate a training corpus set, wherein the text set corresponds to the voice data set, and the training corpus set comprises texts and Pinyin sequences corresponding to the texts; extracting feature information of the texts; and training polyphone models according to the feature information and the Pinyin sequence. According to the polyphone model training method for speech synthesis, in a polyphone model training process, manual labeling on Pinyin of the texts is not required, a training period of the polyphone models is greatly shortened, meanwhile, the circumstance that the trained polyphone models are inaccurate due to wrong manual labeling is avoided, and accuracy of the trained polyphone models is improved.