Formation method of prosody model with speech style control and apparatus of synthesizing text-to-speech using the same and method for
暂无分享,去创建一个
A device and a method for synthesizing voices are provided to realize various styles of voices with a voice database of a singular radio performer, thereby vividly expressing conversation voices. Levels of intimacy are defined(S10). Voices recording text constructed corresponding to each intimacy level are stored(S20). At least one of a sentence final intonation contour pattern, an intonation pattern of a primary intonation phrase in a sentence, and a pitch mean value of a sentence of each voice data is statistically modeled to extract a metrical characteristic according to each intimacy(S30). Rhythm models by intimacy levels are generated based on the extracted metrical characteristic(S40).