Formant-frequency-based Mandarin single final vioce visualizing method
暂无分享,去创建一个
The invention provides a formant-frequency-based Mandarin single final voice visualizing method. The method comprises the following steps: framing and windowing an original voice signal; extracting short-time energy, formant frequency and fundamental tone frequency of each frame of signal; correcting errors of specific values of the formant frequency and the fundamental tone frequency by adoptinga median smoothing method; mapping different pronunciations into different color aspects by utilizing the formant frequency, and correcting; reflecting variation tendency of pronunciation time, energy and fundamental tone frequency in an image; and differentiating different Mandarin single final pronunciations by colors. The method is easy to implement by only extracting acoustic phonetic parameters of short-time energy, formant frequency and fundamental tone frequency of a voice signal; soft decision is introduced, each pronunciation is not subject to hard decision, but represented by different colors, and visualizing effect of different speakers on the same pronunciation is based on the principle of seeking common grounds while reserving differences, so that the decision on pronunciation more accords with subjective perception of people.