Perceptual Linear Predictive (PLP) Analysis-Resynthesis Technique

A common wisdom in speech re-synthesis is that while the vocal tract excitation can be modified to represent the message prosody, the accurate preservation of the formants is needed in order to ensure that both the linguistic message and the speaker-dependent information is well represented in the synthesized speech. Formants are speaker-dependent. A further decomposition of the formant-based speech representation into its message-bearing and the speaker-dependent parts and the inverse problem of combining those two sources of speech information is of interest. The current paper addresses this issues.

[1]  D. Broad,et al.  Formant estimation by linear trans-formation of the lpc cepstrum , 1989 .

[2]  Hynek Hermansky,et al.  The effective second formant F2' and the vocal tract front-cavity , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[3]  H Hermansky,et al.  Perceptual linear predictive (PLP) analysis of speech. , 1990, The Journal of the Acoustical Society of America.