Effect of glottal pulse shape on the quality of natural vowels.

A pitch-synchronous analysis was carried out over the vowel portions of the CVC utterances HAYED, HOD, ItODE and the sentence FEW THIEVES ARE NEVER SENT TO THE JUG recorded by a male speaker. For every pitch period, the analysis provides formant frequencies and the waveform of the vocM-cord excitation. The excitation waveform was replaced by a simulated excitation waveform, with which the utterances were resynthesized. In Expt. I, sLx simulated waveforms with pulse shapes differing in the number and location of slope discontinuities were investigated. Listening tests indicated that simulated excitations with pulse shapes with a single slope discontinuity at closure are preferred. In Expt. II, simulated excitations with 16 combinations ofopening and closing times of a preferred pulse shape were investigated. Listening tests indicated that very small opening or dosing times, or opening times approximately equal to or less than closing times, are not preferred. In general, it was demonstrated that good-quality synthetic speech can be generated by using simple excitation waveforms specified undormly over an utterance. The use of tournament testing strategies for perceptual evaluation of speech •mples Ls also described.