Waveform Preserving Time Stretching and Pitch Shifting for Sinusoidal Models of Sound

A method for performing waveform invariant time stretching and pitch shifting on a quasi harmonic and sinusoidally modeled sound is presented. The method is based on the relative phase delay representation of the phase, defined as the difference between the phase delay of the partials and the phase delay of the fundamental. This representation makes the waveform characterization independent from the phase of the first partial. It is therefore possible to compute a smooth trajectory for the phase of the modified fundamental and rebuild the waveform on the synthesis frame boundaries by adding the relative phase delays to the new fundamental phase delay.

[1]  Mark J. T. Smith,et al.  Speech analysis/synthesis and modification using an analysis-by-synthesis/overlap-add sinusoidal model , 1997, IEEE Trans. Speech Audio Process..

[2]  Thomas F. Quatieri,et al.  Shape invariant time-scale and pitch modification of speech , 1992, IEEE Trans. Signal Process..

[3]  M.G. Bellanger,et al.  Digital processing of speech signals , 1980, Proceedings of the IEEE.

[4]  Barry M. G. Cheetham,et al.  Shape-invariant pitch and time-scale modification of speech by variable order phase interpolation , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[5]  Xavier Serra,et al.  Musical Sound Modeling with Sinusoids plus Noise , 1997 .

[6]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[7]  Mark Dolson,et al.  The Phase Vocoder: A Tutorial , 1986 .