A Comparison of the Effects of the Hamming Window and the Blackman Window in Time-Scaling and Pitch-Shifting Algorithms

Real-time pitch shifting is widely used in music production. Pitch-shifting techniques fall into two major types: time-domain methods and frequency-domain methods. Compared with time-domain methods, frequency-domain methods allow a larger shifting range, have a lower total computational cost, and offer more algorithmic flexibility. However, the use of the Fourier transform in frequency-domain processing inevitably introduces spectral leakage, which reduces the accuracy of the pitch shift. To suppress this side effect of the Fourier transform, window functions are applied to reduce the leakage. In practice, the Hamming window and the Blackman window are frequently used. In this paper, we compare the two window functions in terms of how well they suppress spectral leakage and in terms of their subjective performance and accuracy, using the traditional phase vocoder [1] as the basis. Experiments show that the Hamming window generally performs better than the Blackman window for pitch shifting.
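
The leakage behavior of the two windows can be inspected numerically. The following is a minimal sketch, not the paper's experimental procedure: it assumes NumPy, a frame length of 1024 samples, and a hypothetical helper named window_leakage_stats (with a hypothetical pad_factor parameter) that estimates each window's main-lobe width and peak side-lobe level from a zero-padded FFT.

```python
import numpy as np


def window_leakage_stats(window, pad_factor=64):
    """Estimate main-lobe width (in DFT bins) and peak side-lobe level (dB).

    Hypothetical helper for illustration only; not taken from the paper.
    """
    n = len(window)
    # Zero-pad so the window's spectrum is sampled finely between DFT bins.
    spectrum = np.abs(np.fft.rfft(window, n * pad_factor))
    # Normalize to the DC peak; the small offset avoids log10(0) at nulls.
    mag_db = 20.0 * np.log10(spectrum / spectrum[0] + 1e-12)
    # The first point where the magnitude starts rising again is the first
    # local minimum, i.e. the edge of the main lobe.
    edge = np.nonzero(np.diff(mag_db) > 0)[0][0]
    # Convert from padded-FFT samples to DFT bins of the unpadded frame,
    # and double it to cover both sides of the (symmetric) main lobe.
    main_lobe_bins = 2.0 * edge / pad_factor
    # Everything beyond the main-lobe edge is side-lobe energy (leakage).
    peak_sidelobe_db = mag_db[edge:].max()
    return main_lobe_bins, peak_sidelobe_db


frame_length = 1024  # assumed analysis frame length
for name, win in (("Hamming", np.hamming(frame_length)),
                  ("Blackman", np.blackman(frame_length))):
    width, sidelobe = window_leakage_stats(win)
    print(f"{name}: main lobe ~{width:.2f} bins, peak side lobe ~{sidelobe:.1f} dB")
```

The two numbers capture the usual trade-off: the Blackman window has lower side lobes (less leakage into distant bins) but a wider main lobe (coarser frequency resolution) than the Hamming window. Which matters more for a phase vocoder depends on the signal and the hop size, so this sketch does not by itself reproduce the paper's conclusion.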

[1]  Jean Laroche. Time and Pitch Scale Modification of Audio Signals, 2002.

[2]  Jean Laroche, et al. Improved phase vocoder time-scale modification of audio, 1999, IEEE Trans. Speech Audio Process.

[3]  Mark J. T. Smith, et al. Analysis-by-Synthesis/Overlap-Add Sinusoidal Modeling Applied to the Analysis and Synthesis of Musical Tones, 1992.

[4]  Xavier Serra, et al. A sound analysis/synthesis system based on a deterministic plus stochastic decomposition, 1990.

[5]  Thomas F. Quatieri, et al. Speech analysis/Synthesis based on a sinusoidal representation, 1986, IEEE Trans. Acoust. Speech Signal Process.

[6]  Unto K. Laine, et al. Splitting the unit delay [FIR/all pass filters design], 1996, IEEE Signal Process. Mag.

[7]  M. Portnoff, et al. Time-scale modification of speech based on short-time Fourier analysis, 1981.

[8]  Julius O. Smith, et al. Spectral modeling synthesis: A sound analysis/synthesis based on a deterministic plus stochastic decomposition, 1990.

[9]  Stephan Tassart, et al. Analytical approximations of fractional delays: Lagrange interpolators and allpass filters, 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[10]  Miller S. Puckette, et al. Phase-locked vocoder, 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Acoustics.

[11]  Logan Volkers, et al. Phase Vocoder, 2008.

[12]  Luís B. Almeida, et al. Variable-frequency synthesis: An improved harmonic coding scheme, 1984, ICASSP.

[13]  J.B. Allen, et al. A unified approach to short-time Fourier analysis and synthesis, 1977, Proceedings of the IEEE.