Low latency audio pitch shifting in the frequency domain

This paper presents a low latency pitch shifting algorithm based on the Short-Time Fourier Transform (STFT). Unlike existing STFT-based implementations of pitch shifting, the presented algorithm is more robust to reductions of the Fourier transform size. As a result, it achieves latencies as low as 12ms and still produces good quality, whereas other algorithms are performing much worse with similar low latency constraints. The presented algorithm also provides an alternate way of mitigating the well-known phasiness problem of the phase vocoder.

[1]  Alan B. Bradley,et al.  Filter bank design based on time domain aliasing cancellation with non-identical windows , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[2]  Julius O. Smith,et al.  A Sines+Transients+Noise Audio Representation for Data Compression and Time/Pitch Scale Modifications , 1998 .

[3]  David Dorran,et al.  Audio Time-Scale Modification , 2005 .

[4]  Stefan Müller Arisona,et al.  Real-Time, low Latency audio Processing in Java , 2007, ICMC.

[5]  Jan O. Borchers,et al.  PhaVoRIT: A Phase Vocoder for Real-Time Interactive Time-Stretching , 2006, ICMC.

[6]  Jean Laroche,et al.  New Phase-Vocoder Techniques are Real-Time Pitch Shifting, Chorusing, Harmonizing, and Other Exotic Audio Modifications , 1999 .

[7]  Jean Laroche,et al.  New phase-vocoder techniques for real-time pitch shifting , 1999 .

[8]  Stefan Müller Arisona,et al.  A Hybrid Time and Frequency Domain Audio Pitch Shifting Algorithm , 2008 .

[9]  Thomas F. Quatieri,et al.  Speech transformations based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[10]  Jean Laroche,et al.  Phase-vocoder: about this phasiness business , 1997, Proceedings of 1997 Workshop on Applications of Signal Processing to Audio and Acoustics.

[11]  Fabio Kon,et al.  The Quest for Low Latency , 2004, ICMC.

[12]  Jan O. Borchers,et al.  An Analysis of Startup and Dynamic Latency in phase vocoder-Based Time-stretching Algorithms , 2007, ICMC.

[13]  Dinesh K. Pai,et al.  JASS: A JAVA AUDIO SYNTHESIS SYSTEM FOR PROGRAMMERS , 2001 .

[14]  J. L. Flanagan,et al.  PHASE VOCODER , 2008 .

[15]  John ffitch,et al.  The sliding phase vocoder , 2007, ICMC.

[16]  Jens GULDEN Using the JACK Audio Connection Kit with Java , 2007 .

[17]  R. Meddis,et al.  A unitary model of pitch perception. , 1997, The Journal of the Acoustical Society of America.

[18]  Jean Laroche Autocorrelation method for high-quality time/pitch-scaling , 1993, Proceedings of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[19]  Richard G. Lyons,et al.  Understanding Digital Signal Processing , 1996 .

[20]  Xavier Rodet,et al.  Efficient Fourier Synthesis of Nonstationary Sinusoids , 1994, ICMC.

[21]  Udo Zoelzer,et al.  DAFX: Digital Audio Effects , 2011 .

[22]  Jean Laroche,et al.  Improved phase vocoder time-scale modification of audio , 1999, IEEE Trans. Speech Audio Process..

[23]  Michael Goodwin,et al.  Overlap-Add Synthesis of NonStationary Sinusoids , 1995, ICMC.

[24]  S. Goldsack,et al.  IN REAL-TIME , 2008 .

[25]  Miller S. Puckette,et al.  Phase-locked vocoder , 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics.

[26]  Xavier Serra,et al.  Digital Audio Effects , 2011 .

[27]  N. Juillerat,et al.  Low latency audio pitch shifting in the time domain , 2008, 2008 International Conference on Audio, Language and Image Processing.

[28]  Mark Sandler,et al.  Fast implementation for non-linear time-scaling of stereo signals , 2005 .