Spectrogram consistency and its application to phase reconstruction

In this article, we derive the constraints which a set of complex numbers must verify to be a consistent STFT spectrogram, i.e., to be the STFT spectrogram of an actual real-valued signal, and describe how they lead to an objective function measuring the consistency of a set of complex numbers as a spectrogram. We then present a exible phase reconstruction algorithm based on a local approximation of the consistency constraints and derive a real-time time-scale modication algorithm.

[1]  Paris Smaragdis,et al.  Mitsubishi Electric Research Laboratories , 1994 .

[2]  Hirokazu Kameoka,et al.  Single Channel Speech and Background Segregation Through Harmonic-Temporal Clustering , 2007, 2007 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics.

[3]  Jae Lim,et al.  Signal estimation from modified short-time Fourier transform , 1984 .

[4]  Jan O. Borchers,et al.  PhaVoRIT: A Phase Vocoder for Real-Time Interactive Time-Stretching , 2006, ICMC.

[5]  Jean Laroche,et al.  Improved phase vocoder time-scale modification of audio , 1999, IEEE Trans. Speech Audio Process..

[6]  Morten Mørup,et al.  Nonnegative Matrix Factor 2-D Deconvolution for Blind Single Channel Source Separation , 2006, ICA.

[7]  Mark Dolson,et al.  The Phase Vocoder: A Tutorial , 1986 .

[8]  S. Boll,et al.  Suppression of acoustic noise in speech using spectral subtraction , 1979 .

[9]  J.B. Allen,et al.  A unified approach to short-time Fourier analysis and synthesis , 1977, Proceedings of the IEEE.

[10]  Eric Moulines,et al.  Non-parametric techniques for pitch-scale and time-scale modification of speech , 1995, Speech Commun..

[11]  Tuomas Virtanen,et al.  Monaural Sound Source Separation by Nonnegative Matrix Factorization With Temporal Continuity and Sparseness Criteria , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[12]  Miller S. Puckette,et al.  Phase-locked vocoder , 1995, Proceedings of 1995 Workshop on Applications of Signal Processing to Audio and Accoustics.

[13]  Hirokazu Kameoka,et al.  A Real-time Equalizer of Harmonic and Percussive Components in Music Signals , 2008, ISMIR.