Algorithms for computing the time-corrected instantaneous frequency (reassigned) spectrogram, with applications.

A modification of the spectrogram (the log magnitude of the short-time Fourier transform) that shows the instantaneous frequencies of signal components more accurately was first proposed in 1976 [Kodera et al., Phys. Earth Planet. Inter. 12, 142-150 (1976)]. It has been reconsidered or reinvented several times since but never widely adopted. This paper presents a unified theoretical picture of this time-frequency analysis method, the time-corrected instantaneous frequency spectrogram, together with detailed, implementable algorithms for three published techniques for its computation, which are compared with one another. The new representation is evaluated against the conventional spectrogram and shown to track signal components better. The disparate literature on these schemes has lacked a uniform framework for both the mathematics and the implementation details; this paper supplies one. Fruitful applications of the method are shown in speech phonation analysis, whale song pitch tracking, and additive sound modeling.
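To make the idea of "time-corrected instantaneous frequency" concrete, here is a minimal sketch of one widely published formulation, the window-derivative method usually attributed to Auger and Flandrin. The abstract does not name the three techniques the paper compares, so this should not be read as the paper's own algorithm. The function name `reassigned_spectrogram`, the Hann window, the parameters `n_win` and `hop`, and the finite-difference window derivative are all assumptions made for illustration.

```python
import numpy as np

def reassigned_spectrogram(x, fs, n_win=512, hop=128):
    """Illustrative sketch (hypothetical helper, not the paper's code).

    Returns frame-centre times, bin frequencies, STFT power, and the
    corrected time and frequency for every STFT cell.
    """
    n = np.arange(n_win)
    h = np.hanning(n_win)                       # analysis window (assumed Hann)
    t_rel = (n - (n_win - 1) / 2) / fs          # time relative to frame centre, s
    th = t_rel * h                              # time-weighted window
    dh = np.gradient(h) * fs                    # finite-difference window derivative, 1/s

    starts = np.arange(0, len(x) - n_win + 1, hop)
    frames = np.stack([x[s:s + n_win] for s in starts])

    # Three STFTs of the same frames, one per window.
    X_h = np.fft.rfft(frames * h, axis=1)
    X_th = np.fft.rfft(frames * th, axis=1)
    X_dh = np.fft.rfft(frames * dh, axis=1)

    power = np.abs(X_h) ** 2
    denom = np.maximum(power, np.finfo(float).tiny)   # avoid division by zero

    freqs = np.fft.rfftfreq(n_win, d=1 / fs)          # nominal bin frequencies, Hz
    times = (starts + (n_win - 1) / 2) / fs           # nominal frame centres, s

    # Corrected coordinates: each cell is moved to the local
    # instantaneous frequency and the local group-delay time.
    f_hat = freqs[None, :] - np.imag(X_dh * np.conj(X_h)) / denom / (2 * np.pi)
    t_hat = times[:, None] + np.real(X_th * np.conj(X_h)) / denom
    return times, freqs, power, t_hat, f_hat
```

A plot would place each cell's power (or its log magnitude, as in the conventional spectrogram) at `(t_hat, f_hat)` instead of at the nominal grid point `(times, freqs)`. For a steady sinusoid that falls between two bins, `f_hat` at the peak bin should land close to the true frequency rather than at the bin centre, which is the sense in which the representation tracks components more closely.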
