Truly nonstationary techniques for the analysis and display of voiced speech

The assumption of quasi-stationarity in the analysis of speech is questioned from the standpoint of obtaining the best resolution of global periodicity and vocal tract formant frequency locations. The generalized class of time-frequency representations is discussed from the standpoint of speech analysis and is integrated with a description of the spectrogram. The spectrogram is known to have minimal interference artifacts, and the truly nonstationary cone-kernel time-frequency representation (CK-TFR) is shown to be similarly free of interference. The CK-TFR is observed to give higher resolution of the time point of glottal closure than the spectrogram, and the clarity of some formant frequencies, especially for a nasal consonant, is shown to be better than in the spectrogram. It is shown that interference-free representations of speech with higher simultaneous resolution in time and frequency than the spectrogram are possible, and that these new representations may be applicable to better analysis and understanding of speech.
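To make the cone-kernel idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes one common discrete form of a cone-kernel representation: at each time sample, the lag products x(n+p+k) x*(n+p-k) are summed over the cone |p| <= |k|, weighted by a lag taper g(k), and Fourier transformed over the lag k. The function name `cone_kernel_tfr`, the Hann taper, and the parameters `max_lag` and `n_fft` are illustrative assumptions. Because the lag products step by two samples, FFT bin b corresponds to b / (2 * n_fft) cycles per sample.

```python
import numpy as np

def cone_kernel_tfr(x, max_lag=32, n_fft=128):
    """Sketch of a discrete cone-kernel time-frequency representation.

    Assumed (hypothetical) discrete form:
        r[n, k] = g[k] * sum_{p=-|k|}^{|k|} x[n+p+k] * conj(x[n+p-k])
        tfr[:, n] = FFT over k of r[n, k]
    Requires n_fft >= 2 * max_lag + 1 so the lags do not wrap onto each other.
    Returns an array of shape (n_fft, len(x)).
    """
    x = np.asarray(x, dtype=complex)
    n = len(x)
    lags = np.arange(-max_lag, max_lag + 1)
    # Symmetric, strictly positive lag taper g[k] (Hann without its zero endpoints).
    g = np.hanning(2 * max_lag + 3)[1:-1]
    # Zero-pad so that every index n + p +/- k is valid.
    xp = np.pad(x, 2 * max_lag)
    tfr = np.zeros((n_fft, n))
    for t in range(n):
        c = t + 2 * max_lag  # position of sample t inside the padded signal
        r = np.zeros(n_fft, dtype=complex)
        for k, gk in zip(lags, g):
            p = np.arange(-abs(k), abs(k) + 1)  # cone: |p| <= |k|
            r[k % n_fft] = gk * np.sum(xp[c + p + k] * np.conj(xp[c + p - k]))
        # r is Hermitian in k, so its transform is real up to rounding error.
        tfr[:, t] = np.fft.fft(r).real
    return tfr

# Usage: a complex tone at 0.125 cycles/sample and a single-sample click.
tone = np.exp(2j * np.pi * 0.125 * np.arange(256))
tone_tfr = cone_kernel_tfr(tone)
click = np.zeros(200)
click[100] = 1.0
click_tfr = cone_kernel_tfr(click)
```

In this sketch the click's energy appears only in the column of the sample where it occurs, because the cone restricts the time smoothing at lag k to |p| <= |k|. This finite time support is the property that relates to the sharper localization of glottal closure described above; a real-valued speech signal would normally be converted to its analytic form before being passed in.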
