Resolution conversion of a local time-frequency region by convolution of a Gaussian and Gabor spectrum

The main purpose of music signal analysis is to extract two types of information: time information, such as onsets, and frequency information, such as pitch. The required time-frequency resolution therefore varies from one time-frequency region to another. However, the resolution of the short-time Fourier transform (STFT) is constant over the whole time-frequency plane, so a single STFT analysis cannot satisfy these requirements. The signal must instead be reanalyzed at different resolutions, which increases the computational cost. In this paper, we propose a method that converts the spectrum in any local time-frequency region to any resolution without performing reanalysis. The method exploits the property that the convolution of two Gaussians is again a Gaussian, with a different standard deviation. The proposed method reduces the computational cost to about 1/10 of that of reanalysis using the STFT.
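
The Gaussian property that the method relies on can be checked numerically. The following is a minimal sketch, not the proposed conversion itself: it convolves two sampled unit-area Gaussians and measures the standard deviation of the result, which should be close to sqrt(sigma1^2 + sigma2^2). The function names, the grid step `dx`, and the truncation `half_width` are illustrative assumptions and do not come from the paper.

```python
import math

def gaussian(x, sigma):
    """Unit-area Gaussian density with standard deviation sigma."""
    return math.exp(-x * x / (2.0 * sigma * sigma)) / (sigma * math.sqrt(2.0 * math.pi))

def convolve(a, b, dx):
    """Full discrete convolution of two sampled functions, scaled by the grid step."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj * dx
    return out

def std_of(samples, xs):
    """Standard deviation of a sampled non-negative function, treated as a density."""
    total = sum(samples)
    mean = sum(x * s for x, s in zip(xs, samples)) / total
    var = sum((x - mean) ** 2 * s for x, s in zip(xs, samples)) / total
    return math.sqrt(var)

def convolved_sigma(sigma1, sigma2, half_width=10.0, dx=0.05):
    """Measured standard deviation of the convolution of two Gaussians.

    half_width and dx are assumed values chosen for this sketch only.
    """
    n = int(round(half_width / dx))
    xs = [k * dx for k in range(-n, n + 1)]
    g1 = [gaussian(x, sigma1) for x in xs]
    g2 = [gaussian(x, sigma2) for x in xs]
    conv = convolve(g1, g2, dx)
    # The full convolution of two grids spanning [-n, n] spans [-2n, 2n].
    conv_xs = [k * dx for k in range(-2 * n, 2 * n + 1)]
    return std_of(conv, conv_xs)
```

For example, `convolved_sigma(1.0, 0.5)` should agree closely with `math.hypot(1.0, 0.5)`. Because the result is again a Gaussian, smoothing a Gaussian-windowed spectrum with a Gaussian kernel yields a spectrum with a different effective window width, which is the property the abstract refers to.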
